
fix: strip <think> tokens from reasoning model output #617

Open
benjamin7007 wants to merge 1 commit into OpenBMB:main from benjamin7007:fix/strip-thinking-tokens

Conversation

@benjamin7007

Problem

Models with thinking/reasoning capabilities (DeepSeek-R1, MiniMax-M2.7, QwQ, Qwen3, etc.) include <think>...</think> blocks in their response content when used via OpenAI-compatible API endpoints. These internal reasoning tokens leak into:

  1. Agent output — downstream nodes receive the thinking tokens as part of their input
  2. Timeline content — execution logs show raw thinking blocks
  3. Final workflow result — end users see <think> tags in the output

Root Cause

OpenAIProvider._deserialize_chat_response() and _append_chat_response_output() pass raw content from model responses without filtering reasoning tokens.

Fix

Add _strip_thinking_tokens() classmethod to OpenAIProvider:

  • Uses the regex <think>.*?</think>\s* with re.DOTALL to strip thinking blocks
  • Fast path: skips the regex entirely if the <think> substring is not found (zero cost for non-thinking models)
  • Applied in both deserialization paths (_deserialize_chat_response and _append_chat_response_output)
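A minimal sketch of the approach described above. The class and method names (`OpenAIProvider`, `_strip_thinking_tokens`) come from this PR; the exact body below is an assumption based on the bullets, not the merged diff:

```python
import re

# Compiled once at module level; DOTALL lets .*? span newlines inside
# the <think>...</think> block, and \s* eats trailing whitespace.
_THINK_RE = re.compile(r"<think>.*?</think>\s*", re.DOTALL)


class OpenAIProvider:
    @classmethod
    def _strip_thinking_tokens(cls, content):
        # Fast path: a plain substring check avoids running the regex
        # at all for models that never emit thinking tokens.
        if not content or "<think>" not in content:
            return content
        return _THINK_RE.sub("", content)
```

The non-greedy `.*?` matters: with a greedy `.*`, a response containing two separate thinking blocks would lose everything between the first `<think>` and the last `</think>`, including legitimate content.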

Testing

Verified with MiniMax-M2.7 (thinking model) in a Writer→Reviewer workflow:

  • Before the fix: <think> blocks leaked into the Reviewer's input and the final output
  • After the fix: clean output with no thinking tokens visible

Notes

  • This is a minimal, targeted fix in the OpenAI provider only
  • The Gemini provider uses a different content structure (MessageBlock) and would need separate handling if Gemini models add thinking tokens
  • No existing tests were broken

Models with thinking/reasoning capabilities (DeepSeek-R1, MiniMax-M2.7,
QwQ, etc.) include <think>...</think> blocks in their response content.
These internal reasoning tokens leak into agent output and downstream
node inputs, corrupting the workflow.

Add _strip_thinking_tokens() classmethod to OpenAIProvider that filters
<think>...</think> blocks via regex. Applied in both:
- _deserialize_chat_response() (Message content)
- _append_chat_response_output() (timeline content)

The fix is zero-cost for models without thinking tokens (fast path
checks for '<think>' substring before regex).

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>