hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-27 11:22:03 +00:00

Author	SHA1	Message	Date
Brooklyn Nicholson	2d286a6d00	fix(agent): close tool-call sequence on all interrupt aborts, not just finalize_turn #48879 closed the tool-call sequence on interrupt inside finalize_turn so a /stop after a tool no longer persists a `tool` tail that the next user message turns into a `tool -> user` role-alternation violation (which strict providers like Gemini/Claude react to by hallucinating a continuation and ignoring prior context — what users see as "lost context after stop"). But the retry-wait, error-handling, and post-error retry-wait interrupt aborts in conversation_loop return early and never reach finalize_turn, so they still persisted and returned a raw `tool` tail. Interrupting during provider backoff/rate-limiting (common under heavy work) hit exactly this path. Extract the close into a shared close_interrupted_tool_sequence helper and apply it at every interrupt abort (finalize_turn + the three early returns) so the whole bug class is fixed, not just the one site.	2026-06-25 12:24:34 -05:00
Teknium	2b5268f716	revert: drop cumulative-resend tool-arg heuristic from shared streaming path (#35718 ) (#35860 ) PR #35718 added a per-slot "cumulative-resend" latch to the universal streaming tool-call accumulator to fix DeepSeek / Baidu Qianfan (#35592). The latch fires when a delta is a strict superset of the accumulated buffer (len(_new) > len(_prev) and _new.startswith(_prev)) and then REPLACES the buffer instead of appending. That superset test is not an unambiguous cumulative signature. A normal incremental stream can emit a single fragment that restates an already- accumulated prefix — trivially common in large code-patch arguments with repeated lines / indentation — which trips the latch and clobbers the accumulated buffer, corrupting the tool call. Observed in the wild on Anthropic Opus (the primary model) building a large patch: corrupted / short arguments → finish_reason='length' dead-end → session killed. A guessing heuristic that can silently clobber a tool-call buffer has no place on the path every provider and model shares. Reverting restores the known-good plain `+=` accumulator. The #35592 narrow provider bug should be re-addressed provider-gated so it is structurally impossible to touch Anthropic / OpenAI incremental streams, rather than via a heuristic on the shared path. Reverts `ca03486b6`.	2026-05-31 06:14:32 -07:00
Teknium	ca03486b6a	fix(streaming): stop duplicating tool-call args from cumulative-resend providers (#35718 ) DeepSeek / Baidu Qianfan stream tool-call arguments in cumulative mode: each chunk resends the full arguments-so-far instead of the new fragment. The stream accumulator blindly concatenated arg deltas with +=, turning that into '{...}{...}{...}', which failed json.loads and got nuked to '{}' — a silently corrupted tool call (#35592). Worse on multi-param tools (search_files, session_search, memory replace) because longer args take more chunks, giving more resend opportunities. - Per-slot cumulative latch in the stream accumulator: a delta that is a strict superset of the accumulated buffer marks the slot cumulative and replaces (not appends); exact duplicates are dropped only after latching. Incremental fragments are untouched (default += path). - Backstop _collapse_repeated_json_arguments() in the repair pipeline collapses pure identical-resend buffers (K exact repeats of a valid-JSON unit) for providers that resend the complete object from chunk 1. Only reached after json.loads already failed, so compliant single objects are never touched. Not a gateway or DeepSeek-model bug — any OpenAI-wire provider in cumulative streaming mode is affected.	2026-05-31 00:19:39 -07:00
teknium1	885d1242a2	refactor(run_agent): extract message sanitization to agent/message_sanitization.py Pull the 10 pure sanitization/repair helpers (\_sanitize_surrogates, \_sanitize_structure_surrogates, \_sanitize_messages_surrogates, \_escape_invalid_chars_in_json_strings, \_repair_tool_call_arguments, \_strip_non_ascii, \_sanitize_messages_non_ascii, \_sanitize_tools_non_ascii, \_strip_images_from_messages, \_sanitize_structure_non_ascii) and the \_SURROGATE_RE constant out of run_agent.py into a new module. These are stateless byte-walking helpers with no AIAgent dependency. Backward compatibility: run_agent re-exports every name via a single import block, so existing 'from run_agent import _sanitize_surrogates' imports in tests and cli.py keep working unchanged. Same pattern the file already uses for _summarize_user_message_for_log (codex_responses_adapter). run_agent.py: 16077 -> 15682 lines (-395).	2026-05-16 17:41:09 -07:00

4 commits