hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-19 10:02:16 +00:00

Author	SHA1	Message	Date
Teknium	2b5268f716	revert: drop cumulative-resend tool-arg heuristic from shared streaming path (#35718) (#35860) PR #35718 added a per-slot "cumulative-resend" latch to the universal streaming tool-call accumulator to fix DeepSeek / Baidu Qianfan (#35592). The latch fires when a delta is a strict superset of the accumulated buffer (len(_new) > len(_prev) and _new.startswith(_prev)) and then REPLACES the buffer instead of appending. That superset test is not an unambiguous cumulative signature. A normal incremental stream can emit a single fragment that restates an already-accumulated prefix — trivially common in large code-patch arguments with repeated lines / indentation — which trips the latch and clobbers the accumulated buffer, corrupting the tool call. Observed in the wild on Anthropic Opus (the primary model) building a large patch: corrupted / short arguments → finish_reason='length' dead-end → session killed. A guessing heuristic that can silently clobber a tool-call buffer has no place on the path every provider and model shares. Reverting restores the known-good plain `+=` accumulator. The #35592 narrow provider bug should be re-addressed provider-gated so it is structurally impossible to touch Anthropic / OpenAI incremental streams, rather than via a heuristic on the shared path. Reverts `ca03486b6`.	2026-05-31 06:14:32 -07:00
Teknium	ca03486b6a	fix(streaming): stop duplicating tool-call args from cumulative-resend providers (#35718) DeepSeek / Baidu Qianfan stream tool-call arguments in cumulative mode: each chunk resends the full arguments-so-far instead of the new fragment. The stream accumulator blindly concatenated arg deltas with +=, turning that into '{...}{...}{...}', which failed json.loads and got nuked to '{}' — a silently corrupted tool call (#35592). Worse on multi-param tools (search_files, session_search, memory replace) because longer args take more chunks, giving more resend opportunities. - Per-slot cumulative latch in the stream accumulator: a delta that is a strict superset of the accumulated buffer marks the slot cumulative and replaces (not appends); exact duplicates are dropped only after latching. Incremental fragments are untouched (default += path). - Backstop _collapse_repeated_json_arguments() in the repair pipeline collapses pure identical-resend buffers (K exact repeats of a valid-JSON unit) for providers that resend the complete object from chunk 1. Only reached after json.loads already failed, so compliant single objects are never touched. Not a gateway or DeepSeek-model bug — any OpenAI-wire provider in cumulative streaming mode is affected.	2026-05-31 00:19:39 -07:00
kshitijk4poor	66827f8947	chore: prune unused imports and duplicate import redefinitions Remove unused imports (F401) and duplicate/shadowed import redefinitions (F811) across the codebase using ruff's safe autofixes. No behavioral changes -- imports only. - ~1400 safe autofixes applied across 644 files (net -1072 lines) - __init__.py re-exports preserved (excluded from F401 removal so public re-export surfaces stay intact) - Re-exports that are imported or monkeypatched by tests but look unused in their defining module are kept with explicit # noqa: F401 (gateway/run.py load_dotenv; run_agent re-exports from agent.message_sanitization, agent.context_compressor, agent.retry_utils, agent.prompt_builder, agent.process_bootstrap, agent.codex_responses_adapter) - Unsafe F841 (unused-variable) fixes deliberately skipped -- those can change behavior when the RHS has side effects - ruff lints remain disabled in pyproject.toml (only PLW1514 is selected); this is a one-time cleanup, not a config change Verification: - python -m compileall: clean - pytest --collect-only: all 27161 tests collect (zero import errors) - core entry points import clean (run_agent, model_tools, cli, toolsets, hermes_state, batch_runner, gateway) - static scan: every name any test imports directly from an edited module still resolves	2026-05-28 22:26:25 -07:00
Teknium	2d444fc84d	fix(run_agent): handle unescaped control chars in tool_call arguments (#15356) Extends _repair_tool_call_arguments() to cover the most common local-model JSON corruption pattern: llama.cpp/Ollama backends emit literal tabs and newlines inside JSON string values (memory save summaries, file contents, etc.). Previously fell through to '{}' replacement, losing the call. Adds two repair passes: - Pass 0: json.loads(strict=False) + re-serialise to canonical wire form - Pass 4: escape 0x00-0x1F control chars inside string values, then retry Ports the core utility from #12068 / PR #12093 without the larger plumbing change (that PR also replaced json.loads at 8 call sites; current main's _repair_tool_call_arguments is already the single chokepoint, so the upgrade happens transparently for every existing caller). Credit: @truenorth-lj for the original utility design. 4 new regression tests covering literal newlines, tabs, re-serialisation to strict=True-valid output, and the trailing-comma + control-char combination case.	2026-04-24 15:06:41 -07:00
Teknium	9725b452a1	fix: extract _repair_tool_call_arguments helper, add tests, bound loop Follow-up for PR #12252 salvage: - Extract 75-line inline repair block to _repair_tool_call_arguments() module-level helper for testability and readability - Remove redundant 'import re as _re' (re already imported at line 33) - Bound the while-True excess-delimiter removal loop to 50 iterations - Add 17 tests covering all 6 repair stages - Add sirEven to AUTHOR_MAP in release.py	2026-04-20 05:12:55 -07:00
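
The trade-off between commits ca03486b6a and 2b5268f716 can be illustrated with a small sketch. This is not the repository's accumulator: `accumulate_plain`, `accumulate_with_latch`, and both sample streams are hypothetical names and data made up for illustration, and only the superset test itself (`len(_new) > len(_prev) and _new.startswith(_prev)`) is taken from the commit message. The incremental stream is deliberately tiny and contrived; the revert commit reports the real failure on large code-patch arguments.

```python
import json


def accumulate_plain(deltas):
    """Plain `+=` accumulator: every delta is treated as a new fragment."""
    buf = ""
    for delta in deltas:
        buf += delta
    return buf


def accumulate_with_latch(deltas):
    """Hypothetical re-creation of the reverted heuristic (an assumption,
    not the project's code): a delta that is a strict superset of the
    buffer latches the slot as cumulative and REPLACES the buffer."""
    buf = ""
    cumulative = False
    for delta in deltas:
        if len(delta) > len(buf) and delta.startswith(buf):
            cumulative = True
            buf = delta          # replace instead of append
        elif cumulative and delta == buf:
            continue             # exact duplicate, dropped only after latching
        else:
            buf += delta         # default incremental path
    return buf


def is_valid_json(text):
    try:
        json.loads(text)
        return True
    except json.JSONDecodeError:
        return False


# Cumulative provider: each chunk resends the full arguments so far.
cumulative_stream = ['{"a"', '{"a": 1', '{"a": 1}']
# Incremental provider: the second fragment happens to begin with the
# text accumulated so far, so it looks like a superset of the buffer.
incremental_stream = ['{"', '{": 1}']

for name, stream in [("cumulative", cumulative_stream),
                     ("incremental", incremental_stream)]:
    plain = accumulate_plain(stream)
    latched = accumulate_with_latch(stream)
    print(name, "plain ok:", is_valid_json(plain),
          "latch ok:", is_valid_json(latched))
```

In this sketch the latch repairs the cumulative stream but corrupts the incremental one, while plain `+=` does the opposite. That is the ambiguity the revert describes: the superset test alone cannot tell the two kinds of stream apart, which is why the commit asks for a provider-gated fix instead.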

5 commits
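
The two control-character repair passes named in commit 2d444fc84d can be sketched as follows. This is a minimal illustration under assumptions, not the body of `_repair_tool_call_arguments()`: the helper names `loads_lenient` and `escape_control_chars` and the sample payload are hypothetical. The only library behaviour relied on is that `json.loads(..., strict=False)` accepts literal control characters inside strings.

```python
import json


def loads_lenient(raw):
    """Pass-0 style repair: parse with strict=False, then re-serialise so
    the result is valid under the default strict parser."""
    return json.dumps(json.loads(raw, strict=False))


def escape_control_chars(raw):
    """Pass-4 style repair: escape 0x00-0x1F characters that appear inside
    JSON string values, leaving everything outside strings untouched."""
    out = []
    in_string = False
    escaped = False
    for ch in raw:
        if not in_string:
            if ch == '"':
                in_string = True
            out.append(ch)
        elif escaped:
            # Character following a backslash: copy as-is.
            escaped = False
            out.append(ch)
        elif ch == "\\":
            escaped = True
            out.append(ch)
        elif ch == '"':
            in_string = False
            out.append(ch)
        elif ord(ch) < 0x20:
            out.append(f"\\u{ord(ch):04x}")
        else:
            out.append(ch)
    return "".join(out)


# Literal newline and tab inside a string value, as a local backend might emit.
raw = '{"summary": "line one\nline two\tend"}'

try:
    json.loads(raw)
    strict_ok = True
except json.JSONDecodeError:
    strict_ok = False

print("strict parse ok:", strict_ok)
print("pass 0:", loads_lenient(raw))
print("pass 4:", escape_control_chars(raw))
```

Both helpers recover the same arguments here. A separate escaping pass is still useful when the lenient parse fails for another reason, such as the trailing-comma plus control-character combination mentioned in the commit, because the escaped text can then be handed to other repair stages.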