hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-23 10:42:00 +00:00

Teknium 2a285d5ec2 fix(agent): stateful streaming scrubber for reasoning-block leaks (#17924) (#20184)

* revert(gateway): remove stale-code self-check and auto-restart

Removes the _detect_stale_code / _trigger_stale_code_restart mechanism introduced in #17648 and iterated in #19740. On every incoming message the gateway compared the boot-time git HEAD SHA to the current SHA on disk, and if they differed it would reply with

"Gateway code was updated in the background -- restarting this gateway so your next message runs on the new code. Please retry in a moment."

and then kick off a graceful restart.

This is unwanted behaviour: users who run a long-lived gateway and do their own ad-hoc git operations on the checkout end up with their chat interrupted and the current message dropped every time HEAD moves, with no way to opt out. If an operator really needs the old protection against stale sys.modules after "hermes update", the SIGKILL-survivor sweep in hermes update (hermes_cli/main.py, also tagged #17648) already handles the supervisor-respawn case on its own.

Removed:

gateway/run.py:
- _STALE_CODE_SENTINELS, _GIT_SHA_CACHE_TTL_SECS
- _read_git_head_sha(), _compute_repo_mtime() module helpers
- class-level _boot_wall_time / _boot_repo_mtime / _boot_git_sha / _stale_code_restart_triggered defaults
- __init__ boot-snapshot block (_boot_, _cached_current_sha, _repo_root_for_staleness, _stale_code_notified)
- _current_git_sha_cached(), _detect_stale_code(), _trigger_stale_code_restart() methods
- stale-code check + user-facing restart notice at the top of _handle_message()

tests/gateway/test_stale_code_self_check.py (deleted, 412 lines)

No new logic added. Zero remaining references to any removed symbol. Gateway test suite passes the same 4589 tests it passed before; the 3 pre-existing unrelated failures (discord free-channel, feishu bot admission, teams typing) are unchanged by this commit.

* fix(agent): stateful streaming scrubber for reasoning-block leaks (#17924)

Per-delta _strip_think_blocks ran at _fire_stream_delta and destroyed downstream state. When MiniMax-M2.7 / DeepSeek / Qwen3 streamed a tag split across deltas (delta1='<think>', delta2='Let me check'), the regex case-2 match erased delta1 entirely, so CLI/gateway state machines never learned a block was open and leaked delta2 as content. Raw consumers (ACP, api_server, TTS) had no downstream defense at all.

Replace the per-delta regex with a stateful StreamingThinkScrubber that survives delta boundaries:
- Closed <tag>X</tag> pairs always stripped (matches _strip_think_blocks case 1).
- Unterminated open at block boundary enters a block; content discarded until close tag arrives. At end-of-stream, held content is dropped.
- Orphan close tags stripped without boundary gating.
- Partial tags at delta boundaries held back until resolved.
- Block-boundary rule (start-of-stream, after \n, or whitespace-only since last \n) preserves prose that mentions tag names.

Reset at turn start alongside the existing context scrubber; flush at turn end so a benign '<' held back at end-of-stream reaches the UI.

E2E-verified on live OpenRouter->MiniMax-m2 streams: closed pairs strip cleanly, first word of post-block content is preserved, pure content passes through unchanged. Stefan's screenshot case (#17924) — 'Let me check' getting chopped to ' me check' — no longer happens.

Final _strip_think_blocks calls on completed strings (final_response, replay, compression) are preserved; only the streaming per-delta call site switched to the scrubber.

2026-05-05 04:33:38 -07:00
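The scrubber rules in the commit message above can be illustrated with a minimal sketch. This is not the repository's StreamingThinkScrubber: the class name ThinkScrubberSketch and its feed/flush/reset methods are hypothetical, only the single tag name "think" is handled, and the "closed pairs always stripped" rule is simplified so that an open tag only starts a block at a block boundary. It is meant only to show how state can survive delta boundaries.

```python
def _partial_suffix(text: str, tag: str) -> str:
    """Longest suffix of text that is a proper prefix of tag ('' if none)."""
    for k in range(min(len(tag) - 1, len(text)), 0, -1):
        if tag.startswith(text[-k:]):
            return text[-k:]
    return ""


class ThinkScrubberSketch:
    """Hypothetical, simplified stateful scrubber for one tag name."""

    OPEN = "<think>"
    CLOSE = "</think>"

    def __init__(self) -> None:
        self.reset()

    def reset(self) -> None:
        """Call at turn start."""
        self._in_block = False    # currently inside an open reasoning block
        self._held = ""           # possible partial tag carried to the next delta
        self._at_boundary = True  # only whitespace emitted since the last newline

    def feed(self, delta: str) -> str:
        """Consume one streamed delta and return the text safe to show."""
        buf = self._held + delta
        self._held = ""
        out = []
        i = 0
        while i < len(buf):
            if self._in_block:
                j = buf.find(self.CLOSE, i)
                if j == -1:
                    # Discard block content, but keep a tail that may be
                    # the beginning of the close tag.
                    self._held = _partial_suffix(buf[i:], self.CLOSE)
                    return "".join(out)
                i = j + len(self.CLOSE)
                self._in_block = False
                continue
            ch = buf[i]
            if ch == "<":
                rest = buf[i:]
                if rest.startswith(self.CLOSE):
                    i += len(self.CLOSE)  # orphan close: strip, no gating
                    continue
                if self._at_boundary and rest.startswith(self.OPEN):
                    i += len(self.OPEN)   # boundary-gated open: enter block
                    self._in_block = True
                    continue
                if self.CLOSE.startswith(rest) or (
                    self._at_boundary and self.OPEN.startswith(rest)
                ):
                    self._held = rest     # partial tag: hold until resolved
                    break
            out.append(ch)
            if ch == "\n":
                self._at_boundary = True
            elif not ch.isspace():
                self._at_boundary = False
            i += 1
        return "".join(out)

    def flush(self) -> str:
        """Call at turn end: drop an unterminated block, release a benign '<'."""
        held, self._held = self._held, ""
        if self._in_block:
            self._in_block = False
            return ""
        return held
```

In this sketch, a caller would invoke reset() at turn start, pass every delta through feed(), and append the result of flush() at turn end. Feeding '<think>' and then 'Let me check' as separate deltas yields no visible output, because the open-block state persists between calls.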
acp	fix(acp): compact Zed tool replay rendering	2026-05-03 01:44:23 -07:00
acp_adapter	fix(acp): run /steer as a regular prompt on idle sessions (#18258)	2026-04-30 22:45:14 -07:00
agent	fix(agent): stateful streaming scrubber for reasoning-block leaks (#17924) (#20184)	2026-05-05 04:33:38 -07:00
cli	fix(tui): respect voice.record_key config (supersedes #19028, #19339) (#19835)	2026-05-04 15:49:28 -07:00
cron	fix(cron): add concurrency regression test for parallel job state writes	2026-05-04 12:36:29 -07:00
e2e	fix(gateway): move quick-command dispatch before built-in handlers	2026-05-04 01:39:23 -07:00
environments/benchmarks	fix(security): consolidated security hardening — SSRF, timing attack, tar traversal, credential leakage (#5944)	2026-04-07 17:28:37 -07:00
fakes
gateway	fix(agent): stateful streaming scrubber for reasoning-block leaks (#17924) (#20184)	2026-05-05 04:33:38 -07:00
hermes_cli	fix: include default profile in kanban assignees	2026-05-05 04:25:05 -07:00
hermes_state	fix(resume): redirect --resume to the descendant that actually holds the messages	2026-04-24 03:04:42 -07:00
honcho_plugin	feat(honcho): explain why when honcho_profile returns an empty card	2026-04-27 12:37:33 -07:00
integration	fix(discord): strip RTP padding before DAVE/Opus decode (#11267)	2026-04-16 16:50:15 -07:00
openviking_plugin	fix(openviking): pre-check fs/stat to route file URIs before hitting directory-only endpoints	2026-04-30 02:35:29 -07:00
plugins	test(kanban): patch dashboard websocket token stub	2026-05-04 20:50:24 -07:00
run_agent	fix(run_agent): acquire lock in IterationBudget.used property	2026-05-04 12:37:28 -07:00
skills	fix(google-workspace): restore required_credential_files in SKILL.md (#16452)	2026-05-04 12:43:14 -07:00
stress	feat(kanban): durable multi-profile collaboration board (#17805)	2026-04-30 13:36:47 -07:00
tools	fix(tool-schemas): reactive strip of pattern/format on llama.cpp grammar 400s	2026-05-05 04:25:18 -07:00
tui_gateway	fix(tui_gateway): guard sys.path against local package shadowing (#15989)	2026-05-04 12:42:43 -07:00
website	fix(website): auto-wrap ASCII-art code blocks in generated skill pages (#16497)	2026-04-27 03:38:39 -07:00
__init__.py
conftest.py	fix(ci): stabilize main test suite regressions (#17660)	2026-04-29 23:18:55 -07:00
run_interrupt_test.py
test_account_usage.py	feat(account-usage): add per-provider account limits module	2026-04-21 01:56:35 -07:00
test_atomic_replace_symlinks.py	refactor: consolidate symlink-safe atomic replace into shared helper	2026-04-28 04:58:22 -07:00
test_base_url_hostname.py	security(runtime_provider): close OLLAMA_API_KEY substring-leak sweep miss (#13522)	2026-04-21 06:06:16 -07:00
test_batch_runner_checkpoint.py	test: regression coverage for checkpoint dedup and inf/nan coercion	2026-04-24 14:32:21 -07:00
test_cli_file_drop.py	fix(tui): improve macOS paste and shortcut parity	2026-04-21 08:00:00 -07:00
test_cli_manual_compress.py	test(cli): regression test for manual /compress system_message	2026-04-28 05:21:49 -07:00
test_cli_skin_integration.py	fix(ci): stabilize main test suite regressions (#17660)	2026-04-29 23:18:55 -07:00
test_ctx_halving_fix.py	fix(tests): fix 78 CI test failures and remove dead test (#9036)	2026-04-13 10:50:24 -07:00
test_empty_model_fallback.py	fix: fall back to provider's default model when model config is empty (#8303)	2026-04-12 03:53:30 -07:00
test_evidence_store.py
test_get_tool_definitions_cache_isolation.py	fix(tools): isolate get_tool_definitions quiet_mode cache + dedup LCM injection (#17335)	2026-04-30 04:32:06 -07:00
test_hermes_constants.py	fix(gateway): harden Docker/container gateway pathway	2026-04-12 16:36:11 -07:00
test_hermes_home_profile_warning.py	fix(constants): warn once when get_hermes_home() falls back under an active profile (#18746)	2026-05-02 01:49:55 -07:00
test_hermes_logging.py	fix(logging): attach gateway log after cli init	2026-04-26 19:01:26 -07:00
test_hermes_state.py	fix(telegram): polish topic mode — CASCADE, General-topic handling, rename guard, debounce	2026-05-04 12:07:17 -07:00
test_honcho_client_config.py
test_install_sh_setup_wizard_tty_probe.py	fix(install): widen /dev/tty open-probe to sibling gates (#16746)	2026-04-28 06:45:55 -07:00
test_ipv4_preference.py	feat: add network.force_ipv4 config to fix IPv6 timeout issues (#8196)	2026-04-11 23:12:11 -07:00
test_mcp_serve.py
test_mini_swe_runner.py	fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157)	2026-04-20 12:23:05 -07:00
test_minimax_model_validation.py	fix(models): validate MiniMax models against static catalog (#12611, #12460, #12399, #12547)	2026-04-19 22:44:47 -07:00
test_minimax_oauth.py	test(cli): cover minimax-oauth resolution, refresh, menu wiring	2026-04-29 09:53:42 -07:00
test_minisweagent_path.py
test_model_picker_scroll.py	fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974)	2026-04-07 17:59:42 -07:00
test_model_tools.py	fix(plugins): stop firing pre_tool_call hook twice per tool execution (#17611)	2026-04-29 12:43:39 -07:00
test_model_tools_async_bridge.py	fix(model_tools): cancel coroutine on timeout so worker thread exits + log full traceback	2026-04-29 05:00:40 -07:00
test_ollama_num_ctx.py	fix: provider/model resolution — salvage 4 PRs + MiniMax aux URL fix (#5983)	2026-04-07 22:23:28 -07:00
test_packaging_metadata.py
test_plugin_skills.py	fix(tests): attach caplog to specific logger in 3 order-dependent tests (#11453)	2026-04-17 00:20:40 -07:00
test_project_metadata.py	build(deps): add qrcode to dingtalk + feishu extras (parity with messaging) (#11627)	2026-04-17 13:31:53 -07:00
test_retry_utils.py	feat(agent): add jittered retry backoff	2026-04-08 00:41:36 -07:00
test_sql_injection.py
test_subprocess_home_isolation.py	fix: per-profile subprocess HOME isolation (#4426) (#7357)	2026-04-10 13:37:45 -07:00
test_timezone.py	test: speed up slow tests (backoff + subprocess + IMDS network) (#11797)	2026-04-17 14:21:22 -07:00
test_toolset_distributions.py
test_toolsets.py	feat(discord): split discord_server into discord + discord_admin tools	2026-04-25 04:50:14 -07:00
test_trajectory_compressor.py	fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157)	2026-04-20 12:23:05 -07:00
test_trajectory_compressor_async.py	fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157)	2026-04-20 12:23:05 -07:00
test_transform_tool_result_hook.py	test: stop testing mutable data — convert change-detectors to invariants (#13363)	2026-04-20 23:20:33 -07:00
test_tui_gateway_server.py	fix(tui): respect voice.record_key config (supersedes #19028, #19339) (#19835)	2026-05-04 15:49:28 -07:00
test_utils_truthy_values.py
test_yuanbao_integration.py	yuanbao platform (#16298)	2026-04-26 18:50:49 -07:00
test_yuanbao_markdown.py	yuanbao platform (#16298)	2026-04-26 18:50:49 -07:00
test_yuanbao_pipeline.py	yuanbao platform (#16298)	2026-04-26 18:50:49 -07:00
test_yuanbao_proto.py	yuanbao platform (#16298)	2026-04-26 18:50:49 -07:00