hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-14 14:12:44 +00:00

History

Ziliang Peng c3a09f7835 fix(background_review): propagate parent toolset config to keep tools[] cache-stable ## Summary The background skill/memory-review fork constructed a child `AIAgent` without propagating `enabled_toolsets` / `disabled_toolsets` from the parent. When the parent narrowed its toolset (via `hermes tools disable` or `config.yaml`), the fork's default `enabled_toolsets=None` expanded to "all registered tools" — and the fork's outbound request body sent a wider `tools[]` array than the parent's main-turn request. Anthropic's prompt-cache key includes the `tools[]` array byte-for-byte, so this divergence forked the cache lineage on every nudge and forced a full prefix rewrite. On a captured ~4 hour Claude-via-Hermes session this cost roughly 4.3 M cache-write tokens — about half of those attributable to the per-nudge alternation between the main turn's narrowed `tools[]` and the review fork's wider `tools[]`. ## Goal Extend the byte-stability invariant established by PR #17276 (which fixed `system`) to the `tools[]` slot of the request body, so the review fork's outbound request hits the parent's warmed Anthropic prefix cache regardless of how the parent's toolset is configured. ## Implementation Two-line change in `agent/background_review.py`: pass `enabled_toolsets=getattr(agent, "enabled_toolsets", None)` and the matching `disabled_toolsets` kwarg into the `AIAgent(...)` call inside `_spawn_background_review`. Adds an explanatory block comment that calls out the cache-key dependency and the relationship to PR #17276. The post-construction runtime whitelist (`set_thread_tool_whitelist({memory, skills})`) is untouched — it still gates which tools the model is allowed to dispatch. This change aligns only what the request body transmits, not what the review is allowed to do, so the safety contract from issue #15204 remains intact. ## Testing - `tests/run_agent/test_background_review_cache_parity.py`: new `test_review_fork_inherits_parent_toolset_config` asserts the parent's `enabled_toolsets` and `disabled_toolsets` reach the review-fork constructor as kwargs. - `tests/run_agent/test_background_review_toolset_restriction.py`: the existing `test_background_review_does_not_narrow_toolset_schema` was inverted (its old "must NOT pass enabled_toolsets" rule was built on the assumption that the parent always ran with the registry default — wrong in practice when the parent is narrowed). Renamed to `test_background_review_matches_parent_toolset_config` and updated to assert the parent's value propagates verbatim. - Verified the new positive test fails without the fix and passes with it. - Full suite for `test_background_review*`: ``` $ python -m pytest tests/run_agent/test_background_review.py \ tests/run_agent/test_background_review_summary.py \ tests/run_agent/test_background_review_toolset_restriction.py \ tests/run_agent/test_background_review_cache_parity.py -q 18 passed in 1.85s ``` ## Scope - `agent/background_review.py`: 2 added kwargs + explanatory comment. - Two test files: one new positive test, one inverted existing test. - No production code paths outside the review fork; no schema changes; no public-API changes. Refs: ziliangpeng/hermes-agent#1 (root-cause analysis with wire-level cache-write measurements). Extends PR #17276's `system`-bytes invariant to the `tools[]` slot.		2026-05-21 12:49:21 +05:30
..
lsp	chore: ruff auto-fix PLR6201 resweep — tuple → set in membership tests (#27355 )	2026-05-17 02:29:41 -07:00
transports	fix(xai): restore encrypted reasoning replay across turns	2026-05-20 23:12:45 -07:00
__init__.py	Refactor Terminal and AIAgent cleanup	2026-02-21 22:31:43 -08:00
account_usage.py	chore: ruff auto-fix PLR6201 — tuple → set in membership tests (#23937 )	2026-05-11 11:13:25 -07:00
agent_init.py	feat(sessions): opt-in per-session JSON snapshot writer	2026-05-20 11:44:10 -07:00
agent_runtime_helpers.py	fix(gateway): harden kanban and provider cleanup races	2026-05-20 14:31:22 -07:00
anthropic_adapter.py	feat(azure-foundry): add Microsoft Entra ID auth	2026-05-18 10:14:38 -07:00
async_utils.py	fix(async): close unscheduled coroutines in all threadsafe bridges (#26584 )	2026-05-15 14:00:01 -07:00
auxiliary_client.py	fix(xai-oauth): pin inference base_url to x.ai origin (#28952 )	2026-05-19 14:51:21 -07:00
azure_identity_adapter.py	feat(azure-foundry): add Microsoft Entra ID auth	2026-05-18 10:14:38 -07:00
background_review.py	fix(background_review): propagate parent toolset config to keep tools[] cache-stable	2026-05-21 12:49:21 +05:30
bedrock_adapter.py	chore(deps): lazy-install boto3/botocore for bedrock adapter	2026-05-17 02:31:18 -07:00
browser_provider.py	fix(browser): self-review pass — dead-import, log levels, future-proofing	2026-05-17 04:04:15 -07:00
browser_registry.py	fix(browser): self-review pass — dead-import, log levels, future-proofing	2026-05-17 04:04:15 -07:00
chat_completion_helpers.py	fix(gateway): harden kanban and provider cleanup races	2026-05-20 14:31:22 -07:00
codex_responses_adapter.py	fix(xai): restore encrypted reasoning replay across turns	2026-05-20 23:12:45 -07:00
codex_runtime.py	fix(xai): surface provider 'error' SSE frame in Codex fallback stream (#27184 )	2026-05-16 23:41:09 -07:00
context_compressor.py	fix(compress): make abort-on-summary-failure opt-in via config flag (#28117 )	2026-05-18 10:28:20 -07:00
context_engine.py	fix(compression): keep default protect_first_n at 3 + align ABC	2026-05-13 22:25:16 -07:00
context_references.py	fix(agent): fall back when rg is blocked for @folder references	2026-04-20 01:56:41 -07:00
conversation_compression.py	refactor(session-log): drop branch/compress re-point of session_log_file	2026-05-20 11:44:10 -07:00
conversation_loop.py	refactor(session-log): delete _save_session_log and all callers	2026-05-20 11:44:10 -07:00
copilot_acp_client.py	fix: guard yaml.safe_load, flock unlock, TOCTOU races, and atomic writes	2026-05-19 00:12:41 -07:00
credential_pool.py	fix(codex-oauth): quarantine terminal refresh errors so dead tokens are not replayed across sessions	2026-05-18 10:31:40 -07:00
credential_sources.py	feat(xai-oauth): add xAI Grok OAuth (SuperGrok Subscription) provider	2026-05-15 12:11:32 -07:00
curator.py	feat(curator): hint at `hermes curator pin` in the rename block (#23212 )	2026-05-10 06:44:53 -07:00
curator_backup.py	fix(curator): authoritative absorbed_into on delete + restore cron skill links on rollback (#18671 ) (#18731 )	2026-05-02 01:29:57 -07:00
display.py	chore: remove Atropos RL environments and tinker-atropos integration (#26106 )	2026-05-15 10:36:38 +05:30
error_classifier.py	fix(error_classifier): classify xAI Grok entitlement SSE errors as auth	2026-05-18 10:24:13 -07:00
file_safety.py	security(file-safety): also write-deny <root>/.env when running under a profile (#15981 )	2026-05-20 23:37:37 -07:00
gemini_cloudcode_adapter.py	fix(agent/gemini-cloudcode): seed delta defaults for reasoning-only stream chunks	2026-05-14 08:03:56 -07:00
gemini_native_adapter.py	fix(auxiliary): evict async wrappers on poisoned client (follow-up to #23482 )	2026-05-11 11:13:20 -07:00
gemini_schema.py	chore: remove unused imports and dead locals (ruff F401, F841) (#17010 )	2026-04-28 06:46:45 -07:00
google_code_assist.py	chore: remove unused imports and dead locals (ruff F401, F841) (#17010 )	2026-04-28 06:46:45 -07:00
google_oauth.py	fix(security): guard os.chmod(parent) against / and top-level dirs	2026-05-20 22:56:55 -07:00
i18n.py	feat(i18n): localize all gateway commands + web dashboard, add 8 new locales (16 total) (#22914 )	2026-05-10 07:14:14 -07:00
image_gen_provider.py	feat(plugins): pluggable image_gen backends + OpenAI provider (#13799 )	2026-04-21 21:30:10 -07:00
image_gen_registry.py	fix(plugins): filter resolution by is_available() in web + image_gen registries	2026-05-13 22:31:28 -07:00
image_routing.py	fix(agent): consult supports_vision override in auto-mode routing	2026-05-20 23:27:10 -07:00
insights.py	Merge branch 'main' into feat/dashboard-skill-analytics	2026-04-20 05:25:49 -07:00
iteration_budget.py	refactor(run_agent): extract OpenAI proxy, safe stdio, IterationBudget	2026-05-16 17:59:32 -07:00
lmstudio_reasoning.py	feat(agent): add lmstudio integration	2026-04-28 12:27:36 -07:00
manual_compression_feedback.py	fix(compression): include system prompt + tool schemas in token estimates (#18265 )	2026-04-30 23:03:54 -07:00
markdown_tables.py	fix(cli): vertical fallback for markdown tables wider than terminal (#23948 )	2026-05-11 16:49:13 -07:00
memory_manager.py	🐛 fix(memory): require newline after context tag	2026-05-18 10:53:08 -07:00
memory_provider.py	docs(agent): remove stale BuiltinMemoryProvider references from memory module docstrings	2026-05-05 13:33:49 -07:00
message_sanitization.py	refactor(run_agent): extract message sanitization to agent/message_sanitization.py	2026-05-16 17:41:09 -07:00
model_metadata.py	fix(metadata): qwen3.6-plus has a 1M context window (#27008 )	2026-05-17 02:31:18 -07:00
models_dev.py	feat: add NovitaAI as LLM provider	2026-05-13 23:51:15 -07:00
moonshot_schema.py	fix(moonshot): strip $ref siblings and collapse tuple items in tool schemas (#27104 )	2026-05-16 13:02:19 -07:00
nous_rate_guard.py	codebase: add encoding='utf-8' to all bare open() calls (PLW1514)	2026-05-08 14:27:40 -07:00
onboarding.py	docs(onboarding): lead OpenClaw residue banner with migrate, warn that cleanup breaks OpenClaw (#17507 )	2026-04-29 08:08:36 -07:00
plugin_llm.py	feat(plugins): run any LLM call from inside a plugin via ctx.llm (#23194 )	2026-05-10 07:09:28 -07:00
portal_tags.py	feat(nous): unified client=hermes-client-v<version> tag on every Portal request (#24779 )	2026-05-12 20:49:20 -07:00
process_bootstrap.py	refactor(run_agent): extract OpenAI proxy, safe stdio, IterationBudget	2026-05-16 17:59:32 -07:00
prompt_builder.py	fix(kanban): stale reclaim must not tick failure counter (#28680 )	2026-05-19 03:15:18 -07:00
prompt_caching.py	fix(cache): kill long-lived prefix layout — system prompt is now byte-static within a session (#24778 )	2026-05-12 20:46:04 -07:00
rate_limit_tracker.py	refactor: remove dead code — 1,784 lines across 77 files (#9180 )	2026-04-13 16:32:04 -07:00
redact.py	perf(agent-loop): cut 47% of per-conversation function calls via 3 targeted hot-path optimizations (#28866 )	2026-05-19 14:25:10 -07:00
retry_utils.py	feat(agent): add jittered retry backoff	2026-04-08 00:41:36 -07:00
shell_hooks.py	fix: guard yaml.safe_load, flock unlock, TOCTOU races, and atomic writes	2026-05-19 00:12:41 -07:00
skill_bundles.py	feat(skills): add skill bundles — alias /<name> loads multiple skills (#28373 )	2026-05-18 21:38:05 -07:00
skill_commands.py	fix(skills): load symlinked skill slash commands	2026-05-18 00:34:29 -07:00
skill_preprocessing.py	fix: treat inline-shell timeout guard as timeout	2026-05-18 19:36:04 -07:00
skill_utils.py	perf(cli): cut ~19s from 'hermes' cold start (skills cache + lazy Feishu + no Nous HTTP) (#22138 )	2026-05-08 16:39:32 -07:00
stream_diag.py	refactor(run_agent): extract stream diagnostics to agent/stream_diag.py	2026-05-16 18:28:17 -07:00
subdirectory_hints.py	fix(agent): catch PermissionError in subdirectory hint discovery	2026-04-09 03:10:30 -07:00
system_prompt.py	perf(prompt): cache kanban worker guidance at session init	2026-05-18 20:56:44 -07:00
think_scrubber.py	fix(agent): stateful streaming scrubber for reasoning-block leaks (#17924 ) (#20184 )	2026-05-05 04:33:38 -07:00
title_generator.py	fix: improve telegram topic mode setup	2026-05-04 12:07:17 -07:00
tool_dispatch_helpers.py	fix(agent): set tool_name on tool-result messages at construction time	2026-05-19 20:49:11 +01:00
tool_executor.py	fix(agent): set tool_name on tool-result messages at construction time	2026-05-19 20:49:11 +01:00
tool_guardrails.py	fix: add recovery hints to loop guard warnings	2026-05-19 00:12:12 -07:00
tool_result_classification.py	fix: classify landed file mutations with diagnostics	2026-05-13 06:46:23 -07:00
trajectory.py	Refactor Terminal and AIAgent cleanup	2026-02-21 22:31:43 -08:00
usage_pricing.py	fix(pricing): add deepseek-v4-pro to official docs pricing table	2026-05-12 16:32:57 -07:00
video_gen_provider.py	feat(video_gen): unified video_generate tool with pluggable provider backends (#25126 )	2026-05-13 16:39:41 -07:00
video_gen_registry.py	feat(video_gen): unified video_generate tool with pluggable provider backends (#25126 )	2026-05-13 16:39:41 -07:00
web_search_provider.py	fix(web): align _LEGACY_PREFERENCE with legacy 7-provider order + doc cleanup	2026-05-13 22:31:28 -07:00
web_search_registry.py	fix(web): align _LEGACY_PREFERENCE with legacy 7-provider order + doc cleanup	2026-05-13 22:31:28 -07:00