hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-17 09:41:58 +00:00


Teknium af22421e87 feat(dashboard): page-scoped plugin slots for built-in pages (#15658)	2026-04-25 06:55:35 -07:00

* fix(terminal): three-layer defense against watch_patterns notification spam

Background processes that stack notify_on_complete=True with watch_patterns can flood the user with duplicate, delayed notifications: matches deliver asynchronously via the completion queue and continue arriving minutes after the process has exited. The docstring warning against this (PR #12113) has proven insufficient; agents still misuse the combination.

Three layered defenses, each sufficient on its own:

1. Mutual exclusion (terminal_tool.py): When both flags are set on a background process, drop watch_patterns with a warning. notify_on_complete wins because 'let me know when it's done' is the more useful signal and fires exactly once. Extracted as _resolve_notification_flag_conflict() so the rule is testable in isolation.
2. Suppress-after-exit (process_registry.py): _check_watch_patterns() now bails the moment session.exited is True. Post-exit chunks (buffered reads draining after the process is gone) no longer produce notifications. This is the fix flagged as future work in session 20260418_020302_79881c.
3. Global circuit breaker (process_registry.py): Per-session rate limits don't catch the sibling-flood case; N concurrent processes can each stay under 8/10s and still collectively spam. New WATCH_GLOBAL_MAX_PER_WINDOW=15 cap trips a 30-second cooldown across ALL sessions, emits a single watch_overflow_tripped event, silently counts dropped events, and emits a watch_overflow_released summary when the cooldown ends.

Also updates the tool schema + docstring to document the new behavior.

Tests: 8 new tests covering all three fixes (suppress-after-exit x2, mutual-exclusion resolver x4, global breaker trip/cooldown/release x2). All 60 tests across test_watch_patterns.py, test_notify_on_complete.py, test_terminal_tool.py pass.

Real-world trigger: self-inflicted in session 20260425_051924. Three concurrent hermes-sweeper review subprocesses each set watch_patterns=['failed validation', 'errored'] AND notify_on_complete=True, then iterated over multiple items, producing enough matches per process to defeat the per-session cap while staying under the global cap that didn't yet exist.

* fix(terminal): aggressive 1-per-15s watch_patterns rate limit + strike-3 promotion

Per Teknium's direction, the watch_patterns rate limit is now much more aggressive and self-healing.

## New rule: per session

- HARD cap: 1 watch-match notification per 15 seconds per process.
- Any match arriving inside the cooldown window is dropped and counts as ONE strike for that window (many drops in the same window still = 1 strike).
- After 3 consecutive strike windows, watch_patterns is permanently disabled for the session and the session is auto-promoted to notify_on_complete semantics: exactly one notification when the process actually exits.
- A cooldown window that expires with zero drops resets the consecutive strike counter; healthy cadence is forgiven.

## Schema + docstring rewritten

The tool schema description now gives the model explicit guidance:

- notify_on_complete is 'the right choice for almost every long-running task'
- watch_patterns is for RARE one-shot signals on LONG-LIVED processes
- Do NOT use watch_patterns with loops/batch jobs; error patterns fire every iteration and will hit the strike limit fast
- Mutual exclusion is stated on both parameter descriptions
- 1/15s cooldown and 3-strike promotion are stated in the watch_patterns description so the model sees the contract every turn

## Removed

- WATCH_MAX_PER_WINDOW (8/10s) and WATCH_OVERLOAD_KILL_SECONDS (45): the new 1/15s limit subsumes both; keeping them would double-count.
- _watch_window_hits / _watch_window_start / _watch_overload_since fields on ProcessSession. Replaced by _watch_last_emit_at / _watch_cooldown_until / _watch_strike_candidate / _watch_consecutive_strikes.

## Kept

- Global circuit breaker across all sessions (15/10s → 30s cooldown) as a secondary safety net for concurrent siblings. Still valuable when 20 short-lived processes each fire once; none individually violates the per-session limit.
- Suppress-after-exit guard.
- Mutual exclusion resolver at the tool entry point.

## Tests

- 6 new tests in TestPerSessionRateLimit covering: first match delivers, second in cooldown suppressed, multi-drop = single strike, 3 strikes disables + promotes, clean window resets counter, suppressed count carried to next emit.
- Global circuit breaker tests rewritten to use fresh sessions instead of hacking removed per-window fields.
- 50/50 watch_patterns + notify_on_complete tests pass.
- 60/60 including test_terminal_tool.py pass.

* feat(dashboard): page-scoped plugin slots for built-in pages

Dashboard plugins can now inject components into specific built-in pages (Sessions, Analytics, Logs, Cron, Skills, Config, Env, Docs, Chat) without overriding the whole route.

Previously, plugins could only:

1. Add new tabs (tab.path)
2. Replace whole built-in pages (tab.override)
3. Inject into global shell slots (header-, footer-, pre-main, ...)

None of those let a plugin add a banner, card, or widget to an existing page. The new <page>:top / <page>:bottom slots close that gap, reusing the existing registerSlot() API.

Changes

- web/src/plugins/slots.ts: 18 new KNOWN_SLOT_NAMES entries (sessions:top, sessions:bottom, analytics:top, ..., chat:bottom), grouped under "Shell-wide" vs "Page-scoped" in the docblock
- web/src/pages/*: each built-in page now renders <PluginSlot name="<page>:top" /> as the first child of its outer wrapper and <PluginSlot name="<page>:bottom" /> as the last child -- zero visual cost when no plugin registers
- plugins/example-dashboard: registers a demo banner into sessions:top via registerSlot(), with matching slots entry in the manifest -- so freshly-setup users can see what page-scoped slots look like without writing any plugin code
- website/docs: new "Page-scoped slots" table in the plugin authoring guide, with a worked example
- tests/hermes_cli/test_web_server.py: round-trip test for colon-bearing slot names (sessions:top, analytics:bottom, ...)

Validation

- npm run build: clean (tsc -b + vite build, 2761 modules)
- scripts/run_tests.sh tests/hermes_cli/test_web_server.py::TestDashboardPluginManifestExtensions: 5/5 pass
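
The per-session rule described in the commit message above (one watch-match notification per 15 seconds, one strike per cooldown window that drops anything, promotion after three consecutive strike windows) can be sketched in Python roughly as follows. This is an illustration only, not the code in process_registry.py: `WatchState`, `on_watch_match`, and the two constants are hypothetical names, and counting the strike at the first drop inside a window is an assumption about when a strike is recorded.

```python
from dataclasses import dataclass

# Values taken from the commit message; the constant names are hypothetical.
WATCH_COOLDOWN_SECONDS = 15.0
WATCH_MAX_STRIKES = 3


@dataclass
class WatchState:
    """Hypothetical per-session state; the real ProcessSession fields may differ."""
    cooldown_until: float = 0.0     # end of the current cooldown window
    strike_candidate: bool = False  # True once the current window has dropped a match
    consecutive_strikes: int = 0
    disabled: bool = False          # True after promotion to notify_on_complete


def on_watch_match(state: WatchState, now: float) -> str:
    """Return 'emit', 'drop', or 'disabled' for a pattern match seen at time `now`."""
    if state.disabled:
        return "disabled"

    if now < state.cooldown_until:
        # Inside the cooldown: drop. Many drops in one window are still one strike.
        if not state.strike_candidate:
            state.strike_candidate = True
            state.consecutive_strikes += 1
            if state.consecutive_strikes >= WATCH_MAX_STRIKES:
                # Third consecutive strike window: stop watching for good.
                state.disabled = True
                return "disabled"
        return "drop"

    # The cooldown has expired. A window that dropped nothing forgives old strikes.
    if not state.strike_candidate:
        state.consecutive_strikes = 0
    state.strike_candidate = False
    state.cooldown_until = now + WATCH_COOLDOWN_SECONDS
    return "emit"
```

Passing `now` in explicitly, rather than calling a clock inside the function, keeps the rule deterministic, which is convenient for tests like those the commit lists under TestPerSessionRateLimit.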
_category_.json	feat: add documentation website (Docusaurus)	2026-03-05 05:24:55 -08:00
acp.md	docs(acp): fix zed config	2026-04-03 01:46:45 -07:00
api-server.md	feat(api-server): inline image inputs on /v1/chat/completions and /v1/responses (#12969)	2026-04-20 04:16:13 -07:00
batch-processing.md	fix: normalize remaining reasoning effort orderings and add missing 'minimal'	2026-04-09 14:20:16 -07:00
browser.md	feat(browser): CDP supervisor - dialog detection + response + cross-origin iframe eval (#14540)	2026-04-23 22:23:37 -07:00
built-in-plugins.md	feat(plugins): make all plugins opt-in by default	2026-04-20 04:46:45 -07:00
code-execution.md	docs(execute_code): document project/strict execution modes (#12073)	2026-04-18 01:53:09 -07:00
context-files.md	feat: progressive subdirectory hint discovery (#5291)	2026-04-05 12:33:47 -07:00
context-references.md	docs: comprehensive documentation audit - fix stale info, expand thin pages, add depth (#5393)	2026-04-05 19:45:50 -07:00
credential-pools.md	docs: comprehensive docs audit - cover 13 features from last week's PRs (#5815)	2026-04-07 10:21:03 -07:00
cron.md	feat(cron): per-job workdir for project-aware cron runs (#15110)	2026-04-24 05:07:01 -07:00
delegation.md	docs: document delegation width + depth knobs (#13745)	2026-04-21 17:54:39 -07:00
extending-the-dashboard.md	feat(dashboard): page-scoped plugin slots for built-in pages (#15658)	2026-04-25 06:55:35 -07:00
fallback-providers.md	docs: fix fallback behavior description — it is per-turn, not per-session	2026-04-22 21:29:49 -07:00
honcho.md	feat(honcho): wizard cadence default 2, surface reasoning level, backwards-compat fallback	2026-04-18 22:50:55 -07:00
hooks.md	docs(plugins): correct pre_gateway_dispatch doc text and add hooks.md section	2026-04-24 03:02:03 -07:00
image-generation.md	feat(image-gen): add GPT Image 2 to FAL catalog (#13677)	2026-04-21 13:35:31 -07:00
mcp.md	docs: deep quality pass - expand 10 thin pages, fix specific issues (#4134)	2026-03-30 20:30:11 -07:00
memory-providers.md	feat(hindsight): richer session-scoped retain metadata	2026-04-22 05:27:10 -07:00
memory.md	docs: add Supermemory to memory providers docs, env vars, CLI reference	2026-04-06 22:15:58 -07:00
overview.md	feat(delegate): orchestrator role and configurable spawn depth (default flat)	2026-04-21 14:23:45 -07:00
personality.md	docs: document SOUL.md as primary agent identity (#1927)	2026-03-18 04:18:08 -07:00
plugins.md	docs(plugins): correct pre_gateway_dispatch doc text and add hooks.md section	2026-04-24 03:02:03 -07:00
provider-routing.md	docs: fallback providers + /background command documentation	2026-03-15 06:24:28 -07:00
rl-training.md	fix(docs): Add links to Atropos and wandb in user guide	2026-04-23 03:07:06 -07:00
skills.md	refactor(commands): drop /provider, /plan handler, and clean up slash registry (#15047)	2026-04-24 03:10:52 -07:00
skins.md	feat(skin): add warm-lightmode skin from PR #4811	2026-04-13 23:51:21 -07:00
spotify.md	feat(spotify): wire setup wizard into 'hermes tools' + document cron usage (#15180)	2026-04-24 07:24:28 -07:00
tool-gateway.md	feat(image_gen): upgrade Recraft V3 → V4 Pro, Nano Banana → Pro (#11406)	2026-04-16 22:05:41 -07:00
tools.md	docs: add Nous Tool Gateway documentation	2026-04-16 12:36:49 -07:00
tts.md	feat(tts): complete KittenTTS integration (tools/setup/docs/tests)	2026-04-21 01:28:32 -07:00
vision.md	fix(tui): improve macOS paste and shortcut parity	2026-04-21 08:00:00 -07:00
voice-mode.md	feat(voice): add cli beep toggle	2026-04-21 00:29:29 -07:00
web-dashboard.md	docs: consolidate dashboard themes and plugins into Extending the Dashboard (#15530)	2026-04-24 23:26:51 -07:00