hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-24 16:54:43 +00:00

Author	SHA1	Message	Date
teknium1	077e41330d	test(docker): update network-reuse harness fake ps output for egress-aware 3-field probe Some checks are pending CI / Detect affected areas (push) Waiting to run Details CI / Python tests (push) Blocked by required conditions Details CI / Python lints (push) Blocked by required conditions Details CI / JS & TS checks (push) Blocked by required conditions Details CI / Desktop E2E (push) Blocked by required conditions Details CI / Docs Site (push) Blocked by required conditions Details CI / Deny unrelated histories (push) Blocked by required conditions Details CI / Check contributors (push) Blocked by required conditions Details CI / Check uv.lock (push) Blocked by required conditions Details CI / package-lock.json diff (push) Blocked by required conditions Details CI / Lint Docker scripts (push) Blocked by required conditions Details CI / Build&Test Docker image (push) Blocked by required conditions Details CI / Supply-chain scan (push) Blocked by required conditions Details CI / Review label gate (push) Blocked by required conditions Details CI / OSV scan (push) Waiting to run Details CI / CI review comment (live) (push) Blocked by required conditions Details CI / All required checks pass (push) Blocked by required conditions Details CI / CI timing report (push) Blocked by required conditions Details Deploy Site / deploy-vercel (push) Waiting to run Details Deploy Site / deploy-docs (push) Waiting to run Details Docker Build, Test, and Publish / build (amd64, type=gha,scope=docker-amd64, type=gha,mode=max,scope=docker-amd64, linux/amd64, ubuntu-latest) (push) Waiting to run Details Docker Build, Test, and Publish / build (arm64, type=gha,scope=docker-arm64, type=gha,mode=max,scope=docker-arm64, linux/arm64, ubuntu-24.04-arm) (push) Waiting to run Details Docker Build, Test, and Publish / publish (amd64, type=gha,scope=docker-amd64, type=gha,mode=max,scope=docker-amd64, linux/amd64, ubuntu-latest) (push) Blocked by required conditions Details Docker Build, Test, and Publish / publish (arm64, type=gha,scope=docker-arm64, type=gha,mode=max,scope=docker-arm64, linux/arm64, ubuntu-24.04-arm) (push) Blocked by required conditions Details Docker Build, Test, and Publish / merge (push) Blocked by required conditions Details auto-fix lint issues & formatting / Generate eslint --fix patch (push) Waiting to run Details auto-fix lint issues & formatting / Apply patch (push) Blocked by required conditions Details test_docker_network_config.py landed on main after the #58489 revert and stubbed docker ps with the 2-field ID\tState format. The re-landed egress-aware reuse probe requests ID\tState\tEgressLabel when egress is off, so the fake line failed to parse and the reuse path never fired. Fixture-only change; production behavior is unchanged.	2026-07-24 09:49:00 -07:00
teknium1	397e9fc1e4	Reapply "Merge pull request #30179 from NousResearch/feat/iron-proxy" This reverts commit `c6dc7c03c3`.	2026-07-24 09:49:00 -07:00
Teknium	8611b69dad	fix(skills): scope 60-char description enforcement to the create path The blanket MAX_DESCRIPTION_LENGTH=1024->60 change is narrowed: create-time validation now rejects new skills whose description exceeds SKILL_PROMPT_DESC_LIMIT (60) with actionable guidance, while edit/patch paths stay permissive (warning via system_prompt_preview) so existing over-limit skills remain maintainable. Runtime display truncation in skills_tool is left at 1024 (display behavior is a separate concern from authoring validation). Boundary tests: 60 accepted, 61 rejected at create; edit/patch on over-budget skills still succeed.	2026-07-24 07:54:21 -07:00
flyingdoubleg	e9a7c18890	fix(memory): honor disabled toolsets for provider tools	2026-07-24 13:00:53 +05:30
Alan Harman-Box	accbf4d912	feat: show system_prompt_preview when skill description exceeds prompt limit When a skill is created or edited with a description longer than SKILL_PROMPT_DESC_LIMIT (60 chars), the tool response now includes a system_prompt_preview field showing exactly what the system prompt skill index will display. This gives the agent immediate feedback to self-correct truncated trigger phrases. Also adds tool schema guidance about the 57-char window and fixes a stale docstring in skill_commands.py that incorrectly claimed the system prompt renders the full description.	2026-07-23 21:06:56 -07:00
Julien Talbot	b9b100da11	docs(xai): clarify x_search vs xurl routing without schema cross-refs Make the x_search / xurl boundary explicit in the skill, feature docs, toolset metadata, setup note, and reference pages, while keeping the model-facing x_search schema generic (no static xurl name). Regression tests assert behavioral routing invariants rather than frozen prose snapshots. Drop the stale CI-only plugin/hangup hunks already on main so this rebases cleanly.	2026-07-23 21:06:47 -07:00
Ben Barclay	ef6ce56cad	fix(cron): reconcile external provider after a claimed direct run (#70479 ) A direct run (cronjob action='run' / webhook-triggered manual fire) takes the job's fire_claim and advances next_run_at. When it races the external provider's scheduled fire for the same occurrence, Chronos loses the claim and — by design — does not re-arm (the winner owns the re-arm). But the direct-run winner never notified the provider, so the NAS one-shot for the consumed occurrence was left stale forever and the recurring job silently stopped firing. Observed in production: a managed 1-minute review job stalled for 20 hours because a GitHub-webhook direct run claimed the job 2s before the Chronos fire arrived; every subsequent occurrence was orphaned while /api/status stayed green. Fix: after a claimed direct execution completes (success or failure — next_run_at advances at claim time either way), call _notify_provider_jobs_changed_safe() so the active provider re-arms the post-run next_run_at. No-op for the built-in ticker; claim-lost direct runs still never notify (the winning scheduler owns the re-arm).	2026-07-24 13:37:04 +10:00
Teknium	d4b6165018	fix(skills): move dogfood skill into the software-development category skills/dogfood/SKILL.md sat at the root of the bundled skills tree, making it one of the few uncategorized skills (Discord /skill autocomplete listed it under 'uncategorized'; hermes_cli/commands.py cited it as the example). Move it to skills/software-development/dogfood/ alongside the other QA/testing skills (test-driven-development, systematic-debugging, requesting-code-review). - git mv skills/dogfood -> skills/software-development/dogfood (history preserved) - fix test fixture path in tests/tools/test_browser_console.py - update root-level-skill docstring examples (commands.py, generate-skill-docs.py) to cite computer-use, which is still root-level - drop 'dogfood' from the category list in hermes-agent-skill-authoring SKILL.md (en + zh-Hans docs mirrors) - docs: regenerate/move the bundled page to software-development-dogfood (en + zh-Hans), update sidebars.ts, skills-catalog.md, and the adversarial-ux-test related-skills link E2E validated: fresh sync_skills() copies to the new nested path; existing installs with the old flat copy keep it untouched (manifest is name-keyed, hash unchanged -> skipped, no duplicate); _get_category_from_path resolves 'software-development'; docusaurus build green.	2026-07-23 19:45:54 -07:00
Gabriel Steenhoek	30bb55588f	fix(gateway): retry detached restart watcher without breakaway The Windows /restart watcher's outer Popen spawns the watcher with windows_detach_popen_kwargs() (which carries CREATE_BREAKAWAY_FROM_JOB), but a restrictive parent job object can reject that bit with OSError and the current call has no retry. Preserve the current watcher implementation and add a focused breakaway-denied fallback. Preserved from current main: watcher_python / pythonw.exe selection, the str(restart_after_s) deadline, the scrubbed watcher_env, the intentional no-breakaway inline respawn, and the entire POSIX setsid/bash path. - primary keeps **windows_detach_popen_kwargs() - on OSError, retry the same argv/env with creationflags=windows_detach_flags_without_breakaway() - on dual failure, log a definitive, path-safe warning (interpreter basename + numeric winerror/errno only) and return without crashing Replace the superseded breakaway-first inline design and its AST tests with focused behavioral coverage that drives the real coroutine with a mocked subprocess.Popen (retry, argv/env/DEVNULL preservation, POSIX single-session kwarg, no-breakaway inline respawn, secret-safe logging). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-07-23 17:59:47 -07:00
replygirl	d9fe008db8	fix(slack): prefer live send adapter and try multi-workspace tokens individually Two related Slack delivery fixes for send_message text sends: - Route Slack text delivery through _send_via_adapter so the live in-process gateway adapter (multi-workspace aware, channel→client mapping, adapter-side gates) is preferred, with the plugin's _standalone_send as the out-of-process fallback — matching how the media path already behaves. - _standalone_send: SLACK_BOT_TOKEN can be a comma-separated list in multi-workspace installs and slack_tokens.json carries OAuth per-workspace tokens; the standalone Web-API path used to send the literal comma-joined string, which Slack rejects as invalid_auth. Try each token individually, retrying on token-scoped errors (invalid_auth / not_in_channel / channel_not_found …) and stopping on terminal ones. User-DM resolution (U…/W… targets) also tries each token. Adapted from #47547 by @replygirl — the original patched the legacy tools/send_message_tool.py::_send_slack helper, which moved to the Slack plugin's _standalone_send in #41112. Salvaged from #47547	2026-07-23 12:01:24 -07:00
Teknium	f50c3d904c	fix(delegation): persist origin_session_id in durable dispatch records origin_session_id (the api_server wake self-post target) lived only in the in-memory record: durable dispatch persistence and abandoned- delegation recovery omitted it, leaving completions recovered after a process restart unroutable to api_server sessions. Persist it in the async_delegations table (CREATE TABLE + ALTER TABLE migration for legacy DBs), restore it on recovery, and expose it via get_durable_delegation. Also adds the contributors mapping for ianks (PR #64998 author). Follow-up to #64998 (sweeper review F3).	2026-07-23 11:55:17 -07:00
Ian Ker-Seymer	246eacea7b	fix(gateway): deliver kanban/delegate wake-ups to api_server sessions Wake-ups for kanban notifications and background delegation completions were injected via handle_message() using a build_session_key()-derived key, which can never match the raw X-Hermes-Session-Id key that api_server sessions run under — so the wake landed in a session nobody was reading. On top of that, ApiServerAdapter.send() reports failure without raising, and that was treated as a successful delivery, so the notify cursor advanced past events that were permanently lost; and background delegation was forced synchronous on api_server since there was no way to wake the session afterward. Fix: route wake-ups for non-push adapters through a self-post to /v1/chat/completions with the original session id, treat non-raising send failures as failures (rewind instead of advancing the cursor), and re-enable background delegation whenever a session id is available to wake. The origin session id is captured from the request-scoped api_server chat_id binding rather than HERMES_SESSION_ID: constructing a child agent calls set_current_session_id() with the subagent's internal id, clobbering that variable right before dispatch would read it and misrouting the wake into the subagent's own session. Related: #56580, #64609, #53027, #63169, #56531, #50319, #64113	2026-07-23 11:55:17 -07:00
Eugeniusz Gilewski	0cd4afeafd	fix(security): guard remaining preflighted HTTP fetches Several platform fetch paths called is_safe_url before constructing ordinary httpx clients, leaving a second DNS lookup at connection time. This preserved the rebinding window for Slack batch images, Feishu documents, Telegram URL-photo fallback, and WeCom remote media. Route each path through create_ssrf_safe_async_client and the shared redirect guard so direct connections validate and dial vetted IPs while configured proxies remain an explicit trusted egress boundary. Add per-path regressions that change DNS from public at preflight to metadata at connect time. The Skills Hub provenance fixture intentionally serves content over loopback. Opt that test-scoped server into private-address access so it keeps exercising the real HTTP transport without weakening production blocking. Related #8033 Co-authored-by: teknium1 <127238744+teknium1@users.noreply.github.com>	2026-07-23 11:44:43 -07:00
Eugeniusz Gilewski	42626da1ce	fix(security): pin DNS resolutions for SSRF-safe fetches Install connect-time DNS validation for Hermes-owned direct httpx clients so SSRF-sensitive fetch paths dial a vetted IP instead of re-resolving after preflight. This preserves Host/SNI semantics for direct HTTP(S) connections and keeps proxy routing as an explicit trusted egress boundary. Wire the guarded clients into media cache downloads, vision downloads, Skills Hub direct/raw fetches, and platform attachment fetch paths that already perform SSRF preflight and redirect validation. Fixes #8033 Co-authored-by: Tom Qiao <zqiao@microsoft.com>	2026-07-23 11:44:43 -07:00
Teknium	0dbf639bc8	fix(windows): hidden-console daemons — extend the parent-console fix to every detached spawn path (#70205 ) Extends the desktop backend's root-cause fix (`aa2ae36c3f`) to all remaining console-less parent launch paths. The Windows console-flash class (#54220/#56747) is governed by the PARENT's console: a DETACHED_PROCESS or pythonw.exe daemon has no console, so every console-subsystem descendant (git, gh, cmd, node, wmic, powershell) allocates its own visible conhost — one flash per spawn, unreachable by any per-call-site CREATE_NO_WINDOW sweep. Worse, MSDN specifies CREATE_NO_WINDOW is IGNORED when combined with DETACHED_PROCESS, so the hide bit in the old detach bundle was dead. Changes: - _subprocess_compat: drop DETACHED_PROCESS from windows_detach_flags() and windows_detach_flags_without_breakaway(); the daemon now owns a single hidden console (CREATE_NO_WINDOW) that all descendants inherit. - gateway_windows: _resolve_detached_python() returns the venv console python.exe (no pythonw/base-interpreter detour — the uv-shim flash premise only held while DETACHED_PROCESS was masking the hide bit); UAC handoff launches console python under SW_HIDE; cmd/vbs launchers render console python (vbs runs it window-style 0). - gateway/run.py: restart watcher keeps sys.executable instead of swapping in GUI-subsystem pythonw. - web_server: dashboard actions spawn sys.executable (already carries windows_detach_flags()). Tests updated to pin the new invariants, including an explicit DETACHED_PROCESS-must-stay-out regression guard.	2026-07-23 11:13:14 -07:00
Willie Peacock	b9b5481d62	fix(kanban): preserve cross-profile project child routing	2026-07-23 08:34:06 -07:00
Willie Peacock	6833eabb53	fix(kanban): isolate worker-created child workspaces Default kanban_create children now keep fresh scratch paths, while explicit dir sharing remains supported and project context resolves to a per-task worktree. Surface resolved workspace fields in create responses/events and cover scratch mutation, nesting, explicit sharing, and project inheritance. Fixes #67567	2026-07-23 08:34:06 -07:00
Mustafa	dc3e4e8428	fix(kanban): keep delegated results in worker turn Dispatcher-spawned Kanban workers are finite one-shot processes, so detached delegation completions can outlive their only consumer. Mark that runtime as unable to deliver async completions and reuse the synchronous delegation fallback, returning required child results before the worker exits.\n\nAlso make unsupported-session notes runtime-generic and cover the delayed-child lifecycle regression.\n\nRefs #63169	2026-07-23 08:33:55 -07:00
Dickson Neoh	08298dabbd	fix(computer-use): handle Linux cua window metadata Treat cua-driver's Linux `is_on_screen: null` as unknown instead of off-screen, and skip GNOME Shell desktop/backdrop helper windows (ding "Desktop Icons", @!x,y;BDHF) when selecting the default capture target — they are targetable X11 windows but capture as empty. Reconciled with the _NET_ACTIVE_WINDOW fallback from #58030: helper windows are filtered out of the candidate pool first, then the tied z-order active-window probe runs on the remaining real app windows. Also falls back to the requested app name for _last_app when Linux windows carry no app_name. Salvaged from #54173 by @dnth.	2026-07-23 08:11:58 -07:00
Teknium	792ede0a36	test: pin repo root on PYTHONPATH for subprocess-boundary kanban isolation tests The three subprocess tests spawn 'sys.executable -c' children that import hermes_cli. From a worktree, the child resolved the MAIN checkout's editable install instead of the tree under test, so the new DB/CLI guards appeared missing and the tests failed with rc=0. Route the spawns through a helper that pins the repo root under test on PYTHONPATH.	2026-07-23 07:33:36 -07:00
trkim	a7dcf9787b	fix(kanban): harden delegated-child mutation boundary	2026-07-23 07:33:36 -07:00
trkim	47bbc12e18	fix(kanban): isolate delegated children from parent task	2026-07-23 07:33:36 -07:00
Brooklyn Nicholson	69b97a97f7	Merge remote-tracking branch 'origin/main' into bb/salvage-53841-no-overlay # Conflicts: # tools/computer_use/cua_backend.py	2026-07-23 01:26:21 -05:00
Brooklyn Nicholson	8b6d34ad85	test(computer_use): pin resolved driver for update-check tests cua_driver_update_check now short-circuits to None when no driver resolves; CI has none installed, so pin resolve_cua_driver_cmd in the update-check and env-sanitization tests.	2026-07-23 01:10:19 -05:00
David Metcalfe	f43ff5b4bb	fix(computer_use): mock _cua_no_overlay in existing tests, fix platform-dependent assertions - Add autouse fixture to TestMcpInvocationResolution to disable --no-overlay flag so existing tests assert baseline args - Make test_config_load_failure_fails_safe and test_missing_section_enables platform-aware (Linux auto-detect returns True, macOS/Windows False)	2026-07-23 00:44:43 -05:00
Tianqing Yun	2f9d88caee	fix: resolve cua-driver across computer-use surfaces	2026-07-23 00:42:46 -05:00
Tianqing Yun	030822b68d	fix: resolve cua-driver from user-local install paths	2026-07-23 00:42:46 -05:00
Teknium	b0358cf3c8	fix(slack): unify DM-resolution caches and bound wave-2 caches (C16 policy) Post-rebase consolidation over the merged C7/C16/C10 work: - _ensure_dm_conversation now records workspace ownership via _remember_channel_team and bounds _dm_conversation_cache (cap 5000) - module-level _slack_dm_cache (C7 standalone path) bounded oldest-first - _user_is_bot_cache (C10) bounded with _trim_oldest_dict_entries - caption-mode contract tests updated for the C7+C8 merged media path	2026-07-22 21:58:13 -07:00
davesecops	57f8ba3b19	fix(slack): open DMs for user send targets	2026-07-22 21:58:13 -07:00
HexLab98	8df8f86784	test(slack): cover send_message MEDIA delivery Add standalone-sender media cases and route coverage; point the non-media platform assertions at SMS now that Slack supports MEDIA.	2026-07-22 21:58:13 -07:00
brooklyn!	507d479c8c	fix(clarify): one canonical timeout across CLI, TUI/desktop, and gateway (#69774 ) * test(clarify-gateway): cover signature, timeout fallback, and notify paths for 100% coverage Fixes #36531 (cherry picked from commit `5265dfe2f5`) * fix(clarify): one canonical timeout across CLI, TUI/desktop, and gateway The clarify wait timeout was resolved three different (wrong) ways: - CLI (`cli.py`, `hermes_cli/callbacks.py`) read a non-existent top-level `clarify.timeout`, so it always fell through to a hardcoded 120s instead of the canonical `agent.clarify_timeout` (default 3600) the gateway uses (#42969). - The TUI/desktop bridge called `_block("clarify.request", …)` with no timeout, so it used the hardcoded 300s `_block` default and ignored config (#51960). - There was no way to disable the auto-skip: a user who wanted the agent to wait indefinitely while they think couldn't get it. Collapse all of this onto a single resolver: - `tools.clarify_gateway.resolve_clarify_timeout(config)` is the one source of truth. Order: explicit legacy `clarify.timeout` (back-compat) → canonical `agent.clarify_timeout` → 3600. `<= 0` is preserved verbatim as "unlimited". - CLI, callbacks, and the TUI bridge (`_clarify_timeout_seconds`) all route through it, so the three surfaces can't drift. - `<= 0` means unlimited everywhere: `wait_for_response` and `_block` drop the deadline (heartbeat still fires), and the CLI hides its countdown. Tests: resolver order / default / non-numeric / unlimited-sentinel; an unlimited `wait_for_response` blocks until resolved rather than auto-skipping; the TUI clarify bridge passes the configured timeout to `_block`. Supersedes #42974 (CLI key), #51993 (TUI honors config), and #68986 (unlimited wait); folds in #52031 (clarify_gateway coverage). Co-authored-by: liuhao1024 <liuhao1024@users.noreply.github.com> Co-authored-by: lkevincc0 <lkevincc0@users.noreply.github.com> Co-authored-by: theone139344 <theone139344@users.noreply.github.com> Co-authored-by: baauzi <baauzi@users.noreply.github.com> --------- Co-authored-by: Christopher-Schulze <210261288+Christopher-Schulze@users.noreply.github.com> Co-authored-by: liuhao1024 <liuhao1024@users.noreply.github.com> Co-authored-by: lkevincc0 <lkevincc0@users.noreply.github.com> Co-authored-by: theone139344 <theone139344@users.noreply.github.com> Co-authored-by: baauzi <baauzi@users.noreply.github.com>	2026-07-22 23:25:13 -05:00
HexLab98	b0b7f15598	test(computer-use): assert exact capture skips X11 active-window probe Cover exact_target selection and capture(pid=, window_id=) so the xprop helper is not invoked on the hot path.	2026-07-22 21:15:18 -07:00
HexLab98	a57bab1c84	test(computer-use): cover Linux X11 active-window selection Direct xprop parse-boundary coverage plus tied/unknown z_index and higher-z-frontmost regressions for #58026.	2026-07-22 21:15:18 -07:00
kandotrun	4f726ed467	fix(slack): preserve media in standalone cron delivery	2026-07-22 20:57:25 -07:00
Teknium	e907ecccef	test(search): prove discovery scope and bookend bounding work together on compacted sessions Integration test for the two compaction layers landing this sweep: #63144's discovery-scope fix (archived rows surface from the current session) and #69334's bookend bounding (summary exclusion + content caps). A single compacted session exercises both: the archived FTS hit must surface while the compaction handoff at the session tail stays out of bookend_end and long messages stay capped.	2026-07-22 16:59:09 -07:00
MacroAnarchy	98c0d8b291	fix(search): require compacted=1 not just active=0, scope delegation-under-compression Addresses @teknium1's second review round on #63144: 1. _is_compacted_message now checks both active=0 AND compacted=1. Previously checked active=0 alone, which also matched rewind/undo rows (active=0, compacted=0) that must stay hidden. 2. Added _is_compression_ended() — checks only the session's own end_reason, not the lineage-wide has_compression_hop flag. This prevents delegation children living under a compression continuation from leaking through the lineage filter. 3. _discover lineage skip now uses is_ended_session (session-level check) instead of has_compression (lineage-level flag). New tests: - TestRewindExclusion: rewind rows stay hidden alongside compacted rows - TestCompressionEndedHelper: session-level end_reason checks - TestLegacyContinuationPlusDelegation: delegate child excluded while compression ancestors surface 78 passed (71 + 7 new).	2026-07-22 16:59:09 -07:00
MacroAnarchy	711f1c2f1a	fix(search): surface compaction-archived and compression-parent sessions in discovery After context compaction, pre-compaction content was invisible to session_search — a memory black hole. The _discover() skip logic filtered both same-session and same-lineage hits unconditionally, without distinguishing compression-summarised content (gone from live context) from delegation children (still visible to the parent agent). Reworked _resolve_to_parent to return (root_id, has_compression_hop), checking end_reason='compression' on every hop during the same db.get_session() traversal — zero extra queries. _discover() now has three compression-aware paths: - In-place compaction: FTS hits on active=0 (compacted=1) rows pass through even when raw_sid == current_session_id - Legacy rotation: lineage hits pass through when has_compression_hop is true on either side of the chain - Delegation children: still excluded (no compression edge) 18 new tests covering all three scenarios + unit tests for the helpers. Addresses Teknium's review feedback on #6256. Closes #13840, #13841.	2026-07-22 16:59:09 -07:00
Brooklyn Nicholson	393c100a92	feat(voice): speech-interrupted latch in the TTS streaming core mark_speech_interrupted() / take_speech_interrupted(): a one-shot, TTL'd (120s) latch plus SPEECH_INTERRUPTED_NOTE. Barge-in paths mark it when they cut live speech; the next turn's submit path pops it and prepends the note to the model-bound message — API-call local, never persisted, so history and prompt caching are untouched.	2026-07-22 17:53:06 -05:00
Brooklyn Nicholson	b135a8badd	feat(voice): stream any provider in the CLI, with barge-in capture The streaming gate broadens from ElevenLabs-only to any provider that passes check_tts_requirements(). In continuous voice mode a mic monitor runs during playback: talking over the agent cuts TTS at detection while the monitor keeps recording, then the captured interruption is transcribed and queued as the next turn (process_loop's auto-restart stands down while the capture owns the mic). New voice.barge_in config key, default true.	2026-07-22 17:47:15 -05:00
Brooklyn Nicholson	8ce18d2557	feat(voice): barge-in detector with pre-roll utterance capture listen_for_speech(): sustained-RMS speech detection with the noise floor calibrated against audible playback (speaker bleed doesn't self-trigger). With capture=True it keeps a rolling pre-roll buffer and records through to a silence endpoint, so the interruption is transcribed from its first syllable — detection alone loses the opening words to the detector's sustain window plus the mic re-open. transcribe_recording() maps no_speech provider results to a successful empty transcript (silence, not an error).	2026-07-22 17:47:05 -05:00
Brooklyn Nicholson	d8f30b1df7	fix(stt): flag empty transcripts as no_speech An STT provider that hears no words is reporting silence, not failing. ElevenLabs/xAI empty-transcript errors now carry no_speech so live voice loops can re-listen quietly instead of surfacing an error on every pause.	2026-07-22 17:47:04 -05:00
Brooklyn Nicholson	592effcb2a	feat(tts): provider-agnostic streaming core with a shared sentence chunker StreamingTTSProvider ABC + registry + resolve_streaming_provider() — ElevenLabs (pcm_24000) and OpenAI (response_format=pcm) stream chunked PCM; every other provider keeps its configured voice and falls back to per-sentence sync synthesis. SentenceChunker is the one incremental cutter every surface shares: sentence-boundary cuts on the delta stream, <think> blocks stripped even when split across deltas, short fragments merged forward so they never stall as tiny clips.	2026-07-22 17:46:55 -05:00
Brooklyn Nicholson	70ba3c4828	feat(desktop): agent can focus panes + shared desktop-UI event bridge Extract the open_preview emitter into a shared tools/desktop_ui bridge (one gateway-injected sink, routed by HERMES_UI_SESSION_ID) and add a second desktop-gated tool on top of it: - focus_pane(chat\|files\|terminal\|review\|sessions) -> pane.reveal event. The desktop runs each pane's own reveal path (revealDesktopPane table) and only acts on the active window -- a background turn never moves the user's focus (desktop AGENTS.md: offer, don't hijack). open_preview now emits through the same bridge. Both tools are check_fn on HERMES_DESKTOP (zero footprint elsewhere), sitting beside read_terminal/close_terminal in _HERMES_CORE_TOOLS. Deliberately not adding run_slash: letting the agent fire slash commands mid-turn (/model, /new, /clear) fights prompt-cache + conversation invariants.	2026-07-22 12:13:01 -05:00
Brooklyn Nicholson	f071f42244	feat(desktop): let the agent open the preview pane Add a desktop-gated open_preview tool so 'open cnn.com in the preview pane' works. The tool (check_fn on HERMES_DESKTOP, zero footprint elsewhere) emits a preview.open event through a gateway-injected emitter, mirroring the close_terminal -> terminal.close bridge. The desktop handles it in usePreviewRouting, normalizing the target and opening the pane for the active session only -- a background turn never hijacks it. Bare domains and localhost are coaxed into fetchable URLs (www.cnn.com -> https://, localhost:3000 -> http://); file paths and schemes pass through to the renderer's normalizer.	2026-07-22 12:03:24 -05:00
liuhao1024	07cbb500b6	fix(clarify): reject arbitrary prose for native interactive multi-choice clarifies In Slack threads, ordinary follow-up messages sent while a native multi-choice clarify was pending were consumed as clarify answers — _coerce_text_response accepted arbitrary text for any pending entry, so the gateway text-intercept swallowed unrelated thread messages and the user's messages appeared to be ignored. Tighten resolve_text_response_for_session for native interactive multi-choice prompts (buttons rendered, awaiting_text=False): - numeric selections ("2") still resolve to the canonical choice - exact choice-label matches (case-insensitive) still resolve - arbitrary prose is now REJECTED (returns False) so the message continues as a normal turn instead of vanishing into the clarify Behavior is preserved everywhere free text is legitimately the answer: open-ended clarifies, explicit 'Other' text-capture mode, and the base adapter's numbered-text fallback (which flips awaiting_text at send time). Salvaged from PR #62042 by @liuhao1024. Fixes #62034	2026-07-22 07:00:47 -07:00
liuhao1024	9bb253d4fa	fix(tools): filter compaction summaries from session_search bookends and cap content length Context-compaction handoff summaries (prefixed with [CONTEXT COMPACTION]) were being returned as normal bookend_start/bookend_end messages in session_search discovery mode. A single compaction handoff could be 57K+ chars, immediately bloating a fresh session prompt to 73K+ chars from one search hit. Changes: - Add _COMPACTION_PREFIXES and _is_compaction_summary() helper - Filter compaction summaries from bookend_start and bookend_end in _discover() - Cap bookend content to 1200 chars and window content to 4000 chars - Add content_truncated/original_content_chars metadata when truncation occurs - Add 6 regression tests covering prefix detection, bookend filtering, content capping, and legacy [CONTEXT SUMMARY] prefix Fixes #43175	2026-07-22 07:00:04 -07:00
Teknium	295e20358c	feat(delegation): let subagents use execute_code (#69325 ) Children inherit the parent's env, repo, and toolsets but were denied execute_code ('children should reason step-by-step, not write scripts'). That forces subagents doing mechanical multi-step work (batch file reads, fetch-N-pages loops, filter-before-context reductions) to burn reasoning iterations one tool call at a time. - Remove execute_code from DELEGATE_BLOCKED_TOOLS - Stop stripping the code_execution toolset from child bundles - No recursion risk: the sandbox bridges only the 7 SANDBOX_ALLOWED_TOOLS (web/file/terminal) — delegate_task and execute_code itself are not reachable from inside a sandbox script - Update schema text, AGENTS.md, and delegation docs - Tests: blocked-constant, strip, and child-assembly tests updated; new test pins execute_code as intentionally unblocked	2026-07-22 06:58:45 -07:00
Andy	e89216e7f5	fix(secrets): harden encrypted Bitwarden cache	2026-07-22 04:40:07 -07:00
SeoYeonKim	2beed8b53d	fix(mcp): pass secret-source-injected env vars to stdio servers Surgical reapply of PR #37523 onto current main (the original branch predates the SecretSource registry refactor). _build_safe_env() now forwards env vars tagged in env_loader._SECRET_SOURCES — widened from Bitwarden-only to any registered secret source (Bitwarden, 1Password, plugin backends), since the provenance map is source-agnostic. Explicit server env: config still wins; untagged secrets stay filtered. Fixes #37499.	2026-07-22 03:19:42 -07:00
brooklyn!	75be8fb463	test(mcp): stamp breaker-open time on the monotonic clock, not a literal (#69003 ) test_session_expired_retry_waits_for_new_session hardcoded _server_breaker_opened_at["hindsight"] = 123.0 to simulate a circuit breaker whose cooldown has already elapsed. But the breaker in tools/mcp_tool.py compares that stamp against time.monotonic() (age = monotonic() - opened_at, elapsed when age >= _CIRCUIT_BREAKER_COOLDOWN_SEC). time.monotonic()'s origin is arbitrary and small on a freshly-booted CI container, so age worked out to only a few seconds there (< the 60s cooldown) — the breaker stayed open, the half-open probe never fired, and the retry returned the "unreachable" error instead of "bank ok". It passed on long-uptime dev boxes (large monotonic) and failed under CI, with the reported "Auto-retry available in ~Ns" drifting run to run as the container's monotonic clock varied. Stamp opened_at relative to the same clock the code reads (time.monotonic() - _CIRCUIT_BREAKER_COOLDOWN_SEC - 1.0) so the cooldown is provably elapsed regardless of the monotonic origin, exercising the intended half-open transition deterministically.	2026-07-21 19:54:40 -05:00

1 2 3 4 5 ...

1653 commits