hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-25 00:51:20 +00:00

Author	SHA1	Message	Date
Brooklyn Nicholson	f7587dd445	chore: uptick	2026-04-20 13:02:32 -05:00
Brooklyn Nicholson	f1f438e7f9	refactor(tui): drop done-in sys line; FaceTicker counter only The transcript line was noisy. Keep the one thing the issue really needs: live elapsed next to the busy verb.	2026-04-20 11:40:12 -05:00
Brooklyn Nicholson	2de1aad028	refactor(tui): turn elapsed lives in FaceTicker; emit done-in sys line Drops `lastUserAt` plumbing and the right-edge idle ticker. Matches the claude-code / opencode convention: elapsed rides with the busy indicator (spinner verb), nothing at idle. - `turnStartedAt` driven by a useEffect on `ui.busy` — stamps on rising edge, clears on falling edge. Covers agent turns and !shell alike. - FaceTicker renders ` · {fmtDuration}` while busy; 1 s clock for the counter, existing 2500 ms cycle for face/verb rotation. - On busy → idle, if the block ran ≥ 1 s, emit a one-shot `done in {fmtDuration}` sys line (≡ claude-code's `thought for Ns`).	2026-04-20 11:38:11 -05:00
Brooklyn Nicholson	9910681b85	refactor(tui): move last-msg elapsed from status bar to prompt right-edge Status bar ticker was too hot in peripheral vision. The moment the elapsed value matters is when the prompt returns — so surface it there. Dim `fmtDuration` next to the GoodVibesHeart, idle-only (hidden while busy), so quick turns and active streaming stay quiet.	2026-04-20 11:23:58 -05:00
Brooklyn Nicholson	1e7de177e8	feat(tui): show time-since-last-user-message alongside session total (#8541) StatusRule now renders `{sinceLastMsg}/{sinceSession}` (e.g. `12s/3m 45s`) when a user has submitted in the current session; falls back to the total alone otherwise. Wires `lastUserAt` through the state/session lifecycle: - useSubmission stamps `setLastUserAt(Date.now())` on send - useSessionLifecycle nulls it in reset/resetVisibleHistory - /branch slash nulls it on fork	2026-04-20 11:17:34 -05:00
kshitijk4poor	b7e71fb727	fix(tui): fix Linux Ctrl+C regression, remove double clipboard write - Fix critical regression: on Linux, Ctrl+C could not interrupt/clear/exit because isAction(key,'c') shadowed the isCtrl block (both resolve to k.ctrl on non-macOS). Restructured: isAction block now falls through to interrupt logic on non-macOS when no selection exists. - Remove double pbcopy: ink's copySelection() already calls setClipboard() which handles pbcopy+tmux+OSC52. The extra writeClipboardText call in useInputHandlers copySelection() was firing pbcopy a second time. - Remove allowClipboardHotkeys prop from TextInput — every caller passed isMac, and TextInput already imports isMac. Eliminated prop-drilling through appLayout, maskedPrompt, and prompts. - Remove dead code: the isCtrl copy paths (lines 277-288) were unreachable on any platform after the isAction block changes. - Simplify textInput Cmd+C: use writeClipboardText directly without the redundant OSC52 fallback (this path is macOS-only where pbcopy works).	2026-04-20 07:14:33 -07:00
kshitijk4poor	e388910fe6	fix(tui): make mac copy use pbcopy	2026-04-20 07:14:33 -07:00
kshitijk4poor	1d0b94a1b9	fix(tui): reserve control on macOS	2026-04-20 07:14:33 -07:00
kshitijk4poor	88396698ea	fix(tui): enable clipboard hotkeys in mac input fields	2026-04-20 07:14:33 -07:00
kshitijk4poor	c3af012a35	fix(tui): restore clipboard hotkeys in clarify mode	2026-04-20 07:14:33 -07:00
kshitijk4poor	8c9fdedaf5	fix(tui): use command shortcuts on macOS Make the Ink TUI match macOS keyboard expectations: Command handles copy and common editor/session shortcuts, while Control remains reserved for interrupt/cancel flows. Update the visible hotkey help to show platform-appropriate labels.	2026-04-20 07:14:33 -07:00
Austin Pickett	dcd763c284	Merge pull request #10125 from arihantsethia/feat/dashboard-skill-analytics feat: add skill analytics to the dashboard	2026-04-20 05:25:58 -07:00
Austin Pickett	720e1c65b2	Merge branch 'main' into feat/dashboard-skill-analytics	2026-04-20 05:25:49 -07:00
Mibayy	3273f301b7	fix(stt): map cloud-only model names to valid local size for faster-whisper (#2544) Cherry-picked from PR #2545 by @Mibayy. The setup wizard could leave stt.model: "whisper-1" in config.yaml. When using the local faster-whisper provider, this crashed with "Invalid model size 'whisper-1'". Voice messages were silently ignored. _normalize_local_model() now detects cloud-only names (whisper-1, gpt-4o-transcribe, etc.) and maps them to the default local model with a warning. Valid local sizes (tiny, base, small, medium, large-v3) pass through unchanged. - Renamed _normalize_local_command_model -> _normalize_local_model (backward-compat wrapper preserved) - 6 new tests including integration test - Added lowercase AUTHOR_MAP alias for @Mibayy Closes #2544	2026-04-20 05:18:48 -07:00
Ruzzgar	0613f10def	fix(gateway): use persisted session origin for shutdown notifications Prefer session_store origin over _parse_session_key() for shutdown notifications. Fixes misrouting when chat identifiers contain colons (e.g. Matrix room IDs like !room123:example.org). Falls back to session-key parsing when no persisted origin exists. Co-authored-by: Ruzzgar <ruzzgarcn@gmail.com> Ref: #12766	2026-04-20 05:15:54 -07:00
Teknium	9725b452a1	fix: extract _repair_tool_call_arguments helper, add tests, bound loop Follow-up for PR #12252 salvage: - Extract 75-line inline repair block to _repair_tool_call_arguments() module-level helper for testability and readability - Remove redundant 'import re as _re' (re already imported at line 33) - Bound the while-True excess-delimiter removal loop to 50 iterations - Add 17 tests covering all 6 repair stages - Add sirEven to AUTHOR_MAP in release.py	2026-04-20 05:12:55 -07:00
Severin Bretscher	9eeaaa4f1b	fix(agent): repair malformed tool_call arguments before API send Cherry-picked from PR #12252 by @sirEven. Models like GLM-5.1 via Ollama can produce malformed tool_call arguments (truncated JSON, trailing commas, Python None). The existing except Exception: pass silently passes broken args to the API, which rejects them with HTTP 400, crashing the session. Adds a multi-stage repair pipeline at the pre-send normalization point: 1. Empty/whitespace-only → {} 2. Python None literal → {} 3. Strip trailing commas 4. Auto-close unclosed brackets 5. Remove excess closing delimiters 6. Last resort: replace with {} (logged at WARNING)	2026-04-20 05:12:55 -07:00
Sanjays2402	570f8bab8f	fix(compression): exclude completion tokens from compression trigger (#12026) Cherry-picked from PR #12481 by @Sanjays2402. Reasoning models (GLM-5.1, QwQ, DeepSeek R1) inflate completion_tokens with internal thinking tokens. The compression trigger summed prompt_tokens + completion_tokens, causing premature compression at ~42% actual context usage instead of the configured 50% threshold. Now uses only prompt_tokens — completion tokens don't consume context window space for the next API call. - 3 new regression tests - Added AUTHOR_MAP entry for @Sanjays2402 Closes #12026	2026-04-20 05:12:10 -07:00
Teknium	42c30985c7	fix: enable plugins in config.yaml for lazy-discovery tests The opt-in-by-default change (`70111eea`) requires plugins to be listed in plugins.enabled. The cherry-picked test fixtures didn't write this config, so two tests failed on current main.	2026-04-20 05:11:39 -07:00
Stephen Schoettler	a5e368ebfb	fix: publish plugin slash commands in Telegram menu - discover plugin commands before building Telegram command menus - make plugin command and context engine accessors lazy-load plugins - add regression coverage for Telegram menu and plugin lookup paths	2026-04-20 05:11:39 -07:00
Teknium	34ae13e6ed	chore: add jplew to AUTHOR_MAP	2026-04-20 05:10:23 -07:00
JP Lew	9fdfb09aed	fix(telegram): cache inbound videos and accept mp4 uploads	2026-04-20 05:10:23 -07:00
Junass1	aebf32229b	fix(session_search): restore same-session context when message ids are interleaved Replaces global id +/- 1 context lookup with CTE-based same-session neighbor queries. When multiple sessions write concurrently, id adjacency does not imply session adjacency — the old query missed real neighbors. Co-authored-by: Junass1 <ysfalweshcan@gmail.com>	2026-04-20 05:10:03 -07:00
PStarH	00192d51f1	fix(install): quote PYTHON_PATH and UV_CMD for paths with spaces on macOS (#10009) Cherry-picked from PR #10019 by @PStarH. On macOS, uv stores Python in ~/Library/Application Support/uv/... which contains a space. Unquoted $PYTHON_PATH and $UV_CMD caused word-splitting under set -e, silently aborting install.sh. Quotes all variable expansions in check_python(): - "$PYTHON_PATH" in command invocations - "$UV_CMD" in uv calls - Outer quotes on $(...) assignments Closes #10009	2026-04-20 05:03:14 -07:00
sprmn24	ed76185c15	feat(whatsapp): implement send_voice for audio message delivery WhatsApp already receives incoming voice messages (audio/ogg via the bridge) but lacked a send_voice implementation, so TTS and audio responses fell back to the base class send_image path instead of being delivered as native audio messages. Route send_voice through the existing _send_media_to_bridge helper with media_type='audio', matching the pattern used by send_video and send_document.	2026-04-20 05:00:30 -07:00
Jason	23b81ab243	fix(cli): send User-Agent in /v1/models probe to pass Cloudflare 1010 Custom Claude proxies fronted by Cloudflare with Browser Integrity Check enabled (e.g. `packyapi.com`) reject requests with the default `Python-urllib/*` signature, returning HTTP 403 "error code: 1010". `probe_api_models` swallowed that in its blanket `except Exception: continue`, so `validate_requested_model` returned the misleading "Could not reach the <provider> API to validate `<model>`" error even though the endpoint is reachable and lists the requested model. Advertise the probe request as `hermes-cli/<version>` so Cloudflare treats it as a first-party client. This mirrors the pattern already used by `agent/gemini_native_adapter.py` and `agent/anthropic_adapter.py`, which set a descriptive UA for the same reason. Reproduction (pre-fix): python3 -c " import urllib.request req = urllib.request.Request( 'https://www.packyapi.com/v1/models', headers={'Authorization': 'Bearer sk-...'}) urllib.request.urlopen(req).read() " urllib.error.HTTPError: HTTP Error 403: Forbidden (body: b'error code: 1010') Any non-urllib UA (Mozilla, curl, reqwest) returns 200 with the OpenAI-compatible models listing. Tested on macOS (Python 3.11). No cross-platform concerns — the change is a single header addition to an existing `urllib.request.Request`.	2026-04-20 04:56:30 -07:00
houguokun	6cdab70320	fix(batch_runner): mark discarded no-reasoning prompts as completed (#9950) Cherry-picked from PR #10005 by @houziershi. Discarded prompts (has_any_reasoning=False) were skipped by `continue` before being added to completed_in_batch. On --resume they were retried forever. Now they are added to completed_in_batch before the continue. - Added AUTHOR_MAP entry for @houziershi Closes #9950	2026-04-20 04:56:06 -07:00
Teknium	7242afaa5f	chore: defer WhatsApp bridge install to first use (#12992) Remove eager npm install of @whiskeysockets/baileys during install.sh, install.ps1, and Docker build. The bridge deps are already installed on-demand by `hermes whatsapp` (Step 4 checks for node_modules and runs npm install if missing), so there is no need to pay the cost at initial install for users who never use WhatsApp.	2026-04-20 04:55:33 -07:00
luyao618	2cdae233e2	fix(config): validate providers config entries — reject non-URL base, accept camelCase aliases (#9332) Cherry-picked from PR #9359 by @luyao618. - Accept camelCase aliases (apiKey, baseUrl, apiMode, keyEnv, defaultModel, contextLength, rateLimitDelay) with auto-mapping to snake_case + warning - Validate URL field values with urlparse (scheme + netloc check) — reject non-URL strings like 'openai-reverse-proxy' that were silently accepted - Warn on unknown keys in provider config entries - Re-order URL field priority: base_url > url > api (was api > url > base_url) - 12 new tests covering all scenarios Closes #9332	2026-04-20 04:52:50 -07:00
kshitijk4poor	bc2559c44d	fix: remove codex spark model support Drop gpt-5.3-codex-spark from Codex forward-compat synthesis, provider catalogs, and context metadata now that the API no longer supports it.	2026-04-20 04:51:44 -07:00
Teknium	70111eea24	feat(plugins): make all plugins opt-in by default Plugins now require explicit consent to load. Discovery still finds every plugin — user-installed, bundled, and pip — so they all show up in `hermes plugins` and `/plugins`, but the loader only instantiates plugins whose name appears in `plugins.enabled` in config.yaml. This removes the previous ambient-execution risk where a newly-installed or bundled plugin could register hooks, tools, and commands on first run without the user opting in. The three-state model is now explicit: enabled — in plugins.enabled, loads on next session disabled — in plugins.disabled, never loads (wins over enabled) not enabled — discovered but never opted in (default for new installs) `hermes plugins install <repo>` prompts "Enable 'name' now? [y/N]" (defaults to no). New `--enable` / `--no-enable` flags skip the prompt for scripted installs. `hermes plugins enable/disable` manage both lists so a disabled plugin stays explicitly off even if something later adds it to enabled. Config migration (schema v20 → v21): existing user plugins already installed under ~/.hermes/plugins/ (minus anything in plugins.disabled) are auto-grandfathered into plugins.enabled so upgrades don't silently break working setups. Bundled plugins are NOT grandfathered — even existing users have to opt in explicitly. Also: HERMES_DISABLE_BUNDLED_PLUGINS env var removed (redundant with opt-in default), cmd_list now shows bundled + user plugins together with their three-state status, interactive UI tags bundled entries [bundled], docs updated across plugins.md and built-in-plugins.md. Validation: 442 plugin/config tests pass. E2E: fresh install discovers disk-cleanup but does not load it; `hermes plugins enable disk-cleanup` activates hooks; migration grandfathers existing user plugins correctly while leaving bundled plugins off.	2026-04-20 04:46:45 -07:00
Teknium	a25c8c6a56	docs(plugins): rename disk-guardian to disk-cleanup + bundled-plugins docs The original name was cute but non-obvious; disk-cleanup says what it does. Plugin directory, script, state path, log lines, slash command, and test module all renamed. No user-visible state exists yet, so no migration path is needed. New website page "Built-in Plugins" documents the <repo>/plugins/<name>/ source, how discovery interacts with user/project plugins, the HERMES_DISABLE_BUNDLED_PLUGINS escape hatch, disk-cleanup's hook behaviour and deletion rules, and guidance on when a plugin belongs bundled vs. user-installable. Added to the Features → Core sidebar next to the main Plugins page, with a cross-reference from plugins.md.	2026-04-20 04:46:45 -07:00
Teknium	1386e277e5	feat(plugins): convert disk-guardian skill into a bundled plugin Rewires @LVT382009's disk-guardian (PR #12212) from a skill-plus-script into a plugin that runs entirely via hooks — no agent compliance needed. - post_tool_call hook auto-tracks files created by write_file / terminal / patch when they match test_/tmp_/.test. patterns under HERMES_HOME - on_session_end hook runs cmd_quick cleanup when test files were auto-tracked during the turn; stays quiet otherwise - /disk-guardian slash command keeps status / dry-run / quick / deep / track / forget for manual use - Deterministic cleanup rules, path safety, atomic writes, and audit logging preserved from the original contribution - Protect well-known top-level state dirs (logs/, memories/, sessions/, cron/, cache/, etc.) from empty-dir removal so fresh installs don't get gutted on first session end The plugin system gains a bundled-plugin discovery path (<repo>/plugins/ <name>/) alongside user/project/entry-point sources. Memory and context_engine subdirs are skipped — they keep their own discovery paths. HERMES_DISABLE_BUNDLED_PLUGINS=1 suppresses the scan; the test conftest sets it by default so existing plugin tests stay clean. Co-authored-by: LVT382009 <levantam.98.2324@gmail.com>	2026-04-20 04:46:45 -07:00
Nox	32e6baea31	Update disk_guardian.py	2026-04-20 04:46:45 -07:00
Nox	aeecf06dee	Update SKILL.md	2026-04-20 04:46:45 -07:00
LVT382009	068b224887	feat(skills): add disk-guardian — autonomous cleanup of Hermes temp files and disk optimization	2026-04-20 04:46:45 -07:00
Teknium	9a57aa2b1f	fix(docs): unbreak docs-site-checks — ascii-guard diagram + MDX `<1%` (#12984) * fix(docs): unbreak ascii-guard lint on github-pr-review-agent diagram The intro diagram used 4 side-by-side boxes in one row. ascii-guard can't parse that layout — it reads the whole thing as one 80-wide outer box and flags the inner box borders at columns 17/39/60 as 'extra characters after right border'. Per the ascii-guard-lint-fixing skill, the only fix is to merge into a single outer box. Rewritten as one 69-char outer box with four labeled regions separated by arrows. Same semantic content, lint-clean. Was blocking docs-site-checks CI as 'action_required' across multiple PRs (see e.g. run 24661820677). * fix(docs): backtick-wrap `<1%` to avoid MDX JSX parse error Docusaurus MDX parses `<1%` as the start of a JSX tag, but `1` isn't a valid tag-name start so compilation fails with 'Unexpected character `1` (U+0031) before name'. Wrap in backticks so MDX treats it as literal code text. Found by running Build Docusaurus step on the PR that unblocked the ascii-guard step; full docs tree scanned for other `<digit>` patterns outside backticks/fences, only this one was unsafe.	2026-04-20 04:29:02 -07:00
Teknium	e04a55f37f	fix(xurl skill): fix default app pitfall in setup, add agent detection and troubleshooting (#12985) - Setup step 5: add --app my-app to xurl auth oauth2 so token binds to the correct app - Setup step 6: add xurl auth default my-app to set the named app as default - Add pitfall callout explaining the empty 'default' profile trap - Agent Workflow step 2: detect when default app has no oauth2 tokens - Add Troubleshooting table with common xurl issues (auth errors, unauthorized_client, enrollment, credits, media upload, dashboard UI bug) - Bump to v1.1.0 Community report by @0xHarryWeb3	2026-04-20 04:27:57 -07:00
Teknium	f683132c1d	feat(api-server): inline image inputs on /v1/chat/completions and /v1/responses (#12969) OpenAI-compatible clients (Open WebUI, LobeChat, etc.) can now send vision requests to the API server. Both endpoints accept the canonical OpenAI multimodal shape: Chat Completions: {type: text|image_url, image_url: {url, detail?}} Responses: {type: input_text|input_image, image_url: <str>, detail?} The server validates and converts both into a single internal shape that the existing agent pipeline already handles (Anthropic adapter converts, OpenAI-wire providers pass through). Remote http(s) URLs and data:image/* URLs are supported. Uploaded files (file, input_file, file_id) and non-image data: URLs are rejected with 400 unsupported_content_type. Changes: - gateway/platforms/api_server.py - _normalize_multimodal_content(): validates + normalizes both Chat and Responses content shapes. Returns a plain string for text-only content (preserves prompt-cache behavior on existing callers) or a canonical [{type:text|image_url,...}] list when images are present. - _content_has_visible_payload(): replaces the bare truthy check so a user turn with only an image no longer rejects as 'No user message'. - _handle_chat_completions and _handle_responses both call the new helper for user/assistant content; system messages continue to flatten to text. - Codex conversation_history, input[], and inline history paths all share the same validator. No duplicated normalizers. - run_agent.py - _summarize_user_message_for_log(): produces a short string summary ('[1 image] describe this') from list content for logging, spinner previews, and trajectory writes. Fixes AttributeError when list user_message hit user_message[:80] + '...' / .replace(). - _chat_content_to_responses_parts(): module-level helper that converts chat-style multimodal content to Responses 'input_text'/'input_image' parts. Used in _chat_messages_to_responses_input for Codex routing. - _preflight_codex_input_items() now validates and passes through list content parts for user/assistant messages instead of stringifying. - tests/gateway/test_api_server_multimodal.py (new, 38 tests) - Unit coverage for _normalize_multimodal_content, including both part formats, data URL gating, and all reject paths. - Real aiohttp HTTP integration on /v1/chat/completions and /v1/responses verifying multimodal payloads reach _run_agent intact. - 400 coverage for file / input_file / non-image data URL. - tests/run_agent/test_run_agent_multimodal_prologue.py (new) - Regression coverage for the prologue no-crash contract. - _chat_content_to_responses_parts round-trip coverage. - website/docs/user-guide/features/api-server.md - Inline image examples for both endpoints. - Updated Limitations: files still unsupported, images now supported. Validated live against openrouter/anthropic/claude-opus-4.6: POST /v1/chat/completions → 200, vision-accurate description POST /v1/responses → 200, same image, clean output_text POST /v1/chat/completions [file] → 400 unsupported_content_type POST /v1/responses [input_file] → 400 unsupported_content_type POST /v1/responses [non-image data URL] → 400 unsupported_content_type Closes #5621, #8253, #4046, #6632. Co-authored-by: Paul Bergeron <paul@gamma.app> Co-authored-by: zhangxicen <zhangxicen@example.com> Co-authored-by: Manuel Schipper <manuelschipper@users.noreply.github.com> Co-authored-by: pradeep7127 <pradeep7127@users.noreply.github.com>	2026-04-20 04:16:13 -07:00
Teknium	3218d58fc5	chore(release): add Swift42 to AUTHOR_MAP	2026-04-20 04:15:04 -07:00
Swift42	b68bc0ad33	Update SKILL.md Use -q instead of the deprecated/not working -k	2026-04-20 04:15:04 -07:00
Swift42	d41ca86f74	Update duckduckgo.sh	2026-04-20 04:15:04 -07:00
Teknium	04068c5891	feat(plugins): add transform_tool_result hook for generic tool-result rewriting (#12972) Closes #8933 more fully, extending the per-tool transform_terminal_output hook from #12929 to a generic seam that fires after every tool dispatch. Plugins can rewrite any tool's result string (normalize formats, redact fields, summarize verbose output) without wrapping individual tools. Changes - hermes_cli/plugins.py: add "transform_tool_result" to VALID_HOOKS - model_tools.py: invoke the hook in handle_function_call after post_tool_call (which remains observational); first valid str return replaces the result; fail-open - tests/test_transform_tool_result_hook.py: 9 new tests covering no-op, None return, non-string return, first-match wins, kwargs, hook exception fallback, post_tool_call observation invariant, ordering vs post_tool_call, and an end-to-end real-plugin integration - tests/hermes_cli/test_plugins.py: assert new hook in VALID_HOOKS - tests/test_model_tools.py: extend the hook-call-sequence assertion to include the new hook Design - transform_tool_result runs AFTER post_tool_call so observers always see the original (untransformed) result. This keeps post_tool_call's observational contract. - transform_terminal_output (from #12929) still runs earlier, inside terminal_tool, so plugins can canonicalize BEFORE the 50k truncation drops middle content. Both hooks coexist; they target different layers.	2026-04-20 03:48:08 -07:00
Teknium	9f22977fc0	chore(release): add haileymarshall to AUTHOR_MAP	2026-04-20 03:10:19 -07:00
haileymarshall	6b408e131c	fix(gateway): pass session_key (not session_id) to active-process check during prune SessionStore.prune_old_entries was calling self._has_active_processes_fn(entry.session_id) but the callback wired up in gateway/run.py is process_registry.has_active_for_session, which compares against session_key, not session_id. Every other caller in session.py (_is_session_expired, _should_reset) already passes session_key, so prune was the only outlier — and because session_id and session_key live in different namespaces, the guard never fired. Result in production: sessions with live background processes (queued cron output, detached agents, long-running Bash) were pruned out of _entries despite the docstring promising they'd be preserved. When the process finished and tried to deliver output, the session_key to session_id mapping was gone and the work was effectively orphaned. Also update the existing test_prune_skips_entries_with_active_processes, which was checking the wrong interface (its mock callback took session_id so it agreed with the buggy implementation). The test now uses a session_key-based mock, matching the production callback's real contract, and a new regression guard test pins the behaviour. Swallowed exceptions inside the prune loop now log at debug level instead of silently disappearing.	2026-04-20 03:10:19 -07:00
Teknium	eba7c869bb	fix(steer): drain /steer between individual tool calls, not at batch end (#12959) Previously, /steer text was only injected after an entire tool batch completed (_execute_tool_calls_sequential/concurrent returned). If the batch had a long-running tool (delegate_task, terminal build), the steer waited for ALL tools to finish before landing — functionally identical to /queue from the user's perspective. Now _apply_pending_steer_to_tool_results() is called after EACH individual tool result is appended to messages, in both the sequential and concurrent paths. A steer arriving during Tool 1 lands in Tool 1's result before Tool 2 starts executing. Also handles leftover steers in the gateway: if a steer arrives during the final API call (no tool batch to drain into), it's now delivered as the next user turn instead of being silently dropped. Fixes user report from Utku.	2026-04-20 03:08:04 -07:00
Teknium	22efc81cd7	fix(sessions): surface compression tips in session lists and resume lookups (#12960) After a conversation gets compressed, run_agent's _compress_context ends the parent session and creates a continuation child with the same logical conversation. Every list affordance in the codebase (list_sessions_rich with its default include_children=False, plus the CLI/TUI/gateway/ACP surfaces on top of it) hid those children, and resume-by-ID on the old root landed on a dead parent with no messages. Fix: lineage-aware projection on the read path. - hermes_state.py::get_compression_tip(session_id) — walk the chain forward using parent.end_reason='compression' AND child.started_at >= parent.ended_at. The timing guard separates compression continuations from delegate subagents (which were created while the parent was still live) without needing a schema migration. - hermes_state.py::list_sessions_rich — new project_compression_tips flag (default True). For each compressed root in the result, replace surfaced fields (id, ended_at, end_reason, message_count, tool_call_count, title, last_active, preview, model, system_prompt) with the tip's values. Preserve the root's started_at so chronological ordering stays stable. Projected rows carry _lineage_root_id for downstream consumers. Pass False to get raw roots (admin/debug). - hermes_cli/main.py::_resolve_session_by_name_or_id — project forward after ID/title resolution, so users who remember an old root ID (from notes, or from exit summaries produced before the sibling Bug 1 fix) land on the live tip. All downstream callers of list_sessions_rich benefit automatically: - cli.py _list_recent_sessions (/resume, show_history affordance) - hermes_cli/main.py sessions list / sessions browse - tui_gateway session.list picker - gateway/run.py /resume titled session listing - tools/session_search_tool.py - acp_adapter/session.py Tests: 7 new in TestCompressionChainProjection covering full-chain walks, delegate-child exclusion, tip surfacing with lineage tracking, raw-root mode, chronological ordering, and broken-chain graceful fallback. Verified live: ran a real _compress_context on a live Gemini-backed session, confirmed the DB split, then verified - db.list_sessions_rich surfaces tip with _lineage_root_id set - hermes sessions list shows the tip, not the ended parent - _resolve_session_by_name_or_id(old_root_id) -> tip_id - _resolve_last_session -> tip_id Addresses #10373.	2026-04-20 03:07:51 -07:00
Teknium	0cff992f0a	chore(release): add alexzhu0 to AUTHOR_MAP	2026-04-20 03:07:32 -07:00
Alexazhu	64a1368210	fix(tools): keep SSH ControlMaster socket path under macOS 104-byte limit On macOS, Unix domain socket paths are capped at 104 bytes (sun_path). SSH appends a 16-byte random suffix to the ControlPath when operating in ControlMaster mode. With an IPv6 host embedded literally in the filename and a deeply-nested macOS $TMPDIR like /var/folders/XX/YYYYYYYYYYYY/T/, the full path reliably exceeds the limit — every terminal/file-op tool call then fails immediately with ``unix_listener: path "…" too long for Unix domain socket``. Swap the ``user@host:port.sock`` filename for a sha256-derived 16-char hex digest. The digest is deterministic for a given (user, host, port) triple, so ControlMaster reuse across reconnects is preserved, and the full path fits comfortably under the limit even after SSH's random suffix. Collision space is 2^64 — effectively unreachable for the handful of concurrent connections any single Hermes process holds. Regression tests cover: path length under realistic macOS $TMPDIR with the IPv6 host from the issue report, determinism for reconnects, and distinctness across different (user, host, port) triples. Closes #11840	2026-04-20 03:07:32 -07:00
Teknium	649ef5c8f1	chore(release): add sjz-ks to AUTHOR_MAP	2026-04-20 03:04:06 -07:00
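
Illustration for commit 64a1368210 (SSH ControlMaster socket path): a minimal sketch of replacing the `user@host:port.sock` filename with a sha256-derived 16-character hex digest, as the commit message describes. The function name `control_socket_path`, the exact string that gets hashed, and the base directory argument are assumptions made for this example; the actual Hermes code may differ.

```python
import hashlib
import os
import tempfile
from typing import Optional

MACOS_SUN_PATH_LIMIT = 104  # bytes, the sun_path cap cited in the commit message
SSH_SUFFIX_LEN = 16         # random suffix length cited in the commit message


def control_socket_path(user: str, host: str, port: int,
                        base_dir: Optional[str] = None) -> str:
    """Build a short, deterministic ControlPath for a (user, host, port) triple.

    Hypothetical helper: hashing "user@host:port" is an assumption; the commit
    only states that the digest is derived from the triple via sha256.
    """
    base = base_dir if base_dir is not None else tempfile.gettempdir()
    key = f"{user}@{host}:{port}".encode("utf-8")
    # 16 hex characters = 64 bits, matching the 2^64 collision space in the commit.
    digest = hashlib.sha256(key).hexdigest()[:16]
    return os.path.join(base, f"{digest}.sock")
```

The filename is always 21 characters (16 hex digits plus `.sock`) regardless of how long the host literal is, and the same triple always yields the same path, so an existing master connection can be found again on reconnect.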


5074 commits
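
Illustration for commits 9eeaaa4f1b and 9725b452a1 (tool_call argument repair): a rough sketch of the six repair stages listed in the commit message, with the stage 5 loop bounded to 50 iterations as in the follow-up. This is not the project's `_repair_tool_call_arguments()`; the function and helper names below, the regexes, and the choice to accept any value that parses as JSON are assumptions for the example.

```python
import json
import logging
import re

MAX_STRIP_ITERATIONS = 50  # loop bound mentioned in the follow-up commit


def _parses(text):
    """True if text is valid JSON (assumption: any JSON value is accepted)."""
    try:
        json.loads(text)
        return True
    except ValueError:
        return False


def _missing_closers(text):
    """Return the quote/brackets needed to close what text left open."""
    stack, in_string, escaped = [], False, False
    for ch in text:
        if in_string:
            if escaped:
                escaped = False
            elif ch == "\\":
                escaped = True
            elif ch == '"':
                in_string = False
        elif ch == '"':
            in_string = True
        elif ch in "{[":
            stack.append("}" if ch == "{" else "]")
        elif ch in "}]" and stack and stack[-1] == ch:
            stack.pop()
    return ('"' if in_string else "") + "".join(reversed(stack))


def repair_tool_call_arguments(raw):
    """Hypothetical sketch: return a JSON string, falling back to '{}'."""
    text = (raw or "").strip()
    if not text or text == "None":           # stages 1 and 2
        return "{}"
    if _parses(text):
        return text
    # Stage 3: strip trailing commas (naive: may also touch commas in strings).
    text = re.sub(r",\s*([}\]])", r"\1", text)
    text = re.sub(r",\s*$", "", text)
    if _parses(text):
        return text
    text += _missing_closers(text)            # stage 4: auto-close brackets
    if _parses(text):
        return text
    for _ in range(MAX_STRIP_ITERATIONS):     # stage 5: drop excess closers
        if not text or text[-1] not in "}]":
            break
        text = text[:-1]
        if _parses(text):
            return text
    logging.warning("could not repair tool_call arguments; using {}")  # stage 6
    return "{}"
```

Each stage re-checks with `json.loads` before moving on, so well-formed arguments pass through untouched and the `{}` fallback is only reached when every cheaper repair has failed.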