hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-17 09:41:58 +00:00

Author	SHA1	Message	Date
Teknium	a1f51feb72	fix(telegram): avoid rich final duplicate previews (#46206 )	2026-06-14 11:13:38 -07:00
kshitij	6c34088a17	Merge pull request #46237 from kshitijk4poor/salvage/46095-cross-process-cache fix(gateway): cross-process agent-cache coherence (#45966) + preserve prompt caching	2026-06-14 23:05:17 +05:30
kshitij	fc2b8b3d31	Merge pull request #46236 from kshitijk4poor/salvage/disabled-skills-union fix(skills): platform-disabled skills still appear in <available_skills> + unify all resolution sites (#46201)	2026-06-14 23:00:11 +05:30
kshitijk4poor	3bc4a2ff78	fix(gateway): re-baseline agent-cache message_count after each turn The #45966 cross-process coherence guard snapshots a session's on-disk message_count next to the cached agent and rebuilds the agent when the count changes. But the snapshot is taken at agent-BUILD time — before the turn writes its own user + assistant (+ tool) rows — and the cache entry is never rewritten on a reuse. So this process's OWN turn grows message_count, and the very next turn sees a mismatch and rebuilds the agent. That happens every turn, for every conversation, silently destroying the per-conversation prompt caching the cache exists to protect (AGENTS.md: prompt caching is sacred). Add _refresh_agent_cache_message_count(): after a turn completes and the agent has flushed its rows to the SessionDB, re-baseline the stored count to the now-current value. The guard then fires ONLY when a DIFFERENT process changes the transcript — preserving the #45966 fix while keeping the cache warm for normal single-process operation. Tests drive the real SessionDB + the real guard condition: 5 consecutive same-process turns now all REUSE the cached agent (0 before the fix); a cross-process append still invalidates; and the re-baseline is fail-safe (no DB, falsy session_id, raising probe, legacy 2-tuple, pending sentinel all no-op).	2026-06-14 22:58:55 +05:30
kshitijk4poor	ce19fdb7ce	fix(skills): apply global\|platform disabled union to all resolution sites The platform-disabled fix landed only in agent.skill_utils.get_disabled_skill_names (the system-prompt path). Two sibling resolvers still used the old replace-not-union semantics, so the same skill could be hidden from the <available_skills> prompt yet reported enabled elsewhere: - hermes_cli/skills_config.get_disabled_skills (the 'hermes skills config' UI) returned only the platform list, so a globally-disabled skill showed as enabled (unchecked) on any platform with a platform_disabled entry. - tools/skills_tool._is_skill_disabled (gates whether skill_view loads a skill) ignored the global list when a platform list existed, so a globally-disabled skill could still be loaded on such a platform. Both now union the global list with the platform list, matching get_disabled_skill_names. An explicit empty platform list no longer re-enables a globally-disabled skill — global disables hold on every platform (#46201). Also: fix the now-stale get_disabled_skill_names docstring and drop a stray blank line. Regression tests added for both sites (proven to fail on the old replace semantics).	2026-06-14 22:54:54 +05:30
kyssta-exe	7f245b0035	fix(gateway): invalidate agent cache on cross-process session writes (#45966 ) (cherry picked from commit `6d0f79defe`)	2026-06-14 22:54:39 +05:30
ibrahim özsaraç	7bbe7024c2	fix: filter platform-disabled skills from <available_skills> prompt (#46201 ) build_skills_system_prompt() already resolved _platform_hint but called get_disabled_skill_names() with no argument, so the resolved platform never reached the filter and the prompt cache_key varied by platform while the disabled set did not. Pass _platform_hint or None. get_disabled_skill_names() also fully ignored the global 'disabled' list once a platform-specific list was found. Return the union (global \| platform) so a globally-disabled skill stays disabled on every platform. Salvaged from #46203 by @iborazzi; the unrelated apps/shared/tsconfig.json ES2023 bump is intentionally dropped (one concern per PR).	2026-06-14 22:52:57 +05:30
Teknium	7433d5f0eb	fix(gateway): scope early duplicate guard to pid file	2026-06-14 08:42:06 -07:00
konsisumer	1436793051	fix(gateway): block shell gateway run when a service supervises the profile	2026-06-14 08:42:06 -07:00
brooklyn!	08d89e7aba	fix(desktop): limit thinking shimmer to the disclosure label (#46197 ) Reasoning body text was inheriting tw-shimmer while streaming even though the "Thinking" header already pulses — keep shimmer on the label only.	2026-06-14 10:14:58 -05:00
Teknium	2c174bce24	fix(gateway): preserve new input on interrupted replay cleanup	2026-06-14 05:10:39 -07:00
Arnaud L	5191c1c2ce	fix(gateway): stop replaying interrupted tool-call tails and auto-continue notes Three changes to prevent infinite re-execution loops when a user sends a new message while long-running tools are executing: 1. Filter interrupted tool results in _build_gateway_agent_history: skip tool messages whose content contains [Command interrupted] or exit_code 130 — they represent partial execution, not valid results. 2. Don't replay auto-continue notes as user messages: detect gateway-injected [System note: ...] / [IMPORTANT: ...] prefixes and skip them in _build_gateway_agent_history so the LLM doesn't see 4+ messages from 'the user' telling it to finish old work. 3. Fix the wording: the system note now instructs the model to address the user's NEW message FIRST, IGNORE pending results, and NOT re-execute old tool calls. Closes #45230	2026-06-14 05:10:39 -07:00
Teknium	0f3670ba79	chore(release): map Diyoncrz18 author email	2026-06-14 04:52:54 -07:00
Diyon18	288f7026e3	fix(messaging): correct Weixin personal account labeling	2026-06-14 04:52:54 -07:00
Teknium	efbe1635dd	fix(gateway): include replied-to media attachments (#46107 )	2026-06-14 04:51:50 -07:00
Teknium	a27d7e68cc	fix(mcp): block suspicious stdio configs before probe (#46112 )	2026-06-14 04:46:54 -07:00
Teknium	13a1bd0f83	perf(model-metadata): persist OpenRouter metadata cache (#46114 )	2026-06-14 04:45:46 -07:00
Teknium	0e22bf6439	docs(gateway): document exact silence tokens (#46105 )	2026-06-14 04:37:18 -07:00
Teknium	972a9885ee	fix(mcp): block exfil-shaped stdio server configs (#46083 )	2026-06-14 04:24:14 -07:00
Teknium	9459057d7f	fix(telegram): guard rich details math crash (#46102 )	2026-06-14 04:22:22 -07:00
Teknium	cf7d5932f8	fix(email): make IPv4 SMTP fallback use supported sockets	2026-06-14 04:16:26 -07:00
liuhao1024	04d4471d79	fix(email): use SMTP_SSL for port 465 and fall back to IPv4 on timeout Port 465 expects implicit TLS (SMTP_SSL) from the first byte. The email adapter always used SMTP() + starttls(), which is correct for port 587 but hangs/fails on port 465 providers (e.g., Swiss ISPs). Additionally, when the SMTP host has AAAA DNS records but IPv6 is unreachable, socket.create_connection() tries IPv6 first and hangs until timeout. Add an IPv4 fallback via AF_INET socket. Extract _connect_smtp() helper to consolidate the 4 duplicate SMTP connection sites into a single method with correct protocol selection and IPv6 fallback logic.	2026-06-14 04:16:26 -07:00
Teknium	5105c3651a	perf(api-server): normalize chat content linearly (#46079 )	2026-06-14 03:25:49 -07:00
Aldo	293c04fef6	fix(gateway): suppress exact silence tokens without mutating history	2026-06-14 03:25:08 -07:00
Teknium	10bad2faf1	fix(gateway): serialize startup auto-resume before inbound (#46074 ) Gateway startup now queues real inbound messages until restart-interrupted auto-resume turns have completed, preventing duplicate agents for the same session after a restart.	2026-06-14 03:21:06 -07:00
Teknium	2b4873f7fb	fix(agent): persist repaired-turn responses (#46071 )	2026-06-14 03:20:25 -07:00
Teknium	723c2331bd	fix: make profile subprocess HOME policy explicit	2026-06-14 03:20:21 -07:00
zccyman	b00060ce54	fix(agent): expose HERMES_REAL_HOME in subprocess envs for profile isolation When profile isolation activates ({HERMES_HOME}/home/ exists), child processes receive HOME={HERMES_HOME}/home/ for tool config isolation (git, ssh, gh). However, scripts using Path.home() to locate ~/.hermes/ would incorrectly resolve to the isolated profile home, breaking helpers that rely on the real user home directory. New get_real_home() helper in hermes_constants resolves the actual user home independently of profile isolation. All four subprocess spawners now inject HERMES_REAL_HOME alongside the profile HOME: - tools/code_execution_tool.py (execute_code) - tools/environments/local.py (terminal background, run_env) - agent/copilot_acp_client.py (Copilot ACP) Child scripts can now use: Path(os.environ.get("HERMES_REAL_HOME", os.environ.get("HOME", ""))) to reliably find the real user home regardless of profile isolation. Closes #25114	2026-06-14 03:20:21 -07:00
Teknium	0428945b5b	fix(desktop): keep profile homes out of bootstrap (#46073 )	2026-06-14 03:08:52 -07:00
Teknium	afc8615509	perf(webhook): prune request caches incrementally (#46065 )	2026-06-14 02:40:54 -07:00
LeonSGP43	89bdb1e546	fix: read dashboard spa assets as utf-8 Co-Authored-By: Paperclip <noreply@paperclip.ing>	2026-06-14 02:31:04 -07:00
Teknium	7b9dc7cd0a	test(gateway): align web profile wrapper expectation	2026-06-14 02:20:55 -07:00
helix4u	d76a58bd15	fix(gateway): resolve sudo profile system installs	2026-06-14 02:20:55 -07:00
Teknium	1f5eef8093	test(tui): tolerate resume init kwargs in protocol tests	2026-06-14 02:15:33 -07:00
Teknium	9f33d673e9	fix(tui): persist resumed profile cwd updates to profile db	2026-06-14 02:15:33 -07:00
dsad	d842155da1	Keep resumed profile cwd scoped to profile DB	2026-06-14 02:15:33 -07:00
helix4u	4936a49a0c	fix(mcp): preserve loop during probes Some checks failed Deploy Site / deploy-vercel (push) Waiting to run Details Deploy Site / deploy-docs (push) Waiting to run Details Docker Build and Publish / build-amd64 (push) Waiting to run Details Docker Build and Publish / build-arm64 (push) Waiting to run Details Docker Build and Publish / merge (push) Blocked by required conditions Details Lint (ruff + ty) / ruff + ty diff (push) Waiting to run Details Lint (ruff + ty) / ruff enforcement (blocking) (push) Waiting to run Details Lint (ruff + ty) / Windows footguns (blocking) (push) Waiting to run Details Nix / nix (macos-latest) (push) Waiting to run Details Nix / nix (ubuntu-latest) (push) Waiting to run Details OSV-Scanner / Scan lockfiles (push) Waiting to run Details Tests / test (1) (push) Waiting to run Details Tests / test (2) (push) Waiting to run Details Tests / test (3) (push) Waiting to run Details Tests / test (4) (push) Waiting to run Details Tests / test (5) (push) Waiting to run Details Tests / test (6) (push) Waiting to run Details Tests / save-durations (push) Blocked by required conditions Details Tests / e2e (push) Waiting to run Details Typecheck / typecheck (apps/bootstrap-installer) (push) Waiting to run Details Typecheck / typecheck (apps/desktop) (push) Waiting to run Details Typecheck / typecheck (apps/shared) (push) Waiting to run Details Typecheck / typecheck (ui-tui) (push) Waiting to run Details Typecheck / typecheck (web) (push) Waiting to run Details uv.lock check / uv lock --check (push) Waiting to run Details Nix Lockfile Fix / auto-fix-main (push) Has been cancelled Details Nix Lockfile Fix / fix (push) Has been cancelled Details Build Skills Index / build-index (push) Has been cancelled Details Build Skills Index / trigger-deploy (push) Has been cancelled Details	2026-06-14 02:09:45 -07:00
helix4u	85e6232a07	fix(providers): support anthropic proxy v1 endpoints	2026-06-14 02:09:16 -07:00
Teknium	81e42335a1	fix(file-safety): relax user-write deny policy (#45947 ) Allow file tools to edit shell startup files, user package-manager configs, and Hermes control files that the user can already modify directly. Keep hard blocks for SSH keys, .env/OAuth token stores, mcp-tokens, pairing files, and system privilege files.	2026-06-14 02:07:32 -07:00
brooklyn!	526a1e24b5	Merge pull request #46029 from NousResearch/bb/summarize-gui fix(desktop): show summarizing indicator during auto-compaction	2026-06-14 02:53:14 -05:00
Brooklyn Nicholson	1eb13744b4	fix(desktop): polish compaction indicator and preserve scrollback Show a shimmering "Summarizing thread" label during auto-compaction, skip the post-turn hydrate when compaction fired so the live transcript does not collapse to the stored summary-only session.	2026-06-14 02:48:48 -05:00
brooklyn!	49dd91d682	fix(desktop): show copied checkmark on session Copy ID (#46030 ) Route sidebar Copy ID through CopyButton so dropdown and context menus get the same checkmark feedback as every other copy action.	2026-06-14 07:38:55 +00:00
Brooklyn Nicholson	715b691723	fix(desktop): show summarizing indicator during auto-compaction Auto-compression rewrites history mid-turn, which made long threads look like they reset. Re-tag the gateway lifecycle status as compacting and surface it in the desktop thread loading indicators.	2026-06-14 02:28:07 -05:00
brooklyn!	9cbb91abd3	fix(desktop): clarify UX — loading, enter-to-send, radio align (#46014 ) * fix(desktop): clarify enter-to-send and top-align choice radios Match the composer keyboard contract in clarify freeform answers and align choice-row radio dots to the start of wrapped labels. * fix(desktop): clarify loading spinner until request is ready Hold the clarify panel on a centered Loader2 until clarify.request arrives instead of showing disabled choices or a loading-question stub. * refactor(desktop): dedupe clarify shell and drop stale ready gates Extract the shared clarify panel wrapper and remove disabled-state checks that loading already makes unreachable.	2026-06-14 07:06:40 +00:00
kshitij	c8ad2ca997	Merge pull request #46013 from kshitijk4poor/salvage/refusal-content-filter fix(agent): surface model refusals as content_filter (salvage #43108 + edge-case fix)	2026-06-14 12:28:51 +05:30
kshitijk4poor	10bd01972b	refactor(agent): share the content_policy_blocked result builder + recovery hint The HTTP-200 refusal handler (finish_reason=content_filter) and the exception-path handler (a provider moderation error classified as content_policy_blocked) independently built the same terminal turn result — the same {final_response, messages, api_calls, completed:False, failed:True, error:'content_policy_blocked: ...'} dict — and ended their user-facing message with the same 'Try rephrasing... hermes fallback add' trailer, copied verbatim. The two copies could drift. Funnel both through a shared _content_policy_blocked_result() builder and a shared _CONTENT_POLICY_RECOVERY_HINT constant. Also collapse the HTTP-200 path's two near-identical with/without-explanation templates into one (compute the detail fragment once) and pass reason=FailoverReason.content_policy_blocked .value to the error hook instead of a hand-written string literal, matching the sibling hook call. Behavior-preserving: the provider/refusal lead-in wording stays distinct (a provider safety filter vs the model declining are genuinely different signals), the with-text and exception messages are byte-identical to before, and the no-explanation case only gains a paragraph break for consistency. Surfaced by the simplify-code reuse/quality reviewers. The efficiency reviewer's 'redundant normalize_response' flag was deliberately NOT applied: that branch is cold (refusal-only) and pure-CPU, and reusing the sibling-branch normalized locals would risk a NameError on the codex_responses path (which sets finish_reason without normalizing) — re-normalizing is the robust choice.	2026-06-14 12:19:19 +05:30
kshitijk4poor	12c84d6c77	fix(transports): only treat a refusal as terminal when it is the sole payload A chat-completions response that carries real text or tool calls alongside a `message.refusal` note is a normal, usable turn — the model did work. The prior logic flipped finish_reason to `content_filter` whenever a refusal string was present, so the conversation loop reframed a content-bearing turn as a failed safety refusal (failed=True) and buried the model's actual output inside the "model declined" template, or dropped tool calls entirely. Only promote to a terminal `content_filter` when the refusal is the sole payload (no visible text AND no tool calls). The refusal explanation is still recorded in provider_data in every case for observability. Refusal-only responses (the bug this feature targets) are unaffected and still surface terminally; the empty+refusal, bare content_filter passthrough, and no-refusal common cases are byte-identical to before. Updates the partial-content test to the corrected contract and adds a tool_calls-alongside-refusal regression guard.	2026-06-14 12:12:52 +05:30
SHL0MS	ab26541b9a	test(transports): lock in content_filter passthrough for OpenRouter OpenRouter (and every other OpenAI-compatible provider) uses the default chat_completions transport, so it is already covered by the refusal fix: an upstream Claude / moderation refusal arrives as finish_reason="content_filter" (often empty content, no message.refusal). Add a regression test asserting the transport passes that finish reason straight through to the loop's content_filter handler. (cherry picked from commit `60168a513b`)	2026-06-14 12:10:08 +05:30
SHL0MS	bb46bf8ce4	fix(agent): surface model refusals instead of retrying them as errors A Claude refusal (HTTP 200, stop_reason="refusal", empty content) was laundered into a generic retry loop and surfaced as a misleading "rate limited / invalid response" or "no content after retries" error, burning paid attempts reproducing a deterministic refusal. This hit two distinct paths: - Direct Anthropic (anthropic_messages): validate_response rejected the empty-content refusal before normalize_response mapped refusal -> content_filter, so it fell into the invalid-response retry loop. - Nous Portal / OpenAI-compatible (chat_completions): the portal surfaces a Claude refusal via message.refusal with empty content, which sailed past validation and died in the empty-response retry loop. Fix (one unified content_filter dispatch for all backends): - AnthropicTransport.validate_response: accept empty content when stop_reason == "refusal" so it flows to normalize_response. - ChatCompletionsTransport.normalize_response: promote message.refusal to content + a content_filter finish reason. - conversation_loop: handle finish_reason == "content_filter" - fire the api_request_error hook (content_policy_blocked), try a configured fallback once, else return a clear terminal refusal message. Never retry a deterministic refusal. Supersedes #43084, which fixed only the direct-Anthropic path and could not reach the chat_completions/portal path. Tests: transport-level (validate_response refusal, message.refusal promotion) + end-to-end loop (refusal surfaced, exactly one API call). (cherry picked from commit `01f546f92c`)	2026-06-14 12:10:08 +05:30
brooklyn!	4b5ba112ad	fix: shrink images to reported provider dimension limit (#45979 ) Parse provider-reported image pixel ceilings so many-image Anthropic requests can recover by shrinking Retina screenshots below the stricter limit instead of retrying the same rejected payload.	2026-06-14 01:07:43 -05:00

1 2 3 4 5 ...

11690 commits