hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-15 09:21:36 +00:00

Author	SHA1	Message	Date
Teknium	81e42335a1	fix(file-safety): relax user-write deny policy (#45947 ) Allow file tools to edit shell startup files, user package-manager configs, and Hermes control files that the user can already modify directly. Keep hard blocks for SSH keys, .env/OAuth token stores, mcp-tokens, pairing files, and system privilege files.	2026-06-14 02:07:32 -07:00
brooklyn!	526a1e24b5	Merge pull request #46029 from NousResearch/bb/summarize-gui fix(desktop): show summarizing indicator during auto-compaction	2026-06-14 02:53:14 -05:00
Brooklyn Nicholson	1eb13744b4	fix(desktop): polish compaction indicator and preserve scrollback Show a shimmering "Summarizing thread" label during auto-compaction, skip the post-turn hydrate when compaction fired so the live transcript does not collapse to the stored summary-only session.	2026-06-14 02:48:48 -05:00
brooklyn!	49dd91d682	fix(desktop): show copied checkmark on session Copy ID (#46030 ) Route sidebar Copy ID through CopyButton so dropdown and context menus get the same checkmark feedback as every other copy action.	2026-06-14 07:38:55 +00:00
Brooklyn Nicholson	715b691723	fix(desktop): show summarizing indicator during auto-compaction Auto-compression rewrites history mid-turn, which made long threads look like they reset. Re-tag the gateway lifecycle status as compacting and surface it in the desktop thread loading indicators.	2026-06-14 02:28:07 -05:00
brooklyn!	9cbb91abd3	fix(desktop): clarify UX — loading, enter-to-send, radio align (#46014 ) * fix(desktop): clarify enter-to-send and top-align choice radios Match the composer keyboard contract in clarify freeform answers and align choice-row radio dots to the start of wrapped labels. * fix(desktop): clarify loading spinner until request is ready Hold the clarify panel on a centered Loader2 until clarify.request arrives instead of showing disabled choices or a loading-question stub. * refactor(desktop): dedupe clarify shell and drop stale ready gates Extract the shared clarify panel wrapper and remove disabled-state checks that loading already makes unreachable.	2026-06-14 07:06:40 +00:00
kshitij	c8ad2ca997	Merge pull request #46013 from kshitijk4poor/salvage/refusal-content-filter fix(agent): surface model refusals as content_filter (salvage #43108 + edge-case fix)	2026-06-14 12:28:51 +05:30
kshitijk4poor	10bd01972b	refactor(agent): share the content_policy_blocked result builder + recovery hint The HTTP-200 refusal handler (finish_reason=content_filter) and the exception-path handler (a provider moderation error classified as content_policy_blocked) independently built the same terminal turn result — the same {final_response, messages, api_calls, completed:False, failed:True, error:'content_policy_blocked: ...'} dict — and ended their user-facing message with the same 'Try rephrasing... hermes fallback add' trailer, copied verbatim. The two copies could drift. Funnel both through a shared _content_policy_blocked_result() builder and a shared _CONTENT_POLICY_RECOVERY_HINT constant. Also collapse the HTTP-200 path's two near-identical with/without-explanation templates into one (compute the detail fragment once) and pass reason=FailoverReason.content_policy_blocked .value to the error hook instead of a hand-written string literal, matching the sibling hook call. Behavior-preserving: the provider/refusal lead-in wording stays distinct (a provider safety filter vs the model declining are genuinely different signals), the with-text and exception messages are byte-identical to before, and the no-explanation case only gains a paragraph break for consistency. Surfaced by the simplify-code reuse/quality reviewers. The efficiency reviewer's 'redundant normalize_response' flag was deliberately NOT applied: that branch is cold (refusal-only) and pure-CPU, and reusing the sibling-branch normalized locals would risk a NameError on the codex_responses path (which sets finish_reason without normalizing) — re-normalizing is the robust choice.	2026-06-14 12:19:19 +05:30
kshitijk4poor	12c84d6c77	fix(transports): only treat a refusal as terminal when it is the sole payload A chat-completions response that carries real text or tool calls alongside a `message.refusal` note is a normal, usable turn — the model did work. The prior logic flipped finish_reason to `content_filter` whenever a refusal string was present, so the conversation loop reframed a content-bearing turn as a failed safety refusal (failed=True) and buried the model's actual output inside the "model declined" template, or dropped tool calls entirely. Only promote to a terminal `content_filter` when the refusal is the sole payload (no visible text AND no tool calls). The refusal explanation is still recorded in provider_data in every case for observability. Refusal-only responses (the bug this feature targets) are unaffected and still surface terminally; the empty+refusal, bare content_filter passthrough, and no-refusal common cases are byte-identical to before. Updates the partial-content test to the corrected contract and adds a tool_calls-alongside-refusal regression guard.	2026-06-14 12:12:52 +05:30
SHL0MS	ab26541b9a	test(transports): lock in content_filter passthrough for OpenRouter OpenRouter (and every other OpenAI-compatible provider) uses the default chat_completions transport, so it is already covered by the refusal fix: an upstream Claude / moderation refusal arrives as finish_reason="content_filter" (often empty content, no message.refusal). Add a regression test asserting the transport passes that finish reason straight through to the loop's content_filter handler. (cherry picked from commit `60168a513b`)	2026-06-14 12:10:08 +05:30
SHL0MS	bb46bf8ce4	fix(agent): surface model refusals instead of retrying them as errors A Claude refusal (HTTP 200, stop_reason="refusal", empty content) was laundered into a generic retry loop and surfaced as a misleading "rate limited / invalid response" or "no content after retries" error, burning paid attempts reproducing a deterministic refusal. This hit two distinct paths: - Direct Anthropic (anthropic_messages): validate_response rejected the empty-content refusal before normalize_response mapped refusal -> content_filter, so it fell into the invalid-response retry loop. - Nous Portal / OpenAI-compatible (chat_completions): the portal surfaces a Claude refusal via message.refusal with empty content, which sailed past validation and died in the empty-response retry loop. Fix (one unified content_filter dispatch for all backends): - AnthropicTransport.validate_response: accept empty content when stop_reason == "refusal" so it flows to normalize_response. - ChatCompletionsTransport.normalize_response: promote message.refusal to content + a content_filter finish reason. - conversation_loop: handle finish_reason == "content_filter" - fire the api_request_error hook (content_policy_blocked), try a configured fallback once, else return a clear terminal refusal message. Never retry a deterministic refusal. Supersedes #43084, which fixed only the direct-Anthropic path and could not reach the chat_completions/portal path. Tests: transport-level (validate_response refusal, message.refusal promotion) + end-to-end loop (refusal surfaced, exactly one API call). (cherry picked from commit `01f546f92c`)	2026-06-14 12:10:08 +05:30
brooklyn!	4b5ba112ad	fix: shrink images to reported provider dimension limit (#45979 ) Parse provider-reported image pixel ceilings so many-image Anthropic requests can recover by shrinking Retina screenshots below the stricter limit instead of retrying the same rejected payload.	2026-06-14 01:07:43 -05:00
brooklyn!	cdf30a7ac6	Merge pull request #45866 from NousResearch/bb/desktop-notifications feat(desktop): native OS notifications with per-type toggles	2026-06-14 00:36:38 -05:00
Brooklyn Nicholson	b0288ae9b6	feat(desktop): move completion-sound picker into Notifications settings The turn-end sound is a notification concern, not an appearance one — relocate the variant picker + preview from the Appearance tab to the Notifications tab (its i18n keys move from settings.appearance to settings.notifications with it).	2026-06-14 00:31:09 -05:00
Brooklyn Nicholson	630a4ef03c	feat(desktop): native OS notifications with per-type toggles Adds a native OS notification system (Electron Notification, routed cross-OS) distinct from the in-app toast feed. Before this, one hardcoded cue existed (message.complete while document.hidden) with no settings or event coverage. - Engine (store/native-notifications.ts): localStorage-backed prefs (master switch + per-kind toggles) and a gated dispatcher over five kinds — approval, input, turnDone, turnError, backgroundDone — with a 1s per-(kind,session) self-evicting throttle. - Gating: "backgrounded" = document.hidden OR !document.hasFocus(), so an alt-tabbed window still counts as away. Completion kinds fire only when backgrounded and for the active session (no spam from a busy gateway); attention kinds (approval/input) also break through for off-screen sessions. - Wired into real event sites (use-message-stream.ts): message.complete, error, approval/clarify/sudo/secret.request; backgroundDone from composer-status at the running -> exited transition. - Click focuses the window and jumps to the originating session; approval notifications carry Approve/Reject buttons that resolve in place over approval.respond, mirroring the in-app Run/Reject bar. - Settings: new Notifications panel (master + per-kind switches, test button with real OS-result feedback). Full i18n (en/ja/zh/zh-hant).	2026-06-14 00:31:03 -05:00
brooklyn!	b4ba3f5e3b	feat(desktop): add curated completion cue for agent turn completion (#42480 ) * feat(desktop): add curated completion sound bank for turn completion Replace the prior haptic-only completion cue with a curated Web Audio completion sound flow, defaulting to the minimal two-note comfort preset while keeping alternate presets available for quick iteration. Play the cue on every message completion event (including background sessions) so turn-end feedback is consistent across active and non-active chats. * refactor(desktop): drop done1 byte sample from completion bank Keep the curated Web Audio presets only; the embedded sample added bulk without shipping as the default cue. * feat(desktop): expand completion sounds and add Appearance picker Add fourteen synthesized turn-end presets with preview in settings, persisted variant selection, and softer default mixing for late-night use. Co-authored-by: Cursor <cursoragent@cursor.com> * refactor(desktop): dedupe completion-sound resolver, trim audio comments Make the store the single source of truth for the variant default + range validation and have the sound lib import it (one-way lib→store edge, no cycle), instead of two divergent copies. Extract the shared white-noise buffer used by the air/whoosh voices and cut the synth comments down to why-only notes. --------- Co-authored-by: Austin Pickett <pickett.austin@gmail.com> Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-14 00:21:40 -05:00
Teknium	8f278403d1	perf(execute-code): stop waiting on idle RPC accept (#45948 )	2026-06-13 21:57:15 -07:00
Teknium	1b16c48170	fix: guard OAuth account removal	2026-06-13 21:47:13 -07:00
Flownium	e986e3fc68	fix: add provider account removal	2026-06-13 21:47:13 -07:00
Justin Sunseri	12682d96b9	feat(telegram): restore rich messages opt-out Salvages PR #45840's client-compatibility opt-out while keeping rich messages enabled by default via telegram.extra.rich_messages: true.	2026-06-13 21:45:49 -07:00
aimable100	8d5d36d793	fix(dispatch): forward session_id into registry.dispatch (#28479 ) Both the regular and execute_code dispatch paths forward task_id into registry.dispatch via middleware _dispatch lambdas but silently dropped session_id. Dispatch-layer hooks (e.g. set_enforcement_fn) that correlate calls with the active session received "" for every invocation. Pass session_id=session_id at both _dispatch call sites inside handle_function_call, matching the existing task_id pattern. Hooks already received session_id; this closes the registry.dispatch gap. Rebased onto current main where dispatch is wrapped by run_tool_execution_middleware — the old direct-dispatch sites from #28479 no longer exist. test(dispatch): add tests for session_id forwarding (NousResearch#28479) Covers standard and execute_code paths through the middleware wrapper. Verifies task_id forwarding is not broken by the change.	2026-06-14 00:27:59 -04:00
Teknium	7aaae7acd0	fix(ssl): align guard docs and escape hatch	2026-06-13 21:14:32 -07:00
Teknium	73d1357747	style(agent): keep run_agent import order stable	2026-06-13 21:14:32 -07:00
Teknium	af1995a838	chore(release): map chromalinx noreply author	2026-06-13 21:14:32 -07:00
Teknium	dc90ca4e17	fix(ssl): run CA guard during agent initialization	2026-06-13 21:14:32 -07:00
Teknium	af5b526472	fix(ssl): validate CA bundle paths before provider calls	2026-06-13 21:14:32 -07:00
chromalinx	b42c5bf652	test(ssl_guard): fix macOS fallback test that passed for the wrong reason The previous test patched ssl.create_default_context globally with a bare SSLContext that has zero CA certs. Both verify_ca_bundle() and the macOS fallback got the same mocked context, so the test verified nothing useful: both paths produced empty get_ca_certs() and the assertion that no exception escaped was vacuously satisfied. Only mock the fallback call (no cafile) — let the certifi call hit the real SSL stack and fail with SSLError on the broken PEM. The mock fallback returns a context with load_default_certs() so the test now verifies the real scenario: broken certifi → SSLConfigurationError, macOS system trust store → success. Also pads the broken PEM past the 1 KB size guard so the size check doesn't short-circuit before ssl.create_default_context(cafile=...) runs. Reported by @liuhao1024 in PR review.	2026-06-13 21:14:32 -07:00
chromalinx	a218a0f156	fix(agent,gateway,doctor): add SSL CA cert bundle fail-fast guard A stale certifi CA bundle after a partial `hermes update` used to crash the agent on the first outbound HTTPS call with a raw traceback and trap the gateway in a retry loop. This patch: * Adds `agent/errors.py` with a typed `SSLConfigurationError` * Adds `agent/ssl_guard.py` with a `verify_ca_bundle()` pre-flight that asserts the bundle exists, is non-trivial in size, and can build a working SSLContext. On macOS, it falls back to the system trust store when the bundle is empty but the system store is healthy (covers corporate proxies / MDM setups). * Wires the guard into `run_agent.py` and `gateway/run.py` right after the `hermes_bootstrap` import, inside a try/except so a bug in the guard itself can never prevent startup. * Adds a `SSL / CA Certificates` section to `hermes_cli doctor` so users can detect the failure with one command. * Adds unit tests covering the healthy, missing, empty, skip-env, and macOS-fallback paths. * Adds an RCA document describing the failure mode and the recovery path (`pip install -e .`). When the bundle is broken the user sees: \u26a0\ufe0f SSL certificate bundle issue detected. Run: pip install -e . `HERMES_SKIP_SSL_GUARD=1` disables the check for sandboxed environments that ship their own trust store.	2026-06-13 21:14:32 -07:00
Teknium	1106879147	perf(process): wake waiters on background completion (#45831 )	2026-06-13 21:11:19 -07:00
brooklyn!	6b76284c77	fix(desktop): surface off-screen approvals via the jump-to-bottom control (#45853 ) * fix(desktop): jump-to-approval pill for off-screen approvals A blocked approval's only response surface is the inline Run/Reject bar on the pending tool row. When that row is scrolled out of view the session looks stalled with no visible action. Surface a composer-anchored "Approval needed" pill only when an approval is pending AND its inline bar is scrolled away; clicking scrolls the bar back into view. Preserves the deliberate inline (not modal) approval design — the pill never duplicates the approve/reject controls. The inline bar mirrors its own viewport visibility via IntersectionObserver (tracks scroll/resize/layout) and registers a scroll-into-view handler the pill fires, mirroring the existing thread-scroll jump-button bridge. Supersedes #45828. * fix(desktop): morph jump-to-bottom into approval prompt; drop scroll bridge Collapse the separate "jump to approval" pill into the existing scroll-to-bottom control: when scrolled away from the bottom while an approval is pending, it relabels to "Approval needed". A parked approval's inline Run/Reject bar is always the bottom-most content, so the existing scroll-to-bottom action lands the user right on it — one control, no collision. This also fixes the layout corruption from the first cut: the pill called native el.scrollIntoView(), which scrolls every scrollable ancestor including the overflow:hidden chat shell containers. Those have no scrollbar to scroll back and don't remount on session switch, so the composer stayed shoved and the breakage persisted across sessions. Reusing requestScrollToBottom() (the use-stick-to-bottom path) only touches the one designated scroll container. Removes the now-unused approval-scroll store + IntersectionObserver wiring.	2026-06-13 23:07:22 +00:00
Teknium	4026f526d5	chore(release): map MaxFreedomPollard author email	2026-06-13 15:01:42 -07:00
Max Pollard	9a2b976326	test(skills): add regression tests for bundled-update backup recovery Three tests covering: a stale .bak poisoning a failed update's move/restore, an orphaned .bak misread as a user deletion, and a partially written dest blocking restore-on-failure. All three fail on current main without the fix. Refs #44942	2026-06-13 15:01:42 -07:00
Max Pollard	3581131e7d	fix(skills): make bundled-update backup handling crash-safe and idempotent Recover an orphaned .bak before classification (interrupted updates no longer read as user deletions), clear a stale .bak before shutil.move (replace, not nest), and clear a partial dest before restore so restore-on-failure actually runs. Fixes #44942	2026-06-13 15:01:42 -07:00
Teknium	bf8effad02	fix(utils): copy fallback for atomic replace across devices (#43852 ) Fallback from `os.replace` on EXDEV/EBUSY using copy+fsync+unlink while preserving symlink target semantics and metadata.	2026-06-13 14:50:05 -07:00
Teknium	817f392311	feat(read): extract notebook and office documents (#37082 ) Add stdlib-only extraction for `.ipynb`, `.docx`, and `.xlsx` in read_file with lazy integration and malformed-document fallback.	2026-06-13 14:42:51 -07:00
Teknium	2b67e96aec	fix(approval): gate in-place edits to sensitive user files Cover sed, perl, and ruby in-place mutations against shell rc, SSH, and credential files so terminal approvals pair the redirection and copy guards.	2026-06-13 14:35:27 -07:00
helix4u	abd69b8117	fix(approval): detect absolute home shell rc writes	2026-06-13 14:35:27 -07:00
briandevans	da28d5d113	fix(security): gate cp/mv/install into ~/.ssh, credential, and shell-rc files tools/approval.py already denies tee/redirection writes to every _SENSITIVE_WRITE_TARGET (~/.ssh/*, ~/.netrc/.pgpass/.npmrc/.pypirc, shell rc files, ~/.hermes/config.yaml/.env) via the DANGEROUS_PATTERNS tee/`>` rules, but cp/mv/install were only paired for _SYSTEM_CONFIG_PATH (/etc) and the project-relative env/config target. So `cp evil ~/.ssh/authorized_keys` (SSH-key implant / persistence), `cp creds ~/.netrc`, and `cp evil ~/.bashrc` (login-time command injection) auto-approved while the equivalent tee/`>` forms were denied — an unpaired write deny is theater (same rationale as #14639 / commit `4e9d886d`, which paired the terminal side for ~/.hermes/config.yaml writes but did not touch these cp/mv/install verbs on the broader sensitive set). Add one (cp\|mv\|install) DANGEROUS_PATTERNS entry reusing the existing _SENSITIVE_WRITE_TARGET fragment, anchored via _COMMAND_TAIL so it fires on the destination (last arg) only: reading OUT of a sensitive path (`cp ~/.ssh/config /tmp/x`) stays auto-approved. Description differs from the system-config cp entry so the two keep distinct approval keys (no silent cross-approval). Additive — does not subsume the /etc or project-config rules. Adds TestSensitiveCopyMovePattern: 5 positive cases (ssh authorized_keys, ssh private key via mv, netrc via install, bashrc, ~/.hermes/config.yaml) + 2 negative guards (copy FROM ssh, unrelated copy). The ssh/netrc/bashrc positives fail on main and pass on this branch; the negatives stay green both ways.	2026-06-13 14:35:27 -07:00
Teknium	1fa761f8de	fix(search): keep partial results on search timeout (#36142 ) Treat search command budget timeouts as soft truncation so partial results survive, while real search failures still return structured errors.	2026-06-13 14:35:21 -07:00
Teknium	069bfd6545	fix(agent): keep Codex reasoning replay on Codex path	2026-06-13 14:35:00 -07:00
briandevans	1d584a301e	fix(agent): treat Codex reasoning items as thinking-only	2026-06-13 14:35:00 -07:00
ITheEqualizer	57c2a55be4	fix(telegram): harden rich message fallback handling Carry forward focused follow-ups from PR #45741: treat PTB's raw Bot API 10.1 response shapes safely, recognize real missing-endpoint errors, preserve link preview settings on rich sends, and lock the rich limit to Telegram's character-based cap.	2026-06-13 14:34:53 -07:00
brooklyn!	0a865e5948	fix(desktop): bypass Chromium editing pipeline for large paste & select-delete (#45812 ) Large paste and Ctrl+A → Delete froze the composer for seconds — both routed through Chromium's contenteditable editing pipeline (~O(n²) on multiline DOM). - insertPlainTextAtCaret: Range + text/<br> fragment (paste path) - deleteSelectionInEditor: range.deleteContents for non-collapsed Backspace/Delete - Shared composerSelectionRange helper; both flush via flushEditorToDraft Profiled live (47 KB / 122 paragraphs): paste 4474 ms → 13 ms; select-delete 1304 ms → 4 ms. Collapsed-caret deletes still native.	2026-06-13 20:49:58 +00:00
Teknium	c8e5f34f24	fix(gemini): strip native self prefixes before generateContent (#36141 ) Strip `google/` and `gemini/` self-prefixes before native Gemini generateContent calls, and keep provider-normalization expectations aligned.	2026-06-13 13:47:08 -07:00
briandevans	7d11fa4e9e	fix(codex-responses): let final_answer complete top-level incomplete responses	2026-06-13 13:45:29 -07:00
ITheEqualizer	7c0605bf22	fix(telegram): preserve rich formatting on stream final	2026-06-13 13:44:45 -07:00
achaljhawar	819def44c7	fix(agent): scope Nous tags to Nous auxiliary calls	2026-06-13 13:24:40 -07:00
Teknium	08890d77e6	fix(plugins): normalize browser-pasted GitHub repo URLs (#33539 ) Accept common GitHub web URLs in `hermes plugins install` by normalizing repository views back to cloneable `.git` URLs, with focused parser coverage.	2026-06-13 13:23:59 -07:00
brooklyn!	425e777f54	fix(desktop): polish slash command completion (space/tab/click + typed args) (#45760 ) * fix(desktop): accept slash command on space at command stage Pressing space on a no-arg slash command (e.g. /hermes-agent) fell through to the arg-completion stage and dead-ended on "No matches" instead of inserting the directive. Space now mirrors Tab/Enter while the command name is still being typed: no-arg commands commit the chip, arg-taking commands expand to their options step. * fix(desktop): suppress arg popover for no-arg slash commands Committing a no-arg command (`/hermes-agent `) re-detected the chip+space as an arg query and re-opened the popover on "No matches". The arg-stage menu now only opens when the command actually takes args. * fix(desktop): polish slash arg completion (space/tab/click + typed args) Unify Enter/Tab/Space accept of the highlighted item at both the command and arg stages: no-arg commands commit a chip, arg commands expand to options, and an arg option commits the full `/cmd arg` chip. A fully-typed arg (which the backend completer drops from suggestions) now commits on Space/Tab via the verbatim text instead of dead-ending, and the "No matches" empty state is suppressed past a command's name. Space stays slash-only so @ mentions keep a literal space.	2026-06-13 18:43:52 +00:00
kshitij	7be22e37e1	Merge pull request #45753 from kshitijk4poor/salvage/gateway-auto-resume-duplicate-agent fix(gateway): claim session slot before auto-resume task to prevent duplicate agents (#45456)	2026-06-13 23:46:17 +05:30

1 2 3 4 5 ...

11652 commits