hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-26 17:38:36 +00:00

Author	SHA1	Message	Date
xxxigm	59ea2f98e6	fix(desktop): always show the Manage-profiles overflow (#42871) The "..." overflow that opens the profile manager (the only UI to edit a profile's SOUL.md) was gated behind profiles.length > 1, so a user with only the default profile couldn't edit its persona without first creating a throwaway second profile. Render it unconditionally.	2026-06-09 19:32:25 -05:00
brooklyn!	aecdacb11b	Merge pull request #43109 from NousResearch/fix/desktop-remote-attach-drops fix(desktop): stage dropped files into the remote session workspace	2026-06-09 19:22:11 -05:00
Brooklyn Nicholson	7ffc216bc0	fix(agent): make a binary @file: reference actionable instead of a dead end A binary @file: ref (PDF, docx, spreadsheet, …) expanded to a bare "binary files are not supported" warning with no content. The model saw a failure and gave up — e.g. a dropped PDF came back as a text note claiming the type was unsupported, even though the file was staged on disk right next to it. Inject an actionable content block instead: the path, mime type, size, and a nudge to use its tools to read/convert/view the file (and explicitly not to tell the user the type is unsupported). General across every binary type — not PDF-specific. The file already resolves where the agent's tools run (local cwd or the staged copy in a remote session workspace), so it can act on it directly.	2026-06-09 19:16:46 -05:00
brooklyn!	218452b050	fix(state.db): recover from malformed sqlite_master so hidden sessions reappear (#43149 ) * fix(state.db): recover from malformed sqlite_master so hidden sessions reappear The corruption class behind "Desktop/Dashboard show no sessions while hundreds of session files sit on disk" is a malformed sqlite_master — most often a duplicate object row, e.g. two CREATE VIRTUAL TABLE messages_fts entries — surfacing as: sqlite3.DatabaseError: malformed database schema (messages_fts) - table messages_fts already exists SQLite parses the whole schema while preparing the FIRST statement on a connection, so on this class every statement fails before it runs: PRAGMA journal_mode (which is where SessionDB.__init__ actually trips, in apply_wal_with_fallback, BEFORE _init_schema), PRAGMA integrity_check, and even DROP TABLE. The only operations that still work are PRAGMA writable_schema=ON plus direct sqlite_master surgery. A plain FTS-index rebuild at the _init_schema layer therefore cannot reach or fix this; the canonical sessions/messages rows are intact — only the derived schema is broken. Add a dedicated recovery that operates where the failure actually happens: - hermes_state.repair_state_db_schema(): backs up the raw file first, then a least-destructive ladder — (1) de-duplicate sqlite_master keeping the lowest rowid per object (preserves the existing FTS index), escalating to (2) drop every messages_fts* schema object + VACUUM and let the next open rebuild the FTS index from messages. sessions/messages are never modified. Plus is_malformed_db_error() to discriminate this class. - SessionDB.__init__ auto-heals: on a malformed-schema open error it repairs once (process-guarded against loops / concurrent web_server opens) and reopens, so Desktop/Dashboard recover on their own instead of silently showing "no sessions". - hermes doctor --fix detects the malformed class and repairs it (reporting the recovered session count + backup name). 
- hermes sessions repair [--check-only] [--no-backup] runs on the raw file path, since SessionDB() itself cannot open a malformed DB. Supersedes #32589 and #33869: both targeted FTS corruption but gated their repair behind statements (integrity_check / SELECT / DROP TABLE) that themselves fail on this class, and neither addressed the apply_wal_with_fallback open-time failure. Credit preserved via Co-authored-by. Closes #33865. Co-authored-by: João Vitor Cunha <145560011+plcunha@users.noreply.github.com> Co-authored-by: Tuna Dev <273476039+tuancookiez-hub@users.noreply.github.com> * test(state.db): cover strat-B escalation + unrepairable safe-fail paths --------- Co-authored-by: João Vitor Cunha <145560011+plcunha@users.noreply.github.com> Co-authored-by: Tuna Dev <273476039+tuancookiez-hub@users.noreply.github.com>	2026-06-09 18:49:08 -05:00
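The two building blocks named in the commit above (discriminating the malformed-schema class, and keeping the lowest rowid per object when de-duplicating sqlite_master) can be illustrated with a minimal sketch. The function names and the tuple layout below are hypothetical and chosen for illustration; this is not the code in hermes_state, and it deliberately leaves out the backup, the writable_schema surgery, and the VACUUM escalation.

```python
import sqlite3
from typing import Iterable, List, Tuple


def is_malformed_schema_error(exc: BaseException) -> bool:
    """Return True for the 'malformed database schema' class of errors.

    Hypothetical stand-in for the is_malformed_db_error() helper the
    commit mentions; the real check may look at more than the message.
    """
    return isinstance(exc, sqlite3.DatabaseError) and (
        "malformed database schema" in str(exc).lower()
    )


def duplicate_rowids(rows: Iterable[Tuple[int, str, str]]) -> List[int]:
    """Pick the sqlite_master rows to delete.

    `rows` holds (rowid, type, name) tuples. For every (type, name)
    pair the lowest rowid survives and all later rowids are returned
    for deletion.
    """
    kept = {}
    to_delete = []
    for rowid, obj_type, name in sorted(rows):  # ascending rowid
        key = (obj_type, name)
        if key in kept:
            to_delete.append(rowid)
        else:
            kept[key] = rowid
    return to_delete
```

Keeping the selection logic pure (no database handle) makes it easy to unit-test separately from the risky part, which is writing to sqlite_master.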
Brooklyn Nicholson	29147afd63	fix(desktop): friendlier toast when a remote attachment exceeds the 16MB cap Remote attachments read their bytes through the readFileDataUrl IPC, which is hard-capped at 16MB and rejects with a raw "file is too large (N bytes; limit M bytes)" string straight into the failure toast (helix4u review note on #43109). Translate that into "<file> is too large to upload to the remote gateway (max 16 MB)", parsing the limit out of the message so it tracks the real cap. Applies to both the image and non-image remote read paths; non-cap errors pass through unchanged. Adds unit coverage for both.	2026-06-09 18:31:09 -05:00
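The message translation described in the commit above can be sketched as follows. The desktop code is presumably not Python; this is only an illustration of parsing the limit out of the raw string, and the function name, the regular expression, and the assumption that the cap is expressed in binary megabytes (1024 * 1024 bytes) are all hypothetical.

```python
import re

# Matches the raw IPC rejection quoted in the commit message.
_TOO_LARGE = re.compile(r"file is too large \((\d+) bytes; limit (\d+) bytes\)")


def friendly_attachment_error(file_name: str, raw_message: str) -> str:
    """Translate the size-cap rejection; pass other errors through unchanged."""
    match = _TOO_LARGE.search(raw_message)
    if match is None:
        return raw_message
    # Derive the displayed cap from the message so it tracks the real limit.
    limit_mb = int(match.group(2)) // (1024 * 1024)
    return (
        f"{file_name} is too large to upload to the remote gateway "
        f"(max {limit_mb} MB)"
    )
```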
Brooklyn Nicholson	b021497bc8	fix(desktop): show a staging spinner in the edit composer while OS drops upload The message-edit composer staged dropped OS files asynchronously with no visible state, so confirming the edit before the upload resolved could send the message without the gateway-side ref (helix4u review note on #43109). Add a staging flag: while uploadOsDropRefs is in flight, show a small spinner pill in the bubble and block submit (disabled send button + submitEdit guard) so the edit can't outrace the ref insertion. New `attachingFile` i18n string across en/zh/zh-hant/ja.	2026-06-09 18:26:54 -05:00
Brooklyn Nicholson	891c9a6823	fix(desktop): close eager-upload races flagged in review Two races in the drop-time eager upload: - Resurrected chip: the success path used addComposerAttachment, which re-appends when the id is gone, so a file removed mid-upload reappeared once the upload resolved. Add updateComposerAttachment (update-only; no-op when the chip was removed) and use it on both the eager success path and submit-time sync. - Duplicate upload: submit-time sync didn't join an eager upload still in flight, so drop-then-Enter could fire file.attach twice and leave a duplicate under .hermes/desktop-attachments/. Track in-flight eager uploads by id and await the pending one before deciding to re-upload, reusing its gateway ref. Tests: composer-store no-resurrect unit tests + a join-on-submit integration test asserting a single file.attach. Addresses @helix4u review on #43109.	2026-06-09 18:21:10 -05:00
kshitijk4poor	72154ad879	perf(ci): cache uv + use uv sync in tests workflow Both jobs in tests.yml (`test` matrix and `e2e`) start from a cold uv cache on every run and install deps with `uv pip install -e ".[all,dev]"`, which re-resolves pyproject.toml ranges and rebuilds the editable install each time. Two changes: 1. Enable uv's official CI caching via setup-uv's `enable-cache: true`, keyed on pyproject.toml + uv.lock, plus `uv cache prune --ci` to keep the persisted cache small. Warm runs install from cache instead of re-downloading/building wheels. 2. Replace the manual `uv venv` + `uv pip install -e` with `uv sync --locked --python 3.11 --extra all --extra dev`. sync installs the exact pinned set from uv.lock (and fails if the lock is stale vs pyproject.toml), creating .venv itself. This is reproducible and, with a warm cache, measurably faster than the editable pip install (~3-4x on the steady-state install step locally). Downstream steps keep using `source .venv/bin/activate`; sync writes .venv to the same path. Follows the Astral-recommended pattern for uv in GitHub Actions: https://docs.astral.sh/uv/guides/integration/github/ Co-authored-by: Wesley Simplicio <wesleysimplicio@live.com>	2026-06-09 18:30:44 -04:00
Brooklyn Nicholson	153060e206	fix(desktop): render optimistic image thumbnails from in-hand base64 The in-flight user bubble seeded image attachment refs as `@image:<localpath>`. In remote-gateway mode that path lives on the desktop, not the gateway, so the inline thumbnail fetch hit /api/media and 403'd ("Path outside media roots"), flashing a fallback chip until submit uploaded the bytes. Seed (and keep) image refs as the raw base64 preview data URL instead. It renders inline via extractEmbeddedImages with zero network, and survives the post-sync rewrite (the agent gets the bytes through the attached-image pipeline, not this display ref) so the thumbnail no longer remounts/flashes. Non-image refs are unchanged. Adds optimisticAttachmentRef + unit coverage.	2026-06-09 17:03:42 -05:00
Brooklyn Nicholson	4906dcfc25	fix(desktop): stage dropped files into the remote session workspace Finder/OS drops became `@file:/Users/...` refs that only resolve when the gateway shares the local disk, so on a remote gateway non-image files (PDF/CSV/Markdown/...) never reached the agent. Route OS drops through the file.attach / image.attach_bytes upload pipeline — in-app project-tree and gutter drags stay inline workspace-relative refs — across every drop surface: the conversation area, the composer form, the contenteditable input, and the message-edit composer (which still reproduced the bug). Also: - upload dropped files eagerly when a session exists, so the card shows a spinner instead of stalling the send (images stay submit-time to avoid racing their thumbnail write); - round the attachment card and drop the monospace detail; - render image previews from the bytes we already hold, so a pasted/dropped screenshot shows its thumbnail and previews even when its only on-disk copy is a transient path (the data URL is not persisted to localStorage). Supersedes #38615, #41203. Co-authored-by: LeonSGP <154585401+LeonSGP43@users.noreply.github.com> Co-authored-by: Teknium <127238744+teknium1@users.noreply.github.com>	2026-06-09 16:50:08 -05:00
Teknium	57c6714995	fix(models): keep curated Anthropic aliases in /model picker (#43103 ) The Anthropic picker returned the live /v1/models dump verbatim whenever credentials were configured. Anthropic's API lags newly-routed curated aliases (e.g. claude-fable-5, reachable on Anthropic before the models endpoint enumerates it), so the curated entry vanished from the picker. Merge curated _PROVIDER_MODELS["anthropic"] with the live catalog — curated first, live-only appended, deduped — mirroring the OpenAI curated-merge path. Live failure / no creds falls back to curated verbatim.	2026-06-09 14:45:19 -07:00
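The merge rule in the commit above (curated first, live-only appended, deduped, curated verbatim on live failure) is small enough to sketch. The function name and signature are hypothetical; only the ordering rule comes from the commit message.

```python
from typing import List, Optional, Sequence


def merge_model_lists(
    curated: Sequence[str], live: Optional[Sequence[str]]
) -> List[str]:
    """Merge a curated model list with a live catalog.

    Curated entries keep their order and come first; ids that appear
    only in the live catalog are appended; duplicates are dropped.
    A missing or empty live list falls back to the curated list.
    """
    if not live:
        return list(curated)
    merged: List[str] = []
    seen = set()
    for model_id in [*curated, *live]:
        if model_id not in seen:
            seen.add(model_id)
            merged.append(model_id)
    return merged
```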
ethernet	a5d05cf30e	fix(nix): don't run .#fix-lockfiles, it's so slow	2026-06-09 16:55:33 -04:00
ethernet	68a997fed4	add website links to readme for seo	2026-06-09 16:35:34 -04:00
Jeffrey Quesnelle	49dd776d8b	Merge pull request #43041 from NousResearch/fix/fable-anthropic add Fable 5 to model list for Anthropic provider	2026-06-09 15:38:51 -04:00
emozilla	d7886da08c	add Fable 5 to model list for Anthropic provider	2026-06-09 15:33:42 -04:00
xxxigm	02f878ec5a	docs(windows): correct native data dir to %LOCALAPPDATA%\hermes (#42856 ) * docs(windows): correct native data dir to %LOCALAPPDATA%\hermes The Windows-native guide claimed a deliberate split where config, auth, skills, and sessions live under %USERPROFILE%\.hermes. That is not what the installer does: scripts/install.ps1 sets HERMES_HOME=%LOCALAPPDATA%\hermes, so data actually lives in %LOCALAPPDATA%\hermes alongside the disposable install (the hermes-agent\, git\, node\, bin\ subdirectories) — `hermes config` confirms config.yaml/.env resolve there, not under %USERPROFILE%. Update the data-layout table, the "split is deliberate" note, the env-var and uninstall sections to describe the real layout: data and install share the %LOCALAPPDATA%\hermes root, reinstall only replaces hermes-agent\, and a full wipe targets %LOCALAPPDATA%\hermes (with %USERPROFILE%\.hermes kept only as a legacy/WSL cleanup). Mention HERMES_HOME as the override knob. * docs(windows): fix PATH + bin layout to match installer The installer adds hermes-agent\venv\Scripts (where hermes.exe lives) to User PATH and sets HERMES_HOME — not %LOCALAPPDATA%\hermes\bin. The \bin dir holds Hermes's managed uv.exe, not a hermes.cmd shim. Correct the install-step list and the data-layout table accordingly. * fix(install): show real HERMES_HOME path in setup messages The native Windows installer wrote config/env/skills under $HermesHome (%LOCALAPPDATA%\hermes) but its success messages claimed ~/.hermes, which doesn't exist on native Windows. Print the actual paths so a new user can find their config, .env, and skills.	2026-06-09 14:11:20 -05:00
brooklyn!	8d71c38919	fix(desktop): rebind sessions after websocket reconnect (salvage of #41740) (#43004) * fix(desktop): rebind sessions after websocket reconnect * docs(desktop): explain the reconnect-resume guard in use-route-resume The reconnect fix turns on two subtle conditions with no inline rationale: `seenGatewayStateRef` suppresses a spurious "became open" on the first effect run (so a session mounting with the gateway already open doesn't double-resume), and the `gatewayBecameOpen ||` arm forces a re-resume even when the route looks `alreadyActive` because the cached runtime id can be stale after the gateway rebinds/reaps the session. Comment both so the next reader doesn't "simplify" them back into the original bug. No behavior change. --------- Co-authored-by: Josh Dow <josh.dow@prepad.io>	2026-06-09 19:01:00 +00:00
Siddharth Balyan	46fedef07f	fix(openrouter): never send reasoning field for adaptive Anthropic models (#43012 ) The previous fix (#42991) only omitted reasoning when it was being disabled. But reasoning-mandatory Anthropic models (Claude 4.6+, fable) 400 with thinking.type.disabled on EVERY tool-continuation turn even when reasoning is enabled: chat_completions never replays signed thinking blocks, so the prior assistant tool_call has no thinking, and OpenRouter resolves "reasoning requested but history has none" by emitting thinking.type.disabled — which these models reject. Result: first turn works, every turn after the first tool call dies (HTTP 400, non-retryable). OpenRouter ignores reasoning.effort for adaptive Anthropic models anyway (the model self-decides), so the reasoning field is pointless for them on every turn and harmful on tool-replay turns. Omit it entirely → adaptive default. - openrouter profile: drop the reasoning field for reasoning-mandatory Anthropic models regardless of enabled/disabled; legacy Anthropic + non-Anthropic models unchanged. - tests: assert omission across enabled/disabled/effort variants; parity tests switched to a non-Anthropic reasoning model (deepseek) since Anthropic 4.6+ no longer carries a reasoning field. Verified live end-to-end: a tool-replay turn on anthropic/claude-fable-5 with reasoning enabled now builds extra_body=None and returns HTTP 200 (was 400).	2026-06-10 00:18:23 +05:30
brooklyn!	ba44de06da	fix(install): self-heal a stuck Electron download (salvage of #42894 ) (#42998 ) * fix(install): self-heal a stuck Electron download on the desktop build The desktop build downloads Electron (~114MB) from GitHub. A corrupt cached zip, or a blocked/throttled GitHub release host (the repeating "retrying" log), hard-failed the install — and install.sh had no recovery at all while install.ps1 / `hermes desktop` only purged the cache. All three build paths now escalate on a failed `npm run pack`: GitHub → purge corrupt electron-.zip + stale -unpacked and retry → one retry via a public Electron mirror (npmmirror.com). @electron/get SHASUM-verifies the download, and a user-pinned ELECTRON_MIRROR is always respected (never overridden). Adds a bash clear_electron_build_cache()/_desktop_pack() to mirror the existing PowerShell/Python helpers. * test(install): cover the Electron mirror fallback Verify `hermes desktop` falls back to a mirror when the cache purge finds nothing, and that a user-pinned ELECTRON_MIRROR is respected (no extra attempt, not overridden). * docs(desktop): troubleshoot a stuck Electron download Document the automatic cache-purge + mirror fallback, how to pin your own ELECTRON_MIRROR, and how to clear a corrupt cached zip by hand. * docs(install): correct the Electron mirror trust framing The mirror-fallback comments and the desktop troubleshooting doc implied `@electron/get`'s SHASUM check makes the npmmirror.com download safe against tampering. It doesn't: the SHASUMS256.txt is fetched from the same mirror, so the check guards against a corrupt/partial download, not a compromised mirror. Reframe all four surfaces (install.sh, install.ps1, `hermes desktop`, and the docs) to state the trust trade-off honestly — npmmirror.com is the de-facto Electron community mirror, we only fall back to it after the canonical GitHub download fails, and a user-pinned ELECTRON_MIRROR is never overridden. No behavior change. 
--------- Co-authored-by: xxxigm <tuancanhnguyen706@gmail.com>	2026-06-09 18:19:14 +00:00
Rod Boev	5750d058fa	fix(tests): use cross-platform pytest-timeout method (#39881)	2026-06-09 14:17:59 -04:00
Siddharth Balyan	1febb08240	fix(anthropic): default new Claude models to the modern thinking contract (#42991 ) New Anthropic models without a recognized version substring (claude-fable-5 and future named/numbered releases) were classified as legacy and routed down the manual-thinking path, which made OpenRouter emit thinking.type.disabled — a form reasoning-mandatory Claude models reject with a non-retryable HTTP 400. Invert the brittle version-substring allowlists to default-to-modern (mirroring _get_anthropic_max_output): unknown Claude models get the adaptive/xhigh/ no-sampling contract, with an explicit legacy list for older families. Non-Claude Anthropic-Messages models (minimax, qwen3, …) keep the manual path. - anthropic_adapter: _supports_adaptive_thinking / _supports_xhigh_effort / _forbids_sampling_params now default unknown Claude models to modern; legacy families enumerated in _LEGACY_MANUAL_THINKING_CLAUDE_SUBSTRINGS. - openrouter profile: omit reasoning entirely (→ adaptive default) instead of forwarding {enabled:false} for reasoning-mandatory Anthropic models; legacy Anthropic + all non-Anthropic models still pass the disable form through. - model_metadata + output-limit table: register claude-fable-5 (1M ctx, 128K out). Tests assert the invariant ("unknown Claude model -> modern contract; legacy stays manual; non-Claude unaffected"), not specific model names.	2026-06-09 23:37:23 +05:30
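The inversion described in the commit above (an allowlist of known-modern versions replaced by default-to-modern with an explicit legacy list) can be sketched like this. The legacy tuple below is a placeholder for illustration only; the real entries live in _LEGACY_MANUAL_THINKING_CLAUDE_SUBSTRINGS and are not reproduced here, and the function name is hypothetical.

```python
# Placeholder legacy list (assumption); not the project's actual table.
LEGACY_SUBSTRINGS = ("claude-3",)


def uses_modern_thinking_contract(model: str) -> bool:
    """Default unknown Claude models to the modern contract.

    Non-Claude models keep the manual path; Claude models are modern
    unless they match an explicitly enumerated legacy family.
    """
    name = model.lower()
    if "claude" not in name:
        return False
    return not any(legacy in name for legacy in LEGACY_SUBSTRINGS)
```

The point of the design is the failure mode: a newly released model name that nobody has registered yet lands on the modern path instead of silently inheriting the legacy one.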
Frowte3k	39b76d9013	fix(packaging): ship optional-mcps catalog in wheel and sdist (#39859 ) The shipped MCP catalog (optional-mcps/) wasn't packaged, so `hermes mcp catalog` and the dashboard catalog screen come up empty on pip/Homebrew/Nix installs even though the manifests exist in the repo. The runtime expects a packaged catalog (get_optional_mcps_dir() -> _get_packaged_data_dir("optional-mcps"); list_catalog() returns [] when it's absent). Ship it like locales: pyproject [tool.setuptools.data-files] for the wheel + a MANIFEST.in graft for the sdist. optional-mcps/ is nested (optional-mcps/<name>/manifest.yaml) and data-files flattens each glob into its target dir, so each catalog entry gets its own target to preserve the per-entry directory the catalog iterates over.	2026-06-09 14:03:20 -04:00
Austin Pickett	52f7e24a74	feat(tui): interactive Plugins Hub overlay for enable/disable The TUI had no way to toggle plugins — `/plugins` only printed a static list, and the classic `hermes plugins` picker is curses-based and can't run inside the Ink UI. Users had to drop to a separate shell and run `hermes plugins enable/disable`. Add a PluginsHub overlay modeled on the existing SkillsHub: - New gateway RPC `plugins.manage` (list + toggle) backed by the same disk-discovery + dashboard_set_agent_plugin_enabled primitives the CLI and dashboard already use, so all three surfaces agree on state. The toggle path also wires the plugin's toolset into platform_toolsets. - `/plugins` with no arg opens the hub; any subcommand still falls through to the text slash worker for CLI parity. - pluginsHub overlay state threaded through overlayStore / interfaces / useInputHandlers (Esc closes) / appOverlays (renders the FloatBox); preserved across turn teardown like other user-toggled overlays. - Hub UI: arrow/number select, Enter/Space toggles live, Tab switches user-only vs all (bundled) scope, shows ✓/✗/○ activation glyphs. plugins.manage added to _LONG_HANDLERS (disk + config I/O).	2026-06-09 10:50:13 -07:00
Austin Pickett	b8eede7bda	fix(cli): /plugins shows installed-but-not-enabled plugins The /plugins slash command read from the live PluginManager, which only knows about loaded plugins. A freshly-installed plugin that hadn't been enabled yet showed 'No plugins installed. Drop plugin directories into ~/.hermes/plugins/' — even though it was on disk and a valid plugin. Switch to the same disk-discovery path as 'hermes plugins list' (_discover_all_plugins + enabled/disabled sets + _plugin_status), so an installed plugin now appears with its activation state ([not enabled], enabled, or disabled) plus the exact enable command. Default the quick /plugins view to user-installed plugins and summarize bundled providers/platforms on one line (the full catalog stays behind 'hermes plugins list') so the output isn't drowned by 60+ bundled provider plugins.	2026-06-09 10:49:43 -07:00
Teknium	967c325da8	fix(models): read OpenRouter live context_length before hardcoded catch-all (#42986 ) OpenRouter-routed slugs that are absent from models.dev (e.g. a freshly shipped anthropic/claude-fable-5) fell through to the generic DEFAULT_CONTEXT_LENGTHS["claude"]=200K entry and under-reported their real 1M window. The step-6 OpenRouter live-metadata fallback was gated on `not effective_provider`, but an OpenRouter selection sets effective_provider="openrouter" (inferred from the base URL), so that branch was dead code for every OR model. Add a dedicated step-5 OpenRouter branch that consults the live /models catalog (authoritative, refreshes as new slugs ship) before models.dev and the hardcoded family defaults — mirroring the existing Nous/Copilot/GMI branches. Keeps the Kimi-family 32k underreport guard. Per-model values are respected (claude-haiku-4.5 stays 200K), so it does not blanket-bump to 1M. Regression tests cover the fable-5 case, the genuinely-200k case, and the Kimi guard.	2026-06-09 10:49:32 -07:00
Teknium	f6f573ebaa	feat(plugins): install from a subdirectory within a repo (#42963 ) Support installing a plugin that lives in a subdirectory of a larger repo (docs/tests at root, plugin in a subdir) without forcing a dedicated single-plugin repo. Identifier syntax: owner/repo/path/to/plugin (shorthand + subpath) <url>.git/path/to/plugin (.git boundary on GitHub-style URLs) <url>#path/to/plugin (explicit fragment, any scheme) _resolve_git_url now returns (git_url, subdir); _install_plugin_core reads the manifest from and moves only the subdir, so root-level docs and tests no longer leak into ~/.hermes/plugins. _resolve_subdir_within guards against path traversal, missing dirs, and non-directories. Both the CLI (hermes plugins install) and the dashboard install endpoint inherit this for free since they share _install_plugin_core. Dashboard install hint + placeholder updated to advertise the subdir syntax. Co-authored-by: Austin Pickett <pickett.austin@gmail.com>	2026-06-09 13:42:51 -04:00
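The three identifier shapes listed in the commit above can be split into a (git_url, subdir) pair roughly as follows. This is a sketch under assumptions: the function names are hypothetical, the expansion of the owner/repo shorthand to a github.com HTTPS URL is assumed, and the traversal check is purely lexical, whereas the commit's _resolve_subdir_within also checks for missing directories and non-directories on disk.

```python
from pathlib import PurePosixPath
from typing import Optional, Tuple


def split_plugin_identifier(identifier: str) -> Tuple[str, Optional[str]]:
    """Split a plugin identifier into (git_url, subdir or None)."""
    # <url>#path/to/plugin : explicit fragment, any scheme
    if "#" in identifier:
        url, _, subdir = identifier.partition("#")
        return url, subdir.strip("/") or None
    # <url>.git/path/to/plugin : the .git suffix marks the repo boundary
    if ".git/" in identifier:
        url, _, subdir = identifier.partition(".git/")
        return url + ".git", subdir.strip("/") or None
    # owner/repo[/path/to/plugin] : shorthand (GitHub expansion is assumed)
    if "://" not in identifier and not identifier.startswith("git@"):
        parts = identifier.strip("/").split("/")
        if len(parts) >= 2:
            owner, repo, *rest = parts
            return f"https://github.com/{owner}/{repo}.git", "/".join(rest) or None
    return identifier, None


def is_safe_subdir(subdir: str) -> bool:
    """Lexical guard: reject absolute paths and any '..' component."""
    path = PurePosixPath(subdir)
    return not path.is_absolute() and ".." not in path.parts
```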
Teknium	ff9c110d5a	feat(models): add anthropic/claude-fable-5 to openrouter + nous curated lists (#42979) Adds the model above claude-opus-4.8 in both the OPENROUTER_MODELS and _PROVIDER_MODELS['nous'] curated picker lists used by /model and `hermes model`. Regenerated website/static/api/model-catalog.json to match.	2026-06-09 10:20:37 -07:00
brooklyn!	c4811c382f	fix(desktop): pad app icon to Apple grid so dock size matches peers (#42946 ) * fix(desktop): pad app icon to Apple grid so dock size matches peers The icon body filled ~92% of the canvas; macOS adds no padding, so it rendered larger than other dock icons. Normalize to Apple's grid (~824px body on a 1024px canvas) and ship a reproducible generator. - regenerate icon.png/.icns/.ico with ~80% body + transparent margins - keep original art as icon-source.png (master) - add scripts/gen-app-icon.cjs + `npm run icons` (idempotent) * chore(desktop): drop one-shot icon generator, ship only the assets The regenerated icon.png/.icns/.ico are the deliverable; the padding rationale lives in the PR. No build infra needed for a one-off. * fix(desktop): pad apple-touch-icon — the actual runtime dock icon app.dock.setIcon() overrides the bundle .icns at runtime with public/apple-touch-icon.png, so the dock icon users see while the app runs came from that (1254px canvas, ~91% full-bleed body). Normalize it to the same Apple grid (824px body on 1024px canvas). Also covers the web favicon + onboarding logo that reference the same file.	2026-06-09 11:48:26 -05:00
Gille	c6dc2fcd21	fix(desktop): release profile backends before delete (#42613)	2026-06-09 10:52:02 -05:00
liuhao1024	f6416f50fc	fix(deps): bump urllib3 and PyJWT to clear CVEs (#40179 ) * fix(deps): bump urllib3 and PyJWT to clear CVEs urllib3 2.6.3 → 2.7.0: fixes GHSA-mf9v-mfxr-j63j (decompression-bomb bypass in streaming API) and GHSA-qccp-gfcp-xxvc (sensitive headers forwarded across origins in proxied redirects). PyJWT 2.12.1 → 2.13.0: fixes PYSEC-2026-175/177/178/179. Note: python-multipart and idna are already at patched versions in uv.lock (0.0.27 and 3.15 respectively). Fixes #40176 * fix(deps): add upper bound for urllib3 dependency spec Add '<3' ceiling to urllib3 specifier to satisfy the PyPI dependency upper bounds CI check. Per CONTRIBUTING.md policy, all PyPI deps must use '>=floor,<next_major' pinning.	2026-06-09 11:19:05 -04:00
Philip D'Souza	92dfd70d6a	fix(photon): production hardening for the gRPC-native iMessage channel (#42732 ) * fix(photon): override transitive CVEs in the sidecar deps `npm audit` flagged 7 high-severity transitive CVEs (protobufjs code injection GHSA-66ff-xgx4-vchm + outdated @opentelemetry OTLP exporters) pulled in via spectrum-ts -> @photon-ai/otel. npm's suggested fix downgrades spectrum-ts to a version that targets the decommissioned spectrum host, so instead pin patched versions via `overrides` (protobufjs 8.6.1, @opentelemetry/* 0.218.0) without touching spectrum-ts. `npm audit` -> 0; spectrum-ts + provider still import. * fix(photon): harden the sidecar bridge + bound the dedup cache - constant-time sidecar control-token comparison (was `!==`, timing-attackable). - cap the control-channel request body (2 MiB) so a compromised local peer can't OOM the sidecar. - wrap the inbound gRPC stream consumer in a re-subscribe loop with capped exponential backoff + jitter — if the async iterator throws/ends it would otherwise stop inbound forever (the adapter dedupes any replay). - add an unhandledRejection handler so a stray rejection logs instead of killing the process. - dedup cache (adapter) was a true bounded LRU only for expired entries; a burst of unique ids within the window grew it without limit. Evict oldest at the cap. * chore: add AUTHOR_MAP entry for PhilipAD --------- Co-authored-by: PhilipAD <philipadsouza@gmail.com>	2026-06-09 11:12:58 -04:00
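Two of the hardening steps in the commit above, the constant-time token comparison and the dedup cache that evicts its oldest entry at the cap, can be sketched in a few lines. The sidecar itself is a Node project, so this Python version only illustrates the technique; the class and function names are hypothetical, and the time-window expiry the commit mentions is left out.

```python
import hmac
from collections import OrderedDict


def tokens_match(expected: str, presented: str) -> bool:
    """Compare control tokens without leaking timing information."""
    return hmac.compare_digest(expected.encode(), presented.encode())


class BoundedDedupCache:
    """Remember recently seen ids, never holding more than max_entries."""

    def __init__(self, max_entries: int) -> None:
        self._max = max_entries
        self._seen = OrderedDict()  # insertion order doubles as age

    def seen_before(self, message_id: str) -> bool:
        """Return True for a replay; record and return False otherwise."""
        if message_id in self._seen:
            return True
        self._seen[message_id] = None
        if len(self._seen) > self._max:
            self._seen.popitem(last=False)  # evict the oldest id
        return False
```

With the cap enforced on every insert, a burst of unique ids can push out older ids early, but it can no longer grow the cache without limit.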
Brian D. Evans	b5421f4ba6	fix(deps): declare packaging as a core dependency so it ships everywhere (#40522 ) * fix(deps): declare packaging as a core dependency so it ships everywhere packaging is imported directly on three production paths but was never declared in [project.dependencies], so it only reached users transitively (pip/uv pull it for other tools). The slim official Docker image ships without it, where each try/except-ImportError fallback silently degrades: - plugins/memory/hindsight/__init__.py (_meets_minimum_version) returns False when packaging is absent, disabling update_mode='append' so every session leaks separate Hindsight documents (the reported #40503 symptom). - tools/lazy_deps.py (_is_satisfied) falls back to "installed counts as satisfied", defeating every version-constraint check on lazy extras. - hermes_cli/main.py drops to naive name==version requirement parsing. Promote it to a declared core dep pinned to packaging==26.0 — the exact version already resolved in uv.lock, so there is zero resolution churn (the lock change is two edge annotations marking it transitive->direct). It is a pure-Python py3-none-any wheel with no compiled extensions, safe to ship on every platform. Declaring it also wires it into the _verify_core_dependencies_installed() update-repair guard, which reinstalls missing [project.dependencies] on hermes update. Adds a hermetic tomllib-parse regression test that fails before the declaration and passes after. Fixes #40503 * test(deps): make packaging dep-name extraction PEP 508-robust Address Copilot review on #40522: the inline name-extraction only handled ==, >=, [ and ; and could mis-parse valid requirement strings using <=, ~=, !=, <, > or a direct reference (name @ url). Factor a _distribution_name helper that drops markers, direct-reference URLs and extras, then strips any version operator via regex, so a future dep declared with any PEP 508 specifier shape is matched correctly. 
--------- Co-authored-by: briandevans <252620095+briandevans@users.noreply.github.com>	2026-06-09 11:11:48 -04:00
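The name extraction described in the second bullet of the commit above (drop markers, direct-reference URLs and extras, then strip any version operator) might look roughly like this. It is a sketch, not the test helper from the repository: the function name is hypothetical, and lowercasing the result is an added assumption for easier comparison.

```python
import re


def distribution_name(requirement: str) -> str:
    """Extract the distribution name from a PEP 508 requirement string."""
    spec = requirement.split(";", 1)[0]  # drop the environment marker
    spec = spec.split("@", 1)[0]         # drop a direct-reference URL
    spec = spec.split("[", 1)[0]         # drop extras
    # Cut at the first version-operator character, whitespace, or "(".
    spec = re.split(r"[<>=!~\s(]", spec.strip(), maxsplit=1)[0]
    return spec.strip().lower()
```

For production code, `packaging.requirements.Requirement(...).name` does the same job; a regression test that guards the declaration of packaging itself has a reason to avoid importing it.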
brooklyn!	d046169646	fix(desktop): local-only recents, per-platform sidebar sections, and Ctrl+N regressions (#42537 ) * fix(desktop): keep chat recents focused and reset hotkey target Exclude messaging platform threads from chat recents pagination so Load More returns chat sessions, and clear stale quick-create profile state before Ctrl+N starts a new session. * fix(desktop): surface new sessions in sidebar + unstick new-chat Thinking Two renderer regressions in the desktop chat app: - Sidebar ordering: orderByIds/reconcileOrderIds appended ids missing from the persisted order to the BOTTOM. Callers pass recency-sorted lists (newest first), so a brand-new Ctrl+N session sank below the saved order and read as "my latest session never showed up". Prepend fresh ids so new activity surfaces at the top. - New-chat stuck on "Thinking": terminal/attention state transitions (turn finished, error, or agent now waiting on user) were RAF-batched. Electron throttles requestAnimationFrame to ~0 while the window is backgrounded, occluded, or unfocused, stranding the deferred flush. Flush critical transitions (!busy \|\| needsInput) synchronously; keep the busy heartbeat RAF-batched to avoid scroll churn. Does not touch the messaging-source exclusion in chat recents queries. * fix(desktop): stop excluding messaging platforms from chat recents The "keep chat recents focused" change excluded every messaging-platform source (telegram, discord, slack, …) from the recents query. That silently undid the messaging-source-folder feature already on main (`ede4f5a4a`): the sidebar builds those folders purely from the loaded recents page, so once the sources were filtered out the folders never rendered — telegram and friends vanished from the left sidebar. Only cron stays excluded (it has its own dedicated section). Messaging sessions belong in the sidebar and render with their platform folder/icon. Removes the now-unused MESSAGING_SESSION_SOURCE_IDS export. 
* fix(desktop): give each messaging platform its own self-managed sidebar section Recents are local-only again: cron and every messaging platform are excluded from the chat-recents query, so "Load more" pages through interactive local chats instead of interleaving gateway threads that bury them. Each messaging platform (telegram, discord, ...) is now fetched as its own slice (refreshMessagingSessions) and rendered as a self-managed sidebar section with its platform icon, count, and per-platform "load more" — no source-grouping magic inside recents. Handed-off sessions (live source becomes local after a handoff) keep their origin-platform badge on the row via handoff_platform, so a Telegram thread continued in the desktop still reads as Telegram. * fix(desktop): self-heal a stranded routed session in route-resume An intermittent create/stream race can leave selected/active session ids null while the route stays on /:sid — the transcript then sticks empty even though the turn completed and persisted (the "second Ctrl+N shows no response" symptom). The pathname didn't change, so route-resume's normal gate skipped and the view stayed stuck. Resume whenever the routed session isn't the loaded one, gated on freshDraftReady so the /:sid -> /new transition (which also momentarily nulls selected/active a render before the pathname flips) is NOT treated as stranded. selectedStoredSessionIdRef is set synchronously at resume entry, so this can't loop, and the resume cached fast-path restores the already-streamed messages without a refetch. * fix(desktop): bypass smooth reveal on primary markdown stream Render main assistant text through deferred markdown directly instead of the smooth-reveal wrapper. This isolates the wrapper to reasoning surfaces and avoids the intermittent blank-response regression after consecutive new-session flows.	2026-06-09 14:24:25 +00:00
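The sidebar-ordering fix in the commit above (ids missing from the persisted order are prepended, not appended) can be sketched as below. The renderer code is not Python; the function name mirrors orderByIds for readability only, and dropping saved ids that are no longer present is an assumption of this sketch.

```python
from typing import List, Sequence


def order_by_ids(saved_order: Sequence[str], current_ids: Sequence[str]) -> List[str]:
    """Apply a persisted order to a recency-sorted list of ids.

    Ids not yet in the saved order are treated as fresh activity and
    placed first (in their incoming, newest-first order); known ids
    follow in their saved order.
    """
    saved = set(saved_order)
    current = set(current_ids)
    fresh = [i for i in current_ids if i not in saved]
    kept = [i for i in saved_order if i in current]
    return fresh + kept
```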
xxxigm	57775e9e16	test(agent): cover char-based output-cap overflow parsing (#42741) Add TestParseCharBasedOutputCap for the LM Studio / llama.cpp phrasing (context in tokens, prompt in characters): the reported error resolves to the available output budget, the retried cap plus the estimated input stays inside the window, and a prompt larger than the window falls through to None so the prompt-too-long/compression path still owns that case.	2026-06-09 03:17:12 -07:00
xxxigm	3a74b75217	fix(agent): recover from char-based output-cap overflow (#42741) LM Studio / llama.cpp-style servers report the context window in tokens but the prompt size in characters, e.g. "maximum context length is 65536 tokens. However, you requested 65536 output tokens and your prompt contains 77409 characters". When a provider profile's default_max_tokens equals the model's context window, the very first request asks for the whole window as output and the server returns a hard HTTP 400 — even on a trivial "hi". parse_available_output_tokens_from_error did not recognise this phrasing, so the overflow was misrouted to the prompt-too-long/compression path (which can't help when the input already fits) instead of the output-cap reduction + retry path. Detect the "requested N output tokens" form, estimate the input from the character count (~3 chars/token, conservative so the retried cap stays inside the window), and return the available output budget so the existing retry logic shrinks max_tokens and succeeds.	2026-06-09 03:17:12 -07:00
teknium1	24a934295f	test(yuanbao): add missing patch import to pipeline tests The salvaged refactor's new tests use unittest.mock.patch (25 call sites) but the import line only brought in AsyncMock and MagicMock, so 10 of the new tests failed with NameError. Add patch to the import.	2026-06-09 03:17:00 -07:00
loongzhao	ffcd9d7ac7	refactor(yuanbao): consolidate media resolution into dedicated pipeline middlewares	2026-06-09 03:17:00 -07:00
teknium1	be2f739e9a	test(desktop): cover sleep/wake session recovery in use-prompt-actions Adds three vitest cases for the recovery path: resume+retry on "session not found", no-resume passthrough on other errors, and no-resume when there is no stored session id. Also maps the contributor's commit email in release.py AUTHOR_MAP.	2026-06-09 03:16:59 -07:00
Brian Pasquini	72f522d464	fix(desktop): recover session after sleep/wake gateway restart When the laptop sleeps and wakes, the WebSocket reconnects but the gateway's in-memory session table is cleared. The desktop app still holds the old activeSessionId, so the next prompt.submit call returns error 4001 ('session not found'), surfaced to the user as: 'Prompt failed: session not found' Fix: wrap prompt.submit in a try/catch. On 'session not found', call session.resume with the durable SQLite session ID (selectedStoredSessionIdRef) to re-register the session in the gateway, update activeSessionIdRef to the fresh live session_id, then retry prompt.submit once. If recovery fails or the error is unrelated, the original error is re-thrown and surfaces normally.	2026-06-09 03:16:59 -07:00
JP Lew	cb4cc08b0a	fix(codex): record app-server token usage in session accounting	2026-06-09 02:46:04 -07:00
kshitij	85852b71d8	fix(nemo-relay): preserve downstream errors in adaptive execution (#42691) Based on #42658 by @mnajafian-nv. Preserves the real downstream provider/tool exception when NeMo Relay's managed adaptive execution wraps a failing callback as an internal runtime error. Without this, the original exception (and its retry-classification signal, e.g. status_code) is lost behind Relay's wrapper. Salvage changes on top of the original PR: - Tolerant Relay-wrapper match: _is_relay_wrapped_callback_error now uses str.startswith on the "internal error: <cls>: <msg>" prefix instead of exact equality, so a future Relay version appending a traceback/suffix doesn't silently defeat the unwrap. On a total format change it returns False and falls back to the pre-fix behavior (surfacing Relay's error) rather than masking it. - Deduplicated the LLM and tool execute paths into a shared _run_managed_with_downstream_preservation helper, removing ~20 lines of copy-pasted nonlocal/try-except scaffolding that could drift out of sync. - Added a real-middleware regression guard (test_nemo_relay_downstream_unwrap_matches_real_middleware_wrapper_shape) that drives hermes_cli.middleware._run_execution_chain and asserts the plugin's _original_downstream_error unwraps the actual private _DownstreamExecutionError wrapper. The original synthetic tests modeled the wrapper with a local class, so a rename or shape change in core middleware would not have been caught; this test fails loudly if that contract drifts. Co-authored-by: mnajafian-nv <mnajafian@nvidia.com>	2026-06-09 02:31:10 -07:00
Teknium	8d99b5bc4f	fix(gateway): cap terminal code-block preview in non-verbose mode (#42729) The markdown code-block change rendered args['command'] in full in both verbose AND non-verbose (all/new) modes, so a long or multi-line terminal command bypassed the tool_preview_length cap (default 40) and rendered as a huge block. Non-verbose now collapses to a single line capped at the preview length while keeping the fence; verbose keeps the full command.	2026-06-09 02:28:47 -07:00
kshitij	a38cc69bcc	fix(terminal): complete sane PATH entries on POSIX (salvage of #35614) (#42653) * fix(terminal): complete sane PATH entries on POSIX Fixes macOS gateway/launchd terminal sessions whose PATH already includes /usr/bin while omitting Apple Silicon Homebrew paths. LocalEnvironment._make_run_env() now appends each missing _SANE_PATH entry individually on POSIX, preserving caller precedence and avoiding duplicate sane entries. Root cause: the previous logic used /usr/bin as the sentinel for sane PATH injection. macOS launchd commonly provides /usr/bin while leaving out /opt/homebrew/bin and /opt/homebrew/sbin, so Homebrew-installed CLIs stayed unavailable in terminal tool calls. Salvaged from #35614 by @y0shua1ee. Fixes #35613. Co-authored-by: y0shua1ee <104712437+y0shua1ee@users.noreply.github.com> * test(terminal): harden sane PATH completion against dup/empty entries Follow-up to the #35613 fix. Strengthens _append_missing_sane_path_entries: - De-duplicate the caller-supplied PATH (first occurrence wins) so a PATH that already contains duplicate entries is collapsed rather than carried through. Previously only newly-appended sane entries were guarded against duplication; pre-existing caller duplicates were preserved verbatim. - Drop empty PATH entries (leading/trailing/double ':'), which POSIX shells interpret as the current working directory — a mild foot-gun in a default terminal environment. Behaviour for well-formed PATHs (no duplicates, no empty entries) is byte-identical to before; only malformed/duplicated inputs change. Adds regression tests for: the literal macOS launchd PATH (/usr/bin:/bin:/usr/sbin:/sbin), caller-duplicate collapsing with order preservation, and empty-entry stripping. 
* docs(terminal): clarify PATH normalisation semantics; drop dead set add Addresses review findings on the sane-PATH completion follow-up: - Sharpen the _append_missing_sane_path_entries docstring to state explicitly that on POSIX the caller PATH is rewritten (empty entries stripped, duplicates collapsed) rather than merely appended to, and that well-formed PATHs remain byte-identical bar the appended sane entries. This makes the intentional semantic change visible rather than buried under "hardening". - Document why _path_env_key is a deliberate second Windows guard distinct from the helper's early return (key-casing selection vs standalone safety), so neither is mistaken for redundant and removed. - Drop the dead `seen.add(entry)` in the sane-entry loop: _SANE_PATH is a static duplicate-free constant, so the membership check against the caller entries is sufficient and `seen` is never read afterwards. No behaviour change: verified byte-identical output across the launchd, minimal, empty, duplicate, empty-entry and already-full cases, and re-confirmed gh/brew resolve through the real LocalEnvironment.execute() path under a launchd-style PATH. 133 targeted tests pass. Intentionally NOT consolidating with tools/browser_tool._merge_browser_path: it prepends (vs append), filters on os.path.isdir, uses os.pathsep, and draws from a dynamic candidate set — a shared helper is a separate refactor, out of scope for this bugfix. --------- Co-authored-by: y0shua1ee <104712437+y0shua1ee@users.noreply.github.com>	2026-06-09 02:21:12 -07:00
kshitij	76f89d66de	fix(test): track TERMINAL_CONFIG_ENV_MAP after env-sync consolidation (#42695) `test_terminal_config_env_sync.py::_save_config_env_sync_keys()` AST-scanned `hermes_cli/config.py:set_config_value` for a `_config_to_env_sync = {...}` literal. The terminal-config env bridging was consolidated onto the canonical `TERMINAL_CONFIG_ENV_MAP` (now read via `terminal_config_env_var_for_key()`), so that literal no longer exists and the scanner raised: AssertionError: Could not find `_config_to_env_sync = {...}` literal in source failing 8 of 9 tests on main for every PR. Read the live `TERMINAL_CONFIG_ENV_MAP` instead — the actual source of truth `set_config_value` bridges through — mirroring its `terminal.cwd` exclusion. Refresh the stale module docstring and the now-incorrect error-message hints that still referenced `_config_to_env_sync`. Verified: the suite goes green, and a mutation (dropping `docker_volumes` from `TERMINAL_CONFIG_ENV_MAP`) still trips the pinned regression test, so the drift guard retains its teeth.	2026-06-09 02:11:46 -07:00
helix4u	f8adefdebf	fix(tui): apply terminal backend config before launch	2026-06-09 00:31:27 -07:00
teknium1	dbbd1d4d05	feat(desktop+gateway): remote-gateway file attachments via file.attach @file: attachments now work when the desktop is connected to a remote gateway. Previously a referenced file resolved to a client-disk path the gateway couldn't see, so context_references rejected it with "path is outside the allowed workspace" and the agent never saw the file. Adds a file.attach RPC (sibling to the existing image.attach_bytes / pdf.attach byte-upload pipeline): the desktop uploads the file bytes, the gateway stages them into <workspace>/.hermes/desktop-attachments/ and returns a workspace-relative @file: ref that resolves cleanly. Local mode passes the path directly; a gateway-visible file outside the workspace is copied in; an in-workspace file is referenced as-is with no copy. Consolidates the file-sync design from #38615 (LeonSGP43) and the host-file-staging idea from #33455 (Carry00), rebased onto the image/PDF remote-media helpers already on main. Co-authored-by: LeonSGP43 <cine.dreamer.one@gmail.com>	2026-06-09 00:03:49 -07:00
Teknium	e687292eb4	feat(models): persist Nous recommended-models to disk; fall back on Portal failure (#42628) The Portal's /api/nous/recommended-models endpoint is the source of truth for which models are free/paid right now, but its result was cached in-process only. When the live fetch failed (network, parse, non-2xx), the function returned {} and the model picker silently dropped the free/paid recommendations — free models would vanish with no indication anything went wrong. Add a per-base disk cache at $HERMES_HOME/cache/nous_recommended_cache.json: a successful live fetch is persisted as last-known-good, and a failed fetch with an empty in-process cache falls back to the disk copy instead of {}. Self-heals on the next successful fetch. With no disk copy, still degrades to {} (callers already handle that). Keyed by portal base URL so staging/prod don't collide. E2E: live fetch writes disk; simulated Portal failure returns the cached free models from disk; no-disk + failure returns {}.	2026-06-09 00:03:43 -07:00
Teknium	c4066091ca	feat(models): add laguna-m.1 + nemotron-3-ultra to curated OpenRouter list (#42629) Two new free-tier slugs surfaced in /model and `hermes model`. owl-alpha was already present. Regenerated website/static/api/model-catalog.json to keep the manifest sync test green.	2026-06-08 23:05:35 -07:00
Teknium	50ad191a8b	test(hermes_cli): harden concurrent-gate fixture against partial-import race (#42626) The autouse _suppress_concurrent_hermes_gate fixture did monkeypatch.setattr(main, '_detect_concurrent_hermes_instances', ...) with no raising=False. Its try/except guards the import but not the setattr, so under pytest's per-test spawn isolation a transiently partial hermes_cli.main module (one a concurrent worker is mid-importing) made setattr raise AttributeError and errored unrelated tests in the slice. Add raising=False so a transiently-absent attribute is a no-op default rather than a hard error. The attribute always exists once main.py finishes importing; the real-function opt-out (@pytest.mark.real_concurrent_gate) is unaffected.	2026-06-08 22:54:25 -07:00
teknium1	520b59db16	fix(tui): use canonical get_fallback_chain for parity + map author Follow-up to the salvaged fallback-chain fix: - Replace the hand-rolled fallback loader with the shared hermes_cli.fallback_config.get_fallback_chain() helper so the TUI path matches HermesCLI and gateway/run.py exactly: fallback_providers stays first and keeps order, with distinct legacy fallback_model entries merged in after (deduped). Previously the TUI loader picked one key OR the other, diverging from CLI/gateway when both were set. - Update the test to assert the merged canonical semantics. - Add psionic73 to scripts/release.py AUTHOR_MAP (CI gate).	2026-06-08 22:53:42 -07:00
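
The char-based output-cap recovery described in commit 3a74b75217 can be sketched roughly as follows. This is only an illustration under stated assumptions, not the repository's parse_available_output_tokens_from_error: the function name, the regular expression, and the CHARS_PER_TOKEN constant are hypothetical, and only the error phrasing and the ~3 chars/token estimate come from the commit message.

```python
import re
from typing import Optional

# Assumption: ~3 characters per token, as quoted in the commit message.
# A low ratio overestimates the input, so the retried cap stays in the window.
CHARS_PER_TOKEN = 3

# Hypothetical pattern for the LM Studio / llama.cpp-style phrasing:
# context window in tokens, prompt size in characters.
_CHAR_BASED_OVERFLOW = re.compile(
    r"maximum context length is (\d+) tokens.*?"
    r"requested (\d+) output tokens.*?"
    r"prompt contains (\d+) characters",
    re.IGNORECASE | re.DOTALL,
)


def available_output_tokens(error_message: str) -> Optional[int]:
    """Return the output budget left in the window, or None if not applicable."""
    match = _CHAR_BASED_OVERFLOW.search(error_message)
    if match is None:
        return None  # Different phrasing: leave it to other handlers.
    context_tokens = int(match.group(1))
    prompt_chars = int(match.group(3))
    # Ceiling division so the input estimate is never rounded down.
    estimated_input = -(-prompt_chars // CHARS_PER_TOKEN)
    available = context_tokens - estimated_input
    # A prompt that does not fit at all belongs to the prompt-too-long path.
    return available if available > 0 else None


message = (
    "maximum context length is 65536 tokens. However, you requested "
    "65536 output tokens and your prompt contains 77409 characters"
)
print(available_output_tokens(message))
```

With the example message from the commit, the estimated input is 77409 / 3 = 25803 tokens, so the sketch reports 65536 - 25803 = 39733 tokens as the reduced output cap; a prompt estimated to exceed the window returns None so a separate prompt-too-long path can handle it.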

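The POSIX PATH completion described in commit a38cc69bcc (drop empty entries, collapse caller duplicates with first occurrence winning, then append each missing sane entry) might look like the sketch below. The function name and the contents of SANE_PATH_ENTRIES are assumptions made for illustration; the actual _SANE_PATH constant and _append_missing_sane_path_entries helper are not shown in this log and may differ.

```python
from typing import Sequence

# Assumed, duplicate-free list of sane entries; the real _SANE_PATH may differ.
SANE_PATH_ENTRIES = (
    "/opt/homebrew/bin",
    "/opt/homebrew/sbin",
    "/usr/local/bin",
    "/usr/bin",
    "/bin",
    "/usr/sbin",
    "/sbin",
)


def complete_posix_path(
    path: str, sane_entries: Sequence[str] = SANE_PATH_ENTRIES
) -> str:
    """Normalise a POSIX PATH and append any missing sane entries."""
    seen = set()
    entries = []
    for entry in path.split(":"):
        # Empty entries mean "current directory" to POSIX shells; drop them.
        # Repeated caller entries are collapsed, keeping the first occurrence.
        if not entry or entry in seen:
            continue
        seen.add(entry)
        entries.append(entry)
    # Append (never prepend) so the caller's precedence is preserved.
    for entry in sane_entries:
        if entry not in seen:
            entries.append(entry)
    return ":".join(entries)


# The literal macOS launchd PATH from the commit message.
print(complete_posix_path("/usr/bin:/bin:/usr/sbin:/sbin"))
```

Appending rather than prepending keeps whatever ordering the caller chose, which is the precedence guarantee the commit message describes; a well-formed PATH passes through unchanged apart from the appended entries.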

11191 commits