hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-24 16:54:43 +00:00

Author	SHA1	Message	Date
ruangraung	f4531feee8	fix(telegram): improve MarkdownV2 edit fallback and fix _strip_mdv2 bold handling When edit_message(finalize=True) fails with a MarkdownV2 parse error, the silent fallback previously sent raw content with escape sequences. Now it logs the error and strips markdown formatting via _strip_mdv2() for clean plain-text fallback. Also fixes _strip_mdv2 to handle standard markdown bold (\\text\\) before MarkdownV2 bold (\text\), preventing half-stripped asterisks. Refs: #41955, #41732	2026-06-08 15:53:16 -07:00
ruangraung	6d2732e786	fix(gateway): apply MarkdownV2 formatting on progress message edits When a platform adapter sets REQUIRES_EDIT_FINALIZE=True (e.g. TelegramAdapter), tool progress edits now pass finalize=True so format_message() is applied before sending to the platform. Previously, the initial send() formatted the message correctly via MarkdownV2, but subsequent edit_message() calls skipped formatting (finalize=False), causing raw markdown (e.g. triple backticks for bash code blocks) to render as plain text on Telegram. Refs: #41955, #41732	2026-06-08 15:53:16 -07:00
teknium1	aa424e51ac	refactor(doctor): fold custom-provider vendor-slug check into one predicate Collapse the bare-"custom" allowlist entry and the custom:<name> guard into a single provider_accepts_vendor_slug predicate so the slug-warning suppression reads as one rule instead of two scattered conditions. No behavior change.	2026-06-08 15:53:09 -07:00
helix4u	732ababa1a	fix(doctor): allow vendor slugs for named custom providers	2026-06-08 15:53:09 -07:00
GodsBoy	421226e404	fix(gateway): stop terminal progress from posting the full command to messaging chats #41215 rendered a terminal tool call as a native ```bash fenced block on markdown platforms (Telegram, WhatsApp, Slack, and others), showing the full command with no truncation, in both all/new and verbose modes. That posted complete shell commands (heredocs, internal paths, destructive commands) into the chat before the final answer, visible to everyone in it. This restores the prior behavior: terminal progress shows the short, truncated preview line that every other tool already uses, capped at tool_preview_length. The supports_code_blocks capability flag is left in place for future use. CLI/TUI rendering is a separate path and was unaffected. Adds a regression test asserting terminal progress renders as a truncated preview, not a fenced bash block, even on a markdown-capable gateway. Fixes #41955	2026-06-08 15:53:00 -07:00
Ray Sun	37561c214b	fix(photon): use allowlisted device client_id + validate token before save Photon now allowlists registered device clients on the device-code endpoint; the old client_id "hermes-agent" is rejected with 400 invalid_client, breaking the entire login flow. Switch to Photon's published "photon-cli" device client and send the standard scope. Also validate the device-flow token against /api/auth/get-session and /api/projects/ before persisting it, and extract token candidates from every response shape Photon has used (access_token, accessToken, data.*, set-auth-token header) so a token that authenticates the session lookup but is rejected by the project API fails loudly at login instead of 404ing downstream. Verified live: request_device_code() now returns 200 + a valid user_code where "hermes-agent" returned 400 invalid_client. Salvaged from #34467 by @yanxue06.	2026-06-08 15:52:33 -07:00
Teknium	4615e08d3d	feat(photon): wire outbound media via spectrum-ts attachment() (#42397 ) Photon now exposes attachment send (Ray Sun, photon-nousresearch), so the Photon plugin gains outbound media to match the BlueBubbles iMessage channel. - sidecar: new /send-attachment endpoint wrapping space.send(attachment()) / space.send(voice()); caption sent as a trailing text bubble. - adapter: override send_image/send_image_file/send_voice/send_video/ send_document/send_animation. URL helpers cache to a local path first (cache_image_from_url), file helpers pass through. Defense-in-depth path re-validation before the path reaches the Node sidecar. - _standalone_send (cron): send text first, then each media_file as a /send-attachment call (is_voice -> voice builder). - docs/README: flip the 'outbound attachments not wired' note.	2026-06-08 15:29:16 -07:00
Teknium	5e9d7a7661	fix(skills-hub): stop shipping a degenerate index when GitHub taps collapse (#42347 ) The Skills Hub lost every api.github.com-backed source — the OpenAI, Anthropic, HuggingFace, NVIDIA, gstack, Claude Marketplace and Well-Known tabs all vanished — while ClawHub/skills.sh/LobeHub/browse.sh survived. A GitHub API rate limit during the docs-deploy crawl zeroed all three api.github.com sources (github / claude-marketplace / well-known) at once. Two compounding bugs let the broken index reach the live site: 1. build_skills_index.py wrote the output file BEFORE the health check, so even when the github floor (30) tripped and the script exited 2, the degenerate file was already on disk. deploy-site.yml then swallowed the exit code with `\|\| echo non-fatal` and extract-skills.py read the partial index. Fix: run the health check first, write the file only when healthy, exit without writing on failure. Removed the non-fatal swallow in deploy-site.yml so a collapse fails the deploy and the last good site stays live (Pages serves the previous build). 2. The build-time GitHub listing path returned [] on a 403 rate-limit without retrying or flagging it, so a rate-limited crawl looked identical to an empty source. Fix: a shared _github_get() helper on GitHubSource with retry/backoff (honors Retry-After / X-RateLimit-Reset on 403/429, backs off on 5xx + transport errors) and flags is_rate_limited. Routed _list_skills_in_repo and _fetch_file_content through it; gave ClaudeMarketplaceSource a persistent GitHubSource + is_rate_limited so the builder can name the rate limit as the cause instead of '0 results'. Added tests/scripts/test_build_skills_index_health.py pinning both contracts: a degenerate crawl exits non-zero and writes no file; a healthy crawl writes the index with github/claude-marketplace/well-known all present.	2026-06-08 15:21:28 -07:00
Robin Fernandes	639c1e3636	feat(sessions): add optional max session cap	2026-06-08 15:12:12 -07:00
kshitij	1e3b3dfabb	Merge pull request #40560 from kamonspecial/fix/langfuse-usage-sanitized-response fix(langfuse): restore usage/cost when post_api_request sends a sanitized response	2026-06-08 15:04:37 -07:00
brooklyn!	09a6a2ddd7	fix(desktop): stream the transcript while the window is backgrounded (#42399 ) The chat transcript reaches the screen through a requestAnimationFrame-gated flush (useSessionStateCache). The main BrowserWindow never set backgroundThrottling, so Chromium paused rAF and clamped timers whenever the window was blurred or occluded -- the live answer would stall until the window regained focus or the user refreshed. In practice this bit any time Hermes wasn't the focused window mid-turn (typing in your editor while the agent replies, detached devtools, another window on top), presenting as "thinking, no text, have to refresh." Opt the renderer out of background throttling so a streaming chat app actually streams in the background: - backgroundThrottling: false on the main window (matches the secondary windows that already set it) - disable-renderer-backgrounding / disable-backgrounding-occluded-windows / disable-background-timer-throttling at the process level for the occlusion case Latent since the desktop app landed (#20059), not a recent regression.	2026-06-08 17:01:08 -05:00
kshitij	d3992d1a28	Merge pull request #42331 from mnajafian-nv/fix/nemo-relay-adaptive-config-shape fix(nemo-relay): align adaptive config with tool_parallelism mode	2026-06-08 14:48:58 -07:00
kshitij	1db79bfe1e	Merge branch 'main' into fix/nemo-relay-adaptive-config-shape	2026-06-08 14:42:05 -07:00
Teknium	d6c11a4575	test(run_agent): fix racy ordering in test_concurrent_handles_tool_error (#42356 ) The test keyed the 'which call raises' decision on a shared invocation counter (first call → raise, second → success), then asserted the error landed in messages[0] (c1) and success in messages[1] (c2). But _execute_tool_calls_concurrent runs the two web_search calls on a thread pool with no ordering guarantee — c2's handler can be invoked first, take the 'first call raises' branch, and the error ends up in messages[1]. Results are ordered by tool_call_id, so messages[0] (c1) was then 'success' and the assertion failed. It passed in isolation but reliably failed under CI's full parallel slice (8 xdist workers) where the scheduler actually interleaves the two handlers. Fix: tie the raise to a specific tool call via its arguments (q=boom raises, q=ok succeeds) instead of invocation order, and assert tool_call_id ↔ content pairing explicitly. Deterministic regardless of thread scheduling — verified 10/10 in isolation and the full TestConcurrentToolExecution class (32) green.	2026-06-08 14:40:39 -07:00
kshitij	3f1758d2e4	Merge pull request #41551 from mnajafian-nv/fix/hermes-plugin-openinference-finalization fix(observability): flush plugin-config OpenInference when the final session closes	2026-06-08 14:29:34 -07:00
kshitij	cf49630379	Merge branch 'main' into fix/hermes-plugin-openinference-finalization	2026-06-08 14:19:18 -07:00
kshitij	9fd3d5cf85	Merge pull request #42380 from kshitijk4poor/chore/author-map-mnajafian chore(release): add mnajafian-nv to AUTHOR_MAP	2026-06-08 14:17:53 -07:00
kshitijk4poor	a1cb84aca9	chore(release): add mnajafian-nv to AUTHOR_MAP Unblocks #41551 (and any future mnajafian-nv contributions) from the contributor-attribution check. Maps mnajafian@nvidia.com -> mnajafian-nv.	2026-06-09 02:40:43 +05:30
teknium1	754154a9c2	fix(tests): retry per-file pytest subprocess once on exit-4 when the file exists The parallel test runner sharded a present, tracked test file (tests/plugins/platforms/photon/test_inbound.py) onto a slice that then reported 'file or directory not found' (pytest exit 4) at exec time — even though the planner had just enumerated the file via --collect-only ('5269 passed, 0 failed' in the same run). On loaded shared CI runners the per-file subprocess can fail to stat a file the planner already saw; the deterministic LPT slicer then reproduces it on every rerun because the same file set lands on the same shard. Fix: when a per-file run exits 4 AND the file still exists on disk, retry the subprocess once before surfacing it as a hard failure. This kills the shard-flake class for everyone, not just this PR. Does NOT widen the exit-5-is-pass rule — exit 4 on a genuinely missing file still fails (verified). Retry reuses the same pgroup-kill cleanup as the primary run so no grandchildren orphan. Validation: photon dir runs green through scripts/run_tests_parallel.py; unit-level negative case confirms a nonexistent file still returns rc=4.	2026-06-08 13:38:30 -07:00
teknium1	1866518574	feat(photon): group-chat mention gating for full channel parity Adds the last missing parity piece vs the established channels: group chats can be made opt-in via a mention wake word, exactly like the BlueBubbles iMessage channel. - require_mention + mention_patterns, read from config.extra (config.yaml via the generic gateway bridge) or PHOTON_REQUIRE_MENTION / PHOTON_MENTION_PATTERNS env vars. Same shapes BlueBubbles accepts (list / JSON / comma / newline), same default Hermes wake words. - _dispatch_inbound drops unmatched group messages and strips the leading wake word from matched ones; DMs are never gated. - plugin.yaml + docs document both knobs and the config.yaml form. - New test_mention_gating.py (8 tests): default-off, group drop/pass, wake-word strip, DM bypass, custom patterns, env comma-list, invalid regex skip. The config.yaml -> extra bridge needed no core change — the generic shared-key loop in gateway/config.py already iterates plugin platforms (_shared_loop_targets += plugin_entries()), so require_mention / mention_patterns flow through automatically. Note: outbound media is the one capability Photon still can't reach — Photon exposes no HTTP send-attachment endpoint yet (documented API limitation), so the sidecar can't send files. Not faked. Validation: 34/34 photon tests; E2E confirms config.yaml require_mention + custom mention_patterns bridge through load_gateway_config into a live adapter and gate/strip correctly.	2026-06-08 13:38:30 -07:00
teknium1	d7f42e368e	feat(photon): full channel parity — gateway setup, pairing, PII redaction, doc fixes Brings Photon in line with how every other Hermes gateway channel behaves, instead of being a one-off with its own surfaces. - gateway setup: register a `setup_fn` so Photon appears in `hermes gateway setup` (the unified wizard) and runs the same device-login + project + user + sidecar flow as `hermes photon setup`. Adds `cli.gateway_setup()` as the zero-arg entry point. - PII redaction: flip `pii_safe` False -> True. The comment already said iMessage E.164 numbers should be redacted; the value contradicted it. Now matches BlueBubbles (the other iMessage channel) which is in _PII_SAFE_PLATFORMS — phone numbers are stripped before reaching the LLM. - Pairing/authz: already worked via the registry's allowed_users_env / allow_all_env generic path in authz_mixin; documented it. The adapter forwards unauthorized DMs to the gateway (no intake gating), so the pairing handshake fires and `hermes pairing approve photon <CODE>` works. - Docs: fixed the `hermes photon status` output block to match the real labels (project key / webhook key, not project secret / webhook secret), added the missing PHOTON_API_HOST / PHOTON_DASHBOARD_HOST / PHOTON_HOME_CHANNEL_NAME env vars, and added gateway-setup + authorize-users sections mirroring the other channel docs. Validation: 26/26 photon tests, 6504/6504 gateway+plugins tests, registry E2E confirms setup_fn dispatch + pii_safe + authz envs all wired.	2026-06-08 13:38:30 -07:00
teknium1	630318e958	refactor(photon): fold device login into setup, drop standalone login verb Every other Hermes gateway channel onboards through a single setup surface (paste a token / run the wizard) with no per-platform login command. Photon's device-code flow is unavoidable because Photon mints credentials via API rather than a copy-paste dashboard field, but exposing it as a top-level `hermes photon login` verb broke channel parity. - Remove the `login` subcommand; setup already runs the device flow as its first step. `--no-browser` moves onto `setup`. - Rename `_cmd_login` -> `_run_device_login` (internal helper). - Status / credential-summary hints now point at `hermes photon setup`. - README updated to the one-command onboarding flow.	2026-06-08 13:38:30 -07:00
teknium1	8f89c4615f	chore(photon): clean up ty type-checker warnings from lint-diff bot The advisory lint-diff bot flagged 17 new ty diagnostics. 6 are `unresolved-import` for httpx/aiohttp/pytest, which is structural (CI lint env has no project deps) and matches every other platform plugin's noise floor. The remaining 11 are real and fixable: - `Optional[callable]` → `Optional[Callable[..., None]]` (auth.py) invalid-type-form on `callable` as a type expression. Added the proper `typing.Callable` import. Two sites: on_pending in poll_for_token, on_user_code in login_device_flow. - Dropped three unused `# type: ignore` comments on hermes_constants / hermes_cli.config imports — ty can resolve those modules fine, the comments were dead. - _supervise_sidecar(proc) widened `proc.stdout` from `IO[Any] \| None` to a narrowed local after an early `is None` guard. Defensive against subprocesses launched without stdout=PIPE. - cli.py _cmd_setup: dropped the `has_existing_project = bool(...)` intermediate, did the narrowing inline with `if existing_id and existing_secret:` so ty can see project_id/project_secret are non-None when create_user is called. - test_inbound.py: replaced three `adapter.handle_message = fake_handle # type: ignore[assignment]` with `monkeypatch.setattr(adapter, 'handle_message', fake_handle)`. Same behavior, no type-ignore, and the monkeypatch reverts cleanly between tests. Validation: ty check plugins/platforms/photon/ tests/plugins/platforms/photon/ → All checks passed! tests/plugins/platforms/photon/ → 26/26 pass py_compile clean Windows footgun checker → 0 footguns	2026-06-08 13:38:30 -07:00
Teknium	083d8b2d60	fix(photon): collapse credential summary to single-emit literal-blob CodeQL ignored the # lgtm[...] suppressions on default-config hosted scans — same three high-severity false positives stayed open at auth.py:461-463. Last code-level attempt: drop the per-line emit() calls in favor of - reading every credential into a tight prelude block that resolves each to a display literal in a dict-typed local - assembling the full 6-line banner as a list of plain strings - calling emit() ONCE with '\\n'.join(rows) CodeQL's flow tracker often gives up at the dict-literal + str-concat + list-join boundary because it has to track taint through index access AND string concatenation AND join. Worth one more shot before asking for an admin dismissal. Output is byte-identical; live smoke confirms the same status table renders. 26/26 photon tests still pass. If CodeQL still flags this on the next scan, the architecture is as clean as it can get without obfuscation and the right call is to dismiss the three alerts as false positives in the Security tab (documented escape valve for this rule).	2026-06-08 13:38:30 -07:00
Teknium	6a0cc9bf92	fix(photon): suppress CodeQL clear-text-logging false-positives in auth.py After four iterations the taint flow finally settled on auth.py's print_credential_summary, which emits four lines like `emit(f" device token : {_present_token()}")`. The `_present_*()` closures collapse credentials into display literals ("✓ stored" / "✗ missing") before the f-string evaluation, so no secret bytes ever reach emit() — but CodeQL's interprocedural taint tracker can't see through the closure-then-literal-return pattern and keeps flagging the four lines. This is the appropriate place for an inline suppression: - auth.py is the only module that legitimately handles the secret; every other surface (cli.py, adapter.py, tests) routes through these helpers and stays clear of taint. - The four lines are physically the boundary between credential-reading code and a display callback. Without the `emit(...)` calls there is no status command. - The suppression is per-line with a comment explaining the misfire pattern so a future maintainer can see the reasoning without git-archaeology. If GitHub's hosted CodeQL doesn't honor # lgtm comments on default- config scans we'll need to dismiss these as false positives in the Security tab once — that's the standard escape valve for this rule. Validation: tests/plugins/platforms/photon/ → 26/26 pass py_compile clean	2026-06-08 13:38:30 -07:00
Teknium	2ee7abf271	fix(photon): emit credential summary via callback so no tainted value escapes auth.py The previous pass moved credential reads into auth.credential_summary() which returned a dict of pre-formatted display strings. CodeQL's interprocedural taint analysis still flagged the cli.py prints because the dict's values were transitively derived from load_photon_token() and load_project_credentials(). Pattern that finally works: same as persist_webhook_signing_secret — the helper takes an emit callback and does the formatting + emitting itself. cli.py passes `print` as the sink and never receives any return value derived from credential reads. CodeQL's flow stops at the helper's emit() boundary. Changes: - auth.print_credential_summary(emit=print) — closure-scoped probes, emits 6 lines (header + separator + 4 credential rows) via the callback. Returns None. - cli._cmd_status now calls print_credential_summary(print) then appends the two non-credential rows (node binary, sidecar deps) locally with no credential flow. - Added test_print_credential_summary_emits_only_display_strings asserting the emit callback never sees raw token/secret bytes. Validation: tests/plugins/platforms/photon/ → 26/26 pass live smoke: hermes photon status (with empty HERMES_HOME) renders the expected layout cleanly	2026-06-08 13:38:30 -07:00
Teknium	55fb422f6f	fix(photon): isolate ALL secret-touching prints behind auth.py helpers CodeQL was still flagging three taint-flow alerts in cli.py — its flow tracker keeps spreading the 'sensitive' label through every variable that even touched a credential-returning function, including 'has_token = bool(load_photon_token())' and the redacted-response dict returned by persist_webhook_signing_secret. Refactor: 1. cli.py _cmd_status now calls a new auth.credential_summary() that returns a {key: pre-formatted display string} dict. All probes + bool checks happen inside the helper. cli.py never sees a token or secret variable, only literals like '✓ stored' / '✗ missing'. 2. persist_webhook_signing_secret(webhook_data, *, on_summary=print) now owns the formatting + writing + status messages. It returns only a bool. The redacted-response JSON dump + 'saved to <path>' confirmation are emitted via the on_summary callback, so cli.py passes as the sink and never receives the path/dict back. cli.py is now mechanical: register_webhook → persist (with print) → return 0/1. Zero credential-tainted variables in cli.py at all. 3. Tests updated for the new signatures and a credential_summary guard added (the helper must never leak raw token/secret bytes into its return strings). Validation: tests/plugins/platforms/photon/ → 25/25 pass scripts/check-windows-footguns.py --all → 0 footguns py_compile clean	2026-06-08 13:38:30 -07:00
Teknium	91db0ab420	fix(photon): clear remaining CodeQL clear-text-{logging,storage} alerts Down to 4 CodeQL alerts after the last pass; all addressed: cli.py:215 (clear-text-logging-sensitive-data) The status banner literal 'project secret : ✓ stored' tripped CodeQL's variable-name heuristic even though only a boolean was interpolated. Renamed the column labels to 'project key' and 'webhook key' — fields contain only ✓ stored / ✗ missing / ⚠ unset literals now, the word 'secret' is no longer in the source. cli.py:283 (clear-text-logging-sensitive-data) The fallback path for register-webhook used to echo 'PHOTON_WEBHOOK_SECRET=<value>' to stdout when the .env write failed. Removed entirely — there is no scenario where we should print the secret. On failure we now tell the user to fix the .env permissions and re-register (after deleting the orphaned webhook from the Photon dashboard). cli.py:354 (clear-text-storage-sensitive-data) + cli.py:276 (clear-text-logging-sensitive-data) Replaced the hand-rolled .env writer in cli.py with the canonical hermes_cli.config.save_env_value helper that every other API-key persistence path uses (OpenAI key, Anthropic, Telegram, ...). Moved the persist logic into auth.py as persist_webhook_signing_secret(webhook_data) so the signing-secret value never gets bound to a local in cli.py at all — cli.py hands the raw API response straight to the helper and receives back only the path + a redacted copy of the response for display. This both matches project convention and removes the taint flow CodeQL was tracking. Bonus cleanup: - dropped unused 'from typing import Any, Optional' in cli.py - added 2 tests covering persist_webhook_signing_secret (writes env successfully + returns redacted copy + no-secret-no-write) Validation: tests/plugins/platforms/photon/ → 24/24 pass scripts/check-windows-footguns.py --all → 0 footguns py_compile on all photon modules → clean	2026-06-08 13:38:30 -07:00
Teknium	3a0f6ac3d4	fix(photon): satisfy Windows footgun + CodeQL checks CI red on three blocking checks; all addressed: 1. Windows footguns: os.killpg() flagged as POSIX-only despite the sys.platform != 'win32' guard. Static scanner doesn't see flow. Added the documented '# windows-footgun: ok' suppression. 2. test (3): tests/plugins/platforms/photon/__init__.py shadowed the real plugin's __init__.py because test_plugin_platform_interface.py looks at PROJECT_ROOT/plugins/platforms/<name>/__init__.py with PROJECT_ROOT=tests/ (pre-existing bug in that test, made visible by the new test directory layout). Dropping the empty test __init__.py restores the prior NOTSET parametrize behavior. 3. CodeQL (7 alerts in new code): - cli.py: stop printing the first 8 chars of the bearer token after login — even prefixes are partial credentials. - cli.py: stop printing the first 8 chars of project_secret after setup, same reason. - cli.py 'hermes photon webhook register': stop dumping the raw register-webhook response (contained signingSecret) and stop echoing PHOTON_WEBHOOK_SECRET to stdout. Write it directly to ~/.hermes/.env (0o600), preserving existing entries; fall back to manual instructions only if the file write fails. Photon still only returns the secret once; this just doesn't put it in scrollback / shell history. - cli.py setup + status: rename project_id/project_secret/token locals to has_* booleans before printing, breaking CodeQL's taint flow through f-string interpolations. Drop diagnostic prints of phone / assignedPhoneNumber that flagged as 'sensitive data' false positives. - sidecar/index.mjs: stop returning the raw error message (potentially containing stack trace) in HTTP 500 responses; supervisor logs the real error to stderr, client only sees a generic 'internal sidecar error'. Validation: - scripts/check-windows-footguns.py --all → 0 footguns (518 files) - tests/plugins/platforms/photon/ → 22/22 pass - tests/gateway/test_plugin_platform_interface.py → 7/7 pass, collects NOTSET (matches pre-PR state) - tests/gateway/test_platform_registry.py → 50/50 pass - node --check sidecar/index.mjs clean	2026-06-08 13:38:30 -07:00
Teknium	5b4e431e8c	feat(gateway): add Photon Spectrum (iMessage) platform plugin First-class iMessage support via Photon's managed Spectrum platform. Targeted as a successor to the BlueBubbles adapter — Photon allocates the iMessage line, handles delivery, and abuse-prevention so users don't have to run their own Mac relay. Free tier uses Photon's shared line pool. Architecture: - Inbound: signed JSON webhooks (X-Spectrum-Signature, HMAC-SHA256) delivered to a local aiohttp listener. Dedupes on message.id, rejects deliveries with >5min timestamp drift. - Outbound: small supervised Node sidecar that runs the spectrum-ts SDK. Photon does not currently expose a public HTTP send-message endpoint; the sidecar is the only way to call Space.send() today. When Photon ships an HTTP send endpoint we collapse the sidecar into _sidecar_send and drop the Node dep — every other layer of the plugin stays the same. - Setup: 'hermes photon login' runs the RFC 8628 device-code flow; 'hermes photon setup' creates a Spectrum-enabled project, creates a shared user (free tier), installs the sidecar's npm deps. - Webhook management: 'hermes photon webhook register\|list\|delete'. - Credentials persisted under credential_pool.photon / credential_pool.photon_project in ~/.hermes/auth.json. Plugin path (not built-in) — per current policy (May 2026), all new platforms ship under plugins/platforms/. Registers itself via ctx.register_platform() + ctx.register_cli_command(), zero edits to core gateway code. Tests cover: - HMAC-SHA256 signature verification (happy path, tampered body, wrong secret, drift, missing v0 prefix, empty inputs, non-integer timestamp) - Inbound dispatch for text DMs, group ids (any;+;...), and attachment metadata markers - Deduplication window - check_requirements gating when Node is absent - Device-code flow: request, header-based token return, body-fallback token return, access_denied propagation - Project/user/webhook API clients with mocked httpx Known limitations (current Photon API): - Attachments are metadata only — no download URL yet - Outbound attachment send not wired (sidecar can add easily) - Reactions / message effects not exposed yet Docs: website/docs/user-guide/messaging/photon.md + sidebar entry.	2026-06-08 13:38:30 -07:00
brooklyn!	6e7033bb4c	fix(desktop): don't drop the focused chat's own stream when unscoped (#42359 ) #42178 dropped every session-scoped gateway event that arrived without an explicit session_id, to stop background activity attaching to the focused chat. But the gateway already stamps background sessions with their own id, so an unscoped message/reasoning/tool/prompt event can only be the focused turn's own output. Dropping those swallowed the live answer — it reappeared only after a transcript refetch (manual refresh). Narrow the guard to subagent.* (the only genuinely background/async family); everything else falls back to the active session as before.	2026-06-08 15:24:15 -05:00
Brooklyn Nicholson	e88116256c	fix(update): scope git fetch to target branch A bare `git fetch origin` (and `git fetch upstream`) pulls every ref. The repo carries thousands of auto-generated branches, so on any non-single-branch checkout the installer's update path and `hermes update` spend minutes downloading the full branch list — long enough to stall the desktop installer or trip the follow-up `git pull --ff-only`. Scope every update-path fetch to the branch we actually compare/merge against: - scripts/install.sh: collapse the remote to single-branch and fetch only $BRANCH on the "existing install, updating" path. - hermes_cli/main.py: fetch the resolved branch in the apply path, the --check path (upstream + origin), and the fork upstream-sync. Tracking-ref updates still happen via git's opportunistic refspec, so the later origin/<branch> rev-parse/rev-list checks are unaffected. Tests assert the apply-path fetch is branch-scoped and never bare.	2026-06-08 15:24:31 -04:00
Teknium	2f510ca8e0	fix(deps): align anthropic extra pin with lazy pin + guard whole pin surface (#42335 ) The anthropic extra pinned anthropic==0.86.0 while LAZY_DEPS['provider.anthropic'] pins 0.87.0 (CVE-2026-34450, CVE-2026-34452) — the same drift class as the aiohttp #31817 downgrade. On hermes update the extra pin won and rolled anthropic 0.87.0 -> 0.86.0, reopening both CVEs until the native-Anthropic lazy refresh re-bumped it. Bump the extra to 0.87.0, regenerate uv.lock, and generalize the regression guard: test_pyproject_pins_match_lazy_deps_pins now fails if ANY package pinned in both a pyproject extra and a LAZY_DEPS entry drifts, so a third package can't reintroduce this class. The aiohttp-specific test is kept for focused #31817 coverage.	2026-06-08 12:11:54 -07:00
teknium1	c78b3e1d3c	fix(auth): add Codex OAuth accounts as distinct pool entries hermes auth add openai-codex now creates an independent manual:device_code pool entry per account instead of routing through the singleton _save_codex_tokens save path, which collapsed every added account into the latest login (the second add overwrote the first account's singleton-mirrored device_code entry). This is the add-path half of #39236; PR #39243 (already on this branch) fixes the re-auth half. manual:device_code entries refresh from their own token pair (_sync_codex_entry_from_auth_store only adopts the singleton for source=="device_code"), so they need no providers.openai-codex shadow. Adding the first credential marks openai-codex active (the singleton path did this implicitly) so the setup wizard's get_active_provider() check still passes; subsequent adds leave the active provider untouched. Adds SOURCE_MANUAL_DEVICE_CODE constant and a regression test that two distinct accounts keep distinct token pairs. Updates two existing add tests to the pool-only behavior. Co-authored-by: glesperance <info@glesperance.com>	2026-06-08 11:57:03 -07:00
Ted Malone	761b744abb	fix(auth): preserve independent Codex pool entries on re-auth (#39236 ) The #33538 fix refreshed every credential_pool entry with source "manual:device_code" on every Codex OAuth re-auth, on the assumption that such entries were always legacy aliases of the singleton from the #33000 workaround era. That assumption is no longer true: `hermes auth add openai-codex` also produces "manual:device_code" entries for independent ChatGPT accounts, and the broad sync silently clobbered them with the latest-authenticated token pair (labels preserved, token material overwritten, status / quota readings then lie). Narrow the sync: refresh a "manual:device_code" entry only when its existing access_token matches the previous singleton access_token (true legacy alias). Entries with distinct token material represent independent accounts and are now left alone. Error markers are cleared only on entries actually rewritten, so an independent account's own 429 / 401 state survives a re-auth that targeted a different account. Tests: * New: independent acctB/acctC are not overwritten when acctA re-auths. * New: legacy singleton-alias still refreshed (preserves #33538). * New: missing previous singleton state handled (no crash, no false alias match). * New: access_token-only alias match (legacy schema without refresh_token still recognized). * New: error markers cleared only on entries actually refreshed. * Updated: existing manual-device-code sync test now covers both the legacy-alias path AND the independent-account path in one fixture. Behaviour change is zero for users with a single Codex account and zero for users whose only "manual:device_code" entry is the legacy alias of the singleton. Users with multiple independent Codex accounts added via `hermes auth add` now keep their distinct token material across re-auths. Local: 29 passed in tests/hermes_cli/test_auth_codex_provider.py, no new failures in tests/hermes_cli/ vs upstream/main baseline. Fixes #39236.	2026-06-08 11:57:03 -07:00
Teknium	c9094f5e5f	fix(stream): don't report dropped mid-tool-call streams as output truncation (#42314 ) * fix(stream): don't report dropped mid-tool-call streams as output truncation A streaming tool call whose SSE ends with no finish_reason (the upstream delivers the tool name + opening '{' then closes the connection cleanly, no terminator, no [DONE]) was stamped finish_reason='length' by the mock builder. That routed it through the output-cap truncation path: 3 useless max_tokens-boosted retries, then the misleading 'Response truncated due to output length limit' error — even though the model never reported hitting any cap. Reproduced live on nvidia/nemotron-3-ultra:free via the Nous dedicated endpoint, which stalls/drops during large tool-arg generation (50s-4m41s). Now: when tool args are incomplete AND the provider sent no finish_reason, tag the response as a partial-stream stub so the loop reports an honest mid-tool-call drop and asks the model to chunk its output (existing continuation machinery), instead of escalating output budget and lying. A provider-reported finish_reason='length' still takes the real-truncation path unchanged. * test(stream): update truncated-tool-args test for drop-vs-cap split test_truncated_tool_call_args_upgrade_finish_reason_to_length pinned the old behaviour where ANY incomplete tool args → finish_reason='length' with tool_calls preserved. That single-chunk-no-finish_reason scenario is exactly the mid-tool-call stream drop now reclassified as a partial-stream stub. Split into two tests matching the new contract: - no finish_reason + incomplete args → PARTIAL_STREAM_STUB_ID, tool_calls=None, _dropped_tool_names set (the drop path) - explicit finish_reason='length' + incomplete args → tool_calls preserved, 'length' upgrade unchanged (the genuine output-cap path)	2026-06-08 11:56:10 -07:00
teknium1	89d380261d	fix(approval): resolve Hermes home at detection time, not import time helix4u's fix snapshotted the resolved HERMES_HOME into the static config/env patterns at module-import time. That breaks when HERMES_HOME is set after tools.approval is imported (the hermetic test conftest, any deferred-profile-resolution path), and made the PR's own 4 new tests red. Move the resolution into _normalize_command_for_detection(): rewrite the live resolved absolute home prefix (and its symlink-resolved form) to the canonical ~/.hermes/ form before pattern matching. Tracks the live env, needs no regex recompile, and folds the absolute form into the shared _SENSITIVE_WRITE_TARGET so > redirects, tee, cp, etc. are covered too — not just sed/perl/ruby in-place edits.	2026-06-08 11:55:40 -07:00
helix4u	b0efe1d64b	fix(approval): gate resolved Hermes config paths	2026-06-08 11:55:40 -07:00
xxxigm	96fd9d4979	fix(desktop): stop running Hermes.exe locking win-unpacked before Windows pack (#42100 ) * fix(desktop): stop running app locking win-unpacked before pack On Windows a running Hermes.exe keeps an exclusive lock on release/win-unpacked/Hermes.exe, so electron-builder's pack cannot replace it and dies with "remove ...\Hermes.exe: Access is denied" / ERR_ELECTRON_BUILDER_CANNOT_EXECUTE (before-pack hits the same EPERM cleaning the dir, and the cache-purge retry repeats the failure since the lock is still held). Before building the packaged app, terminate any process whose executable lives inside this build's release/ tree so the rebuild -- including the installer's headless --update rebuild -- can replace the binary. Scope is narrow (only exes under release/), POSIX is a no-op (it can unlink a running binary), and the final error now points Windows users at the running-app cause. * test(desktop): cover the win-unpacked lock-breaker helper Verify _stop_desktop_processes_locking_build is a no-op off-Windows, terminates only processes whose exe lives under release/ (sparing our own PID and unrelated installs), and short-circuits when no release dir exists.	2026-06-08 11:51:31 -07:00
mnajafian-nv	021d1034d0	fix(nemo-relay): align adaptive config with tool_parallelism mode Signed-off-by: mnajafian-nv <mnajafian@nvidia.com>	2026-06-08 11:48:19 -07:00
Teknium	abcf996b1f	feat(windows): enable dashboard /chat tab via ConPTY (win_pty_bridge) + tests (#42251 ) * feat(windows): enable dashboard chat tab via ConPTY (win_pty_bridge) Add hermes_cli/win_pty_bridge.py — a pywinpty-backed drop-in for PtyBridge with the same spawn/read/write/resize/close surface — and wire it into the web_server PTY import block so Windows picks it up instead of falling back to None. pywinpty is already a declared win32 dependency (pyproject.toml). The ConPTY read path runs inside run_in_executor so the event loop is never blocked. Spawn/read/write/terminate call shapes are taken directly from tools/process_registry.py which already exercises the same pywinpty version. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * docs: remove WSL2-only caveat for dashboard chat tab The chat pane now works on native Windows via the ConPTY bridge added in the previous commit. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * test(windows): cover ConPTY bridge + web_server platform-branched import Companion to the bridge added in the previous commits. Verified live on native Windows 11 (pywinpty 2.0.15) against `hermes dashboard`'s `/api/pty` WebSocket: the spawned `hermes --tui` (node entry.js) renders through ConPTY, resize escapes reach `setwinsize`, and closing the WS reaps both the node child and the pywinpty agent with zero orphans. tests/hermes_cli/test_win_pty_bridge.py Mirrors the layout of the existing POSIX test_pty_bridge.py: spawn/io/resize/close/env coverage against cmd.exe and python -c, plus the cross-platform fallback surface (PtyUnavailableError, the off-Windows `spawn -> raises PtyUnavailableError` guard, and the load-bearing _clamp() helper that protects setwinsize from garbage winsize values out of xterm.js). tests/hermes_cli/test_web_server_pty_import.py Asserts that web_server.PtyBridge resolves to WinPtyBridge on win32 and to the POSIX PtyBridge on POSIX, that PtyUnavailableError is the matching class on each side (so isinstance checks in /api/pty's spawn fallback path work), and a source-text check that pins the platform-branched import shape so a future refactor can't quietly collapse it back to a POSIX-only import. scripts/release.py AUTHOR_MAP entries so CI release-note generation can resolve both authors' plain (non-noreply) emails to their GitHub logins. Co-Authored-By: JoelJJohnson <josephjohnson.joel@gmail.com> Co-Authored-By: Nea74 <andreas@schwarz-ketsch.de> --------- Co-authored-by: JoelJJohnson <josephjohnson.joel@gmail.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: Nea74 <andreas@schwarz-ketsch.de>	2026-06-08 11:32:43 -07:00
cresslank	c6d27addf7	fix(deps): align aiohttp extras pins with lazy Slack pin (3.13.4) The messaging/slack/homeassistant/sms extras exact-pinned aiohttp==3.13.3 while LAZY_DEPS['platform.slack'] already pins 3.13.4 (the CVE fix). On `hermes update` the extras pin won, downgrading aiohttp 3.13.4 -> 3.13.3 and reopening 10 published advisories (CVE-2026-34513/34515/34516/34517/ 34518/34519/34520/34525, -22815, -34514) until Slack's lazy refresh re-upgraded it. Bump all four extras to 3.13.4 to match the lazy pin, regenerate uv.lock, and add test_pyproject_aiohttp_pins_match_lazy_slack_pin to guard the alignment going forward. Fixes #31817	2026-06-08 11:30:48 -07:00
teknium1	5916248dc0	chore: add AUTHOR_MAP entry for rbrtbn (salvage #25939 )	2026-06-08 11:29:53 -07:00
BarnacleBoy	550b72dd87	fix(cli): gate tool-rendering paths with tool_progress_mode, not quiet_mode quiet_mode was being used to suppress tool-result display when tool_progress_mode was 'off'. But quiet_mode also gates operational status messages, so users with /verbose + tool-progress off lost all status output. Adds a dedicated tool_progress_mode attribute to AIAgent; the tool_executor result-rendering path gates on tool_progress_mode != 'off'. The CLI passes its tool_progress_mode through agent setup and the tool-progress cycle command syncs it onto the live agent. Fixes #33860.	2026-06-08 11:29:53 -07:00
Robert Ban	4129092fda	fix(cli): strip OSC 8 hyperlink sequences in ChatConsole output prompt_toolkit's ANSI parser does not handle OSC escape sequences (\x1b]...\x07 / \x1b]...\x1b\), which caused Rich's [link=...] markup to leak raw OSC 8 payload into the banner title after /clear. Added _OSC_ESCAPE_RE to strip OSC sequences in ChatConsole.print() before routing through _cprint(). CSI/SGR color sequences are preserved. Visible text between OSC sequences is kept intact.	2026-06-08 11:29:53 -07:00
liuhao1024	8e4c447e5f	fix(gateway): prevent duplicate user messages in state.db When the agent has its own SessionDB reference (_session_db is not None), _flush_messages_to_session_db() persists user messages to SQLite during the agent run. Two gateway fallback paths also wrote the same user message without skip_db=True, creating duplicate entries in state.db: 1. agent_failed_early path (transient 429/timeout failures) 2. not-new-messages path (history_offset >= len(messages) edge case) Move agent_persisted flag definition to before the if/elif/else block so all paths can use it, and pass skip_db=agent_persisted to every fallback append_to_transcript() call. Fixes #42039	2026-06-08 11:29:53 -07:00
brooklyn!	9b1e0d6f70	feat(desktop): assignable themes per profile (#42286 ) * feat(desktop): assignable themes per profile The desktop skin was a single global preference, so every profile shared one look. Make the theme assignment per profile: picking a theme assigns it to the profile that's currently live, and switching profiles paints that profile's own skin. A profile with no assignment inherits the global default, so single-profile installs and existing setups are unchanged. - themes/context.tsx: per-profile skin record in localStorage; ThemeProvider follows $activeGatewayProfile; boot paint uses the last active profile's theme to avoid a flash on a non-default relaunch; setTheme assigns to the live profile (default profile also seeds the legacy global fallback). - settings/appearance-settings.tsx: caption noting the theme is saved per profile, shown only when more than one profile exists. - i18n: themeProfileNote string across en/zh/zh-hant/ja. - themes/profile-theme.test.ts: resolution + inheritance coverage. * feat(desktop): make light/dark mode per profile too The command palette / theme picker sets skin + mode together on each pick, so leaving mode global meant a profile couldn't actually remember the full look it was given (e.g. "Ember Dark" in one profile would render Ember Light if another profile last flipped the global mode). Mirror the per-profile skin record for light/dark mode: ThemeProvider resolves and applies the active profile's mode on switch, the boot paint uses it, and setMode assigns to the live profile (default profile also seeds the legacy global mode fallback). * refactor(desktop): collapse per-profile skin/mode into one helper Skin and mode were near-identical resolve/assign pairs with hand-rolled try/catch around localStorage. Fold both into a single profilePref<T> factory (resolve + assign, default profile seeds the legacy global) and lean on storedString/persistString for the error-swallowing. Tests go table-driven over both prefs since they share one contract. No behavior change; -89 LOC. * refactor(desktop): treat default profile as the global slot directly "default" isn't a real profile — it is the legacy global value. Stop double-writing (record['default'] + global) on assign; route default straight to the global. resolve is unchanged: a profile with no record entry already falls back to the global, so default reads it for free.	2026-06-08 17:42:17 +00:00
brooklyn!	395ed91891	fix(desktop): keep a just-finished session visible after switching away (#42285 ) A brand-new session's first turn persists to the SessionDB a beat after the gateway emits message.complete, so a refresh fired in that window gets a listSessions(min_messages=1) page that omits the new row. sessionsToKeep() already shields the active chat from this race, but a session you started and then navigated away from is — at the next refresh — neither working, pinned, nor active, so mergeSessionPage() evicts it. Nothing re-fetches afterward, so it stays gone until the app restarts. Track sessions whose turn just settled (a real working->idle transition) in a short, auto-expiring grace window and add them to the merge keep-set. This bridges the persist race for non-active chats without resurrecting deleted rows (mergeSessionPage only revives rows still in the in-memory list, which optimistic delete/archive already drop). Repro: start a new chat, send a message, then click another session before the reply lands — the new session vanishes from the sidebar.	2026-06-08 12:32:27 -05:00
kshitij	a38003be3d	Merge pull request #42143 from kshitijk4poor/salvage/tui-slash-worker-leak-35626	2026-06-08 10:07:18 -07:00
teknium1	365813a72b	fix: resolve rebase conflict in _teardown_session worker cleanup Main folded slash_worker.close() into _finalize_session (the single _finalized-guarded chokepoint) while #42143 was open. The rebase conflicted with the PR's worker-close in _teardown_session. Keep both — they target the same #38095 leak and _SlashWorker.close() is idempotent (_closed/poll()-guarded) — so callers reaching _teardown_session without the real _finalize_session (and the PR's own tests, which monkeypatch _finalize_session out) still reap the worker. Same for _shutdown_sessions, now routed through the unified _close_session_by_id funnel.	2026-06-08 10:02:05 -07:00

1 2 3 4 5 ...

11098 commits