hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-01 12:02:05 +00:00

Author	SHA1	Message	Date
Teknium	c8fd47be14	docs: add PR infographic for approval mode validation	2026-06-28 19:04:18 -07:00
Teknium	f1cbe4308f	fix(gateway): log error-notification failures instead of silently swallowing (#54472 ) * fix(gateway): log error-notification failures instead of silently swallowing The last-resort exception handler in _process_message_background() that sends an error notice to the user caught all exceptions with a bare pass, leaving zero trace when the notification itself failed. Upgrade to logger.error(..., exc_info=True) so a failed error-notification send is debuggable post-mortem. Salvaged from #6499 by @BongSuCHOI (the logging-upgrade portion only). * docs: add PR infographic for gateway error-notify logging	2026-06-28 18:52:51 -07:00
Hermes Agent	c8b86963d0	docs: add PR infographic for anthropic stale base_url guard	2026-06-28 15:12:03 -07:00
Teknium	e5d22ab80d	fix(daytona): quote single-upload mkdir parent path (#54440 ) * fix(daytona): quote single-upload mkdir parent path The single-file _daytona_upload() path shelled out 'mkdir -p {parent}' with the remote parent interpolated unquoted, so shell metacharacters in the path could break the command or inject arbitrary commands into the sandbox. The bulk-upload, bulk-download, and delete paths were already hardened with shlex-quoting helpers; this single-upload path was missed. Route it through the existing quoted_mkdir_command() helper and add a regression test covering a path with shell metacharacters. Reported by @Gutslabs (#3960); the original branch predated the file_sync refactor, so the fix is re-applied to the current code path. * docs(infographic): daytona quote-sync fix	2026-06-28 14:33:03 -07:00
Teknium	b31b0b9d95	docs: reconcile docs with code across last 3 releases (#54254 ) Audited the last 3 releases (v2026.5.28..main) against the docs site and fixed code-vs-docs drift: - slash-commands: add /moa, /prompt, /pet, /hatch, /timestamps - cli-commands: add hermes pets / project / desktop / whatsapp-cloud + dashboard register; correct --insecure (now a deprecated no-op); add gateway migrate-legacy + enroll --wake-url + dashboard --skip-build - environment-variables: document the remaining ~48 env vars (SimpleX, Photon, Teams adapter, per-platform _ALLOW_ALL_USERS, home-channel vars, IRC, Brave/Krea/Notion/Linear/Airtable/Tenor keys, QQ_SANDBOX) — full OPTIONAL_ENV_VARS (265) now covered - configuration: document tool_loop_guardrails, goals, prompt_caching, network, onboarding, dashboard config blocks - toolsets/tools-reference + tools.md: add coding/project toolsets and read_terminal/project_ tools; remove the stale messaging toolset and send_message agent tool (removed in #47856); drop stale RL-training prose - messaging: new IRC channel page (adapter shipped without docs) + index row + sidebar + env vars - pets: document the /hatch AI generation pipeline + Nous/OpenRouter image backend - web-dashboard: document the bearer-token / TokenPrincipal service auth path - purge agent-callable send_message references across guides/features and the research-paper-writing skill (tool removed in #47856) Verified: docusaurus build succeeds; all authored internal links resolve.	2026-06-28 12:47:50 -07:00
teknium1	e54bedd8ea	docs: add infographic for #42006 launchd bootout fix	2026-06-28 04:17:13 -07:00
Teknium	8e356eccea	docs(readme): trim provider list to a few names plus docs link (#54169 ) The README line enumerated 11 providers inline, which dilutes the point and goes stale as providers come and go. Replace with Nous Portal, OpenRouter, OpenAI, your own endpoint, and a 'many others' link to the canonical AI Providers docs page that already lists them all.	2026-06-28 04:14:59 -07:00
teknium1	f22b9d3867	docs: add infographic for MCP WS discovery fix (#38945 )	2026-06-28 04:14:12 -07:00
teknium1	19cbbe304a	docs: add infographic for clarify typed-replies fix	2026-06-28 04:13:19 -07:00
Teknium	c1c179a239	fix(security): redact secrets in background process + foreground env-dump output (#43025 ) (#54149 ) * fix(security): redact secrets in background process + foreground env-dump output Terminal-output redaction was incomplete (#43025): - Gap 1: process(action=poll/log/wait) returned background stdout verbatim — no redaction at all. A background printenv/server/test emitting a key leaked raw to the model, session.db, and CLI display. Same for the gateway background-process watcher's completion/progress notifications. - Gap 2: the foreground terminal path hardcoded code_file=True, which skips the ENV-assignment pass, so an opaque token (no vendor prefix) from env/printenv leaked even there. Adds agent.redact.redact_terminal_output(output, command) as the single policy for ALL terminal-output surfaces: env-dump commands (env/printenv/set/export/ declare) get the ENV-assignment pass (code_file=False) to mask opaque tokens; other commands stay on code_file=True to avoid false positives on source dumps. Wired into terminal_tool, process_registry (_handle_process boundary), and the gateway watcher. Respects security.redact_secrets (no force) — opt-out preserved. * docs: add infographic for #43025 terminal-output redaction fix	2026-06-28 02:44:21 -07:00
teknium1	822b71cbf8	docs: add infographic for #43083 secret-redaction fix	2026-06-28 02:44:06 -07:00
Teknium	6d879d486b	fix(dashboard): close PTY WebSocket on child EOF to stop FD leak (#54028 ) (#54123 ) * fix(dashboard): close PTY WebSocket on child EOF to stop FD leak The /api/pty handler's reader task returns on child EOF, but the writer loop stayed blocked on ws.receive() until the browser sent a disconnect. When the browser socket is half-open (no FIN delivered — common on macOS/launchd), that disconnect never arrives, so the handler never reaches its finally and the PTY master fd + child process leak. With dashboard auto-reconnect (#52962), every dropped socket then spawns a fresh PTY on top of the orphaned one, exhausting file descriptors within hours (EMFILE / Errno 24). Fix: the reader task now closes the WebSocket in a finally when the child EOFs or the send side breaks, which unblocks ws.receive() so the existing finally runs bridge.close(). The writer loop also guards ws.receive() against the RuntimeError Starlette raises once the socket is closed. Reported by @fifteenzhang. Fixes #54028 * docs: add infographic for #54028 PTY FD leak fix	2026-06-28 02:42:21 -07:00
teknium1	c38dfba3a7	docs: add infographic for #53175 gateway cleanup off-loop fix	2026-06-28 02:41:36 -07:00
Teknium	b508d4296e	test(ci): raise per-file timeout 140s → 300s to stop false timeouts (#54143 ) * test(ci): raise per-file timeout 140s to 300s to stop false timeouts The per-file parallel runner caps each test-file subprocess at a flat wall-clock budget. Combined with per-test subprocess isolation (a fresh Python process per test), a large-collection file pays N x (interpreter startup + import) of overhead before any test logic runs. That overhead dilates under load on shared CI runners, so a file that finishes in ~100s on a quiet box can blow the old 140s cap purely from scheduling jitter, surfacing as a false 'no tests ran' timeout (rc=124) with zero actual test failures. Raise the default to 300s (5 min). The Docker build matrix jobs already take 7-10 min, so this headroom costs nothing on total CI wall time while still bounding a genuinely hung file. * docs: add infographic for CI per-file timeout bump	2026-06-28 02:41:07 -07:00
teknium1	dcc6cd1b42	docs: add infographic for #52378 Windows update-loop salvage	2026-06-28 02:40:37 -07:00
teknium1	6eec0d4f08	docs: add infographic for #53107 gateway force-exit fix	2026-06-28 02:34:23 -07:00
Teknium	f646b82ff0	docs: add infographic for #38249 atomic env-snapshot fix	2026-06-28 02:08:57 -07:00
teknium1	9f7d520caf	docs: add infographic for #36664 WhatsApp LID session-path fix	2026-06-28 02:05:26 -07:00
teknium1	d0f087e7f9	docs: add infographic for #36109 empty-400 diagnostics	2026-06-28 02:05:20 -07:00
teknium1	64972b6403	fix(config): canonicalize model.name/model.model to model.default (#34500 ) A custom_providers config that names the model under model.name (or model.model) resolved to an empty model, so the API request went out with model= — HTTP 400 from OpenAI-compatible backends. Display paths (hermes status/dump) already read model.name and showed the model, making the failure silent. The model id was read via 'default or model' at ~14 independent sites (cli, gateway, cron, curator, oneshot, fallback, profiles, ...), none of which honored 'name'. Rather than patch every site, canonicalize at the single load/save chokepoint: _normalize_root_model_keys() now promotes model.model/model.name -> model.default (precedence default > model > name) and drops the stale alias, so every reader — present and future — sees a populated default and config.yaml is migrated canonical on next save. The gateway, which bypasses load_config(), replays the same normalization in _load_gateway_config(). Co-authored-by: Bartok9 <danielrpike9@gmail.com> Credit: root-cause analysis and fix direction from @Bartok9 (#34502, first) and @v86861062 (#34527).	2026-06-28 02:05:13 -07:00
Teknium	2ecb6f7fe6	fix(telegram): clear send_path_degraded on successful reconnect (#35205 ) (#54076 ) * fix(telegram): clear send_path_degraded on successful reconnect _send_path_degraded was cleared only in _verify_polling_after_reconnect, 60s after reconnect and only if scheduled. A clean start_polling() reconnect left the flag stuck True, short-circuiting send() and blocking all outbound messages until the deferred probe ran (or forever if it never did). Clear the flag the moment start_polling() succeeds — that is the recovery signal. The deferred probe remains a defensive re-check that re-enters the reconnect ladder (re-setting the flag) if it detects a silent wedge. Fixes #35205. * docs: add infographic for #35205 telegram send-path fix	2026-06-28 01:38:17 -07:00
Teknium	de6e9ac760	docs(discord): document bot-to-bot comms as unsupported (#32791 ) (#54063 ) * docs(discord): document bot-to-bot comms as unsupported (#32791) Multi-profile bot-to-bot conversation is not a supported topology. DISCORD_ALLOW_BOTS=none (the default) blocks all bot-originated messages; setting mentions/all across multiple Hermes profiles to make them reply to each other ack-loops because Discord's reply auto-mention satisfies the mention gate every turn. Document the safe default and the loop hazard so operators don't wire it up. * docs(discord): infographic for bot-to-bot unsupported stance (#32791)	2026-06-28 01:15:34 -07:00
teknium1	4f16950e9a	docs: add infographic for #32421 content-filter fallback fix	2026-06-28 01:15:21 -07:00
teknium1	0800f1c28b	infographic: whatsapp send-queue serialization (#33360 )	2026-06-28 01:10:14 -07:00
teknium1	4a0fe4e54a	docs: add PR infographic for #32762 clarify-expiry fix	2026-06-28 01:07:53 -07:00
teknium1	3a03d03bdc	docs: add infographic for #30636 macOS state.db fix	2026-06-28 00:53:19 -07:00
Teknium	1b70a91844	docs: third-party-product plugins ship standalone, not into core tree (#54001 ) * docs: third-party-product plugins ship standalone, not into core tree Generalizes the closed-set memory-provider policy to any plugin that integrates someone else's product/project (observability backends, vendor SaaS, analytics dashboards, paid-service tie-ins). These create an open-ended maintenance burden on us for backends we don't own, so they ship as standalone plugin repos installed into ~/.hermes/plugins/ and are promoted in #plugins-skills-and-skins — not merged into core. - AGENTS.md: new 'what we don't want' bullet + generalized policy note beside the memory-provider closed-set rule - CONTRIBUTING.md: new 'Third-Party Product Integrations' section - build-a-hermes-plugin.md: caution callout at the top of the guide It's a coupling decision, not a quality bar — a plugin can clear review and still be a close. * docs: add infographic for standalone-plugin policy	2026-06-27 22:23:50 -07:00
teknium1	9c7f9f9502	infographic: partial-stream recovery fix (salvage #41498 )	2026-06-27 22:03:14 -07:00
Teknium	e3c9924b8b	fix(cli): correct stale `hermes auth login nous` hints to `hermes auth add nous` (#53929 ) * fix(cli): correct stale `hermes auth login nous` hints to `hermes auth add nous` There is no `hermes auth login` subcommand — valid auth verbs are add/list/remove/reset/status/logout/spotify. Six user-facing strings told users to run `hermes auth login nous`, which fails with `invalid choice: 'login'` — the same broken-hint class reported in #28089 for the proxy flow (already fixed there to `hermes auth add nous`). Sites corrected to `hermes auth add nous`: - hermes_cli/dashboard_register.py (401 retry hint, not-logged-in hint) - hermes_cli/gateway_enroll.py (401 retry hint, not-logged-in hint) - cli-config.yaml.example (two provider-requirement comments) * docs(infographic): auth login nous hint fix	2026-06-27 21:30:37 -07:00
teknium1	b304023fc6	docs(infographic): model picker fixes (#49129 + #51488 )	2026-06-27 21:23:25 -07:00
teknium1	1ad8b44413	docs(infographic): skill sync external_dirs shadow fix	2026-06-27 21:07:53 -07:00
Teknium	d43e0cf304	fix(agent): config-driven intent-ack continuation for all api_modes (#27881 ) (#53943 ) * fix(agent): config-driven intent-ack continuation for all api_modes (#27881) The agent could end a turn after only stating intent ('I will run a health check...') without executing the announced tool call, forcing the user to re-prompt. A continuation guard that catches this and nudges the model to proceed already existed but was hard-gated to the codex_responses api_mode, so Gemini/Claude/OpenRouter turns never benefited. - New agent.intent_ack_continuation config (default 'auto' = codex-only, byte-stable for existing conversations). 'true'/model-list opts every api_mode in; 'false' disables. Mirrors agent.tool_use_enforcement's shape. - looks_like_codex_intermediate_ack gains require_workspace (default True). The opted-in path drops the codebase/filesystem requirement so general autonomous workflows (server ops, deploys, API calls) are caught, not just coding tasks. Future-ack + action-verb + short-content + no-prior-tool guards still apply; the 2-nudge-per-turn cap is unchanged. - Resolution centralized in intent_ack_continuation_mode (off/codex_only/all). * docs(infographic): intent-ack continuation (#27881)	2026-06-27 20:46:00 -07:00
teknium1	a590c5efdc	docs: add infographic for provider-precedence fix (#29285 )	2026-06-27 19:49:02 -07:00
Teknium	28ed883959	docs: add PR infographic for config-defaults fix	2026-06-27 19:38:11 -07:00
teknium1	4133cd9fbf	docs(infographic): eager fallback on persistent transport failures	2026-06-27 19:12:21 -07:00
Teknium	917f6bdb00	fix(tools): let vision pick any provider+model, not just OpenRouter (#53606 ) * fix(tools): let vision pick any provider+model, not just OpenRouter hermes tools → configure → vision no longer forces an OPENROUTER_API_KEY. It now offers the same any-provider surface as the model command: Auto (use main model / aggregator fallback), pick any authenticated provider + model, or a custom OpenAI-compatible endpoint. Selections persist to auxiliary.vision.{provider,model,base_url} — the keys the vision resolver already reads. Custom endpoint pins provider=custom so base_url routes correctly. Reconfigure path uses the same picker instead of re-prompting for OPENROUTER_API_KEY. * docs: add PR infographic for vision any-provider picker	2026-06-27 04:41:42 -07:00
Ben Barclay	eaa0984210	chore: drop committed PR-infographic assets from the repo (#48261 ) PR infographics are decorative visual hooks for a PR body, not repo artifacts. The established convention (commit `5772e638c`, "chore: drop in-repo infographic/ directory; keep PR-body URLs only", #30854) is to hotlink an externally-hosted image so GitHub camo-proxies it inline, leaving zero binary footprint in the tree. Two such assets had been committed anyway and are referenced nowhere in the codebase: - docs/assets/ns504-chat-session-reconnect.png (1024-equiv, NS-504 PR infographic, added in #47674 alongside the ChatPage.tsx fix) - infographic/kanban-db-corruption-defense/infographic.png (re-added a directory #30854 had explicitly removed, in #30952) Both are unreferenced decorative infographics, so removing them has no effect on docs, website, or app builds. Removing the latter also clears the stray top-level infographic/ directory that #30854 had retired. These blobs remain in history (the commits that introduced them are already on main and bundled with real code, so they can't be dropped); this just removes them from the working tree going forward.	2026-06-18 16:03:29 +10:00
Teknium	cae7537359	infographic: kanban.db corruption defense (#30858 + #30862 ) (#30952 )	2026-05-23 05:55:25 -07:00
Teknium	5772e638c9	chore: drop in-repo infographic/ directory; keep PR-body URLs only (#30854 ) PR infographics belong in PR descriptions, not committed to the repo. Removes the 13 archived directories under infographic/ and adds the path to .gitignore so future generations don't accidentally land in-tree. The fal.media URLs embedded in each PR's body remain the canonical artifact — those PR descriptions are the storage.	2026-05-23 02:25:03 -07:00
Teknium	729a778af0	infographic: PR #17659 read-deny credentials salvage Some checks failed Docker Build and Publish / build-amd64 (push) Waiting to run Details Docker Build and Publish / build-arm64 (push) Waiting to run Details Docker Build and Publish / merge (push) Blocked by required conditions Details Docker Build and Publish / move-latest (push) Blocked by required conditions Details Lint (ruff + ty) / ruff + ty diff (push) Waiting to run Details Lint (ruff + ty) / ruff enforcement (blocking) (push) Waiting to run Details Lint (ruff + ty) / Windows footguns (blocking) (push) Waiting to run Details Nix / nix (macos-latest) (push) Waiting to run Details Nix / nix (ubuntu-latest) (push) Waiting to run Details Tests / test (1) (push) Waiting to run Details Tests / test (2) (push) Waiting to run Details Tests / test (3) (push) Waiting to run Details Tests / test (4) (push) Waiting to run Details Tests / test (5) (push) Waiting to run Details Tests / test (6) (push) Waiting to run Details Tests / save-durations (push) Blocked by required conditions Details Tests / e2e (push) Waiting to run Details Nix Lockfile Fix / auto-fix-main (push) Has been cancelled Details Nix Lockfile Fix / fix (push) Has been cancelled Details	2026-05-22 20:15:09 -07:00
Teknium	7f7245bf62	infographic: PR #6656 skill hub safety audit salvage	2026-05-22 19:59:24 -07:00
Teknium	a84cec61ca	fix(minimax-oauth): refresh short-lived access tokens per request (#30619 ) * fix(minimax-oauth): refresh short-lived access tokens per request MiniMax OAuth issues ~15-minute access tokens. The Anthropic SDK caches api_key as a static string at client construction, so a session that resolves credentials once at startup keeps sending the same bearer until MiniMax returns 401 mid-session. Swap the static string for a callable token provider, reusing the existing Entra-ID bearer-hook infrastructure in build_anthropic_client. The callable re-reads auth.json on each invocation and calls _refresh_minimax_oauth_state, which is a no-op when the token still has more than 60s of life left and refreshes proactively otherwise. Refreshes persist to auth.json so other processes (gateway, cron) see them immediately. The wire-up lives at the agent-init / model-switch boundary rather than in resolve_runtime_provider, so aux client paths that hand the api_key string to OpenAI(api_key=...) are unaffected. * docs: add infographic for minimax-oauth token refresh	2026-05-22 15:16:15 -07:00
Teknium	2233b8b244	infographic: PR #30609 Termux cold-start salvage (#30618 )	2026-05-22 14:32:41 -07:00
Teknium	d11cbb1032	infographic: PR #30591 Discord adapter → bundled plugin salvage (#30614 )	2026-05-22 14:24:03 -07:00
Teknium	4f988634f8	infographic: PR #27612 Nous URL allowlist salvage	2026-05-22 14:17:40 -07:00
Teknium	1e71b7180e	infographic: PR #14157 control-plane write-deny salvage	2026-05-22 04:32:14 -07:00
Teknium	6f436a463e	infographic: PR #27784 anthropic adapter refactor salvage	2026-05-22 04:23:02 -07:00
Teknium	ec2ab5bfaf	infographic: PR #8056 hash pairing codes salvage	2026-05-22 04:11:49 -07:00
Teknium	7dea33303a	infographic: PR #30373 aux model picker parity salvage	2026-05-22 04:10:38 -07:00
Teknium	8b49012a0a	infographic: PR #8306 webhook HMAC bypass salvage	2026-05-22 03:45:21 -07:00

1 2

52 commits