hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-30 19:09:28 +00:00

Author	SHA1	Message	Date
Nanako0129	2e991770fc	fix(gemini): pass base_url into chat transport	2026-04-29 12:10:40 -07:00
Nanako0129	c5a5e586d7	fix(gemini): nest OpenAI-compat thinking config under google	2026-04-29 12:10:40 -07:00
github-actions[bot]	5a61c116e1	fix(nix): auto-refresh npm lockfile hashes Source: `430302c197` Run: https://github.com/NousResearch/hermes-agent/actions/runs/25123381903	2026-04-29 18:07:17 +00:00
teknium1	69d4800db7	chore: add txbxxx to AUTHOR_MAP	2026-04-29 10:35:28 -07:00
txbxxx	9ee540a5e2	fix(install): promote croniter to a core dependency Cron is a built-in Hermes feature (CLI `hermes cron`, `cronjob` agent tool, gateway ticker, scheduler in cron/scheduler.py) but croniter has been gated behind the [cron] optional extra. Users who do a plain `pip install hermes-agent` can create jobs via /cron but any recurring cron schedule silently returns next_run_at=None (HAS_CRONITER=False), which then gets wrapped into a 'state=error' message only after a tick. Move croniter into core dependencies so scheduled jobs work out of the box on any install path. The [cron] extra is kept as an empty passthrough so existing `pip install hermes-agent[cron]` installs and the [all]/[termux] extras continue to resolve. Also update the now-stale user-facing error message in `compute_next_run()` that still tells users to install `hermes-agent[cron]`. Salvaged from #17234 (authored by @txbxxx) with a corrected premise: the original PR claimed [cron] wasn't in [all], but it is (pyproject.toml line 112). The real UX problem is the plain no-extras install path, which this fix addresses.	2026-04-29 10:35:28 -07:00
Teknium	0e577fb1be	docs(curator): document that pinning also blocks skill_manage writes (#17578 ) Add a dedicated 'Pinning a skill' section that covers both gating layers — curator auto-transitions AND the agent's skill_manage tool — so users know what the flag actually protects against after PR #17562. Updates the one-line claim in 'How it runs' to cross-link the new section instead of only mentioning auto-transitions.	2026-04-29 10:35:16 -07:00
Teknium	c61b2e0af7	feat(skills): refuse skill_manage writes on pinned skills (#17562 ) Extend curator's pin flag from 'skip auto-transitions' to 'no agent edits at all'. All five skill_manage mutation actions (edit, patch, delete, write_file, remove_file) now refuse pinned skills with a message pointing the user at `hermes curator unpin <name>`. Motivation: pin used to only stop the curator's own maintenance pass from touching a skill. Nothing prevented the main agent from editing or deleting a pinned skill via skill_manage in-session. This gives users a hard fence against unwanted agent edits — same semantics as curator pinning, extended to the write tool. Create is unaffected (you can't pin a name that doesn't exist yet, and name collisions already error out). Broken sidecars fail open rather than lock the agent out. The schema description advertises the new refusal so models know not to route around it with rename/recreate tricks.	2026-04-29 10:28:25 -07:00
Teknium	b01656d116	docs: exclude per-skill pages from search, add curator feature page (#17563 ) Skill catalog pages (bundled/optional) were drowning out real user-guide and reference docs in search results. There are ~3100 of them and they match on almost every generic term. - Add `ignoreFiles` regexes to docusaurus-search-local for `user-guide/skills/bundled/` and `user-guide/skills/optional/`. The two human-written catalog indexes (`reference/skills-catalog`, `reference/optional-skills-catalog`) remain indexed. - Add a new feature page `user-guide/features/curator.md` covering the curator subsystem merged in #16049 and refined in #17307 (per-run reports): how it runs, config, CLI (`hermes curator status/run/pin/ restore/...`), `.usage.json` telemetry, archival semantics, and recovery. Slotted into the Core features sidebar next to Skills. Search index size dropped from 5822 docs to 2704 in the main section; `user-guide/features/curator` is indexed.	2026-04-29 10:28:15 -07:00
Austin Pickett	430302c197	Merge pull request #17175 from NousResearch/fix/markdown feat(latex): latex in tui	2026-04-29 10:18:17 -07:00
teknium1	40a98fb0fa	feat(minimax-oauth): full integration with peer OAuth providers Close integration gaps discovered by auditing qwen-oauth's file coverage. These are surfaces the original salvage missed — they all existed on main and were added in the 747 commits since PR #15203 was opened. Coverage added: - agent/credential_pool.py: seed pool from auth.json providers.minimax-oauth so `hermes auth list` reflects logged-in state and `hermes auth remove minimax-oauth <N>` works through the standard flow. - agent/credential_sources.py: register RemovalStep for minimax-oauth with suppression-aware `_clear_auth_store_provider`. - agent/models_dev.py: PROVIDER_TO_MODELS_DEV mapping (-> 'minimax' family). - hermes_cli/providers.py: HermesOverlay entry (anthropic_messages transport, oauth_external auth_type, api.minimax.io/anthropic base). - hermes_cli/model_normalize.py: add to _MATCHING_PREFIX_STRIP_PROVIDERS so `minimax-oauth/MiniMax-M2.7` in config.yaml gets correctly repaired. - hermes_cli/status.py: render MiniMax OAuth block in `hermes doctor` (logged-in / region / expires_at / error). - hermes_cli/web_server.py: register in OAUTH_PROVIDER_REGISTRY + dispatch branch in _resolve_provider_status so the dashboard auth page shows it. - website/docs/integrations/providers.md: full 'MiniMax (OAuth)' section. - website/docs/reference/cli-commands.md: --provider enum. - website/docs/user-guide/features/fallback-providers.md: fallback table row. - scripts/release.py AUTHOR_MAP: amanning3390 mapping (CI gate).	2026-04-29 09:53:42 -07:00
Adam Manning	eafa637287	docs: document MiniMax OAuth login flow Add comprehensive documentation for the minimax-oauth provider. New file: website/docs/guides/minimax-oauth.md - Overview table (provider ID, auth type, models, endpoints) - Quick start via 'hermes model' - Manual login via 'hermes auth add minimax-oauth' - --region global\|cn flag reference - The PKCE OAuth flow explained step-by-step - hermes doctor output example - Configuration reference (config.yaml shape, region table, aliases) - Environment variables note: MINIMAX_API_KEY is NOT used by minimax-oauth (OAuth path uses browser login) - Models table with context length note - Troubleshooting section: expired token, timeout, state mismatch, headless/remote sessions, not logged in - Logout command Updated: website/docs/getting-started/quickstart.md - Add MiniMax (OAuth) to provider picker table as the recommended path for users who want MiniMax models without an API key Updated: website/docs/user-guide/configuration.md - Add 'minimax-oauth' to the auxiliary providers list - Add MiniMax OAuth tip callout in the providers section - Add minimax-oauth row to the provider table (auxiliary tasks) - Add MiniMax OAuth config.yaml example in Common Setups Updated: website/docs/reference/environment-variables.md - Annotate MINIMAX_API_KEY, MINIMAX_BASE_URL, MINIMAX_CN_API_KEY, MINIMAX_CN_BASE_URL as NOT used by minimax-oauth - Add minimax-oauth to HERMES_INFERENCE_PROVIDER allowed values	2026-04-29 09:53:42 -07:00
Adam Manning	f3aa989b1b	test(cli): cover minimax-oauth resolution, refresh, menu wiring Add and extend tests for the minimax-oauth provider across three test modules. New file: tests/test_minimax_oauth.py (15 tests) - test_pkce_pair_produces_valid_s256: verifies PKCE verifier/challenge pair produces a valid S256 hash and correct lengths - test_request_user_code_happy_path: mocks httpx, verifies correct POST parameters and response parsing - test_request_user_code_state_mismatch_raises: verifies CSRF guard - test_request_user_code_non_200_raises: verifies HTTP error handling - test_poll_token_pending_then_success: verifies polling loop retries on 'pending' and returns on 'success' - test_poll_token_error_raises: verifies 'error' status raises AuthError - test_poll_token_timeout_raises: verifies deadline expiry raises - test_refresh_skip_when_not_expired: verifies no HTTP call when token is fresh - test_refresh_updates_access_token: verifies new access/refresh tokens stored on successful refresh - test_refresh_reuse_triggers_relogin_required: verifies relogin_required=True on invalid_grant/refresh_token_reused - test_resolve_credentials_requires_login: verifies AuthError when no stored state - test_provider_registry_contains_minimax_oauth: PROVIDER_REGISTRY key - test_minimax_oauth_alias_resolves: portal/global/underscore aliases - test_get_minimax_oauth_auth_status_not_logged_in - test_get_minimax_oauth_auth_status_logged_in Extended: tests/hermes_cli/test_runtime_provider_resolution.py - test_minimax_oauth_runtime_returns_anthropic_messages_mode - test_minimax_oauth_runtime_uses_inference_base_url Extended: tests/hermes_cli/test_api_key_providers.py - TestMinimaxOAuthProvider class (8 tests) covering registry keys, auth_type, endpoints, client_id, aliases, CANONICAL_PROVIDERS listing, _PROVIDER_MODELS entries, and aux model	2026-04-29 09:53:42 -07:00
Adam Manning	0b2f1bb27b	feat(agent): wire MiniMax-M2.7 for minimax-oauth provider Wire MiniMax-M2.7 and MiniMax-M2.7-highspeed into the model catalog, CLI model picker, and agent auxiliary/metadata subsystems. Changes: - hermes_cli/models.py: - Add 'minimax-oauth' to _PROVIDER_MODELS with MiniMax-M2.7 and MiniMax-M2.7-highspeed - Add ProviderEntry('minimax-oauth', 'MiniMax (OAuth)', ...) to CANONICAL_PROVIDERS near existing minimax entries - Add aliases: minimax-portal, minimax-global, minimax_oauth in _PROVIDER_ALIASES - hermes_cli/main.py: - Add 'minimax-oauth' to provider_labels dict - Insert 'minimax-oauth' into providers list in select_provider_and_model() near the other minimax entries - Add 'minimax-oauth' to --provider argparse choices - Add _model_flow_minimax_oauth() function: ensures login via _login_minimax_oauth(), resolves runtime credentials, prompts for model selection, saves model choice and config - Add dispatch elif branch for selected_provider == 'minimax-oauth' - agent/auxiliary_client.py: - Add 'minimax-oauth': 'MiniMax-M2.7-highspeed' to _API_KEY_PROVIDER_AUX_MODELS - Add 'minimax-oauth' to _ANTHROPIC_COMPAT_PROVIDERS set - agent/model_metadata.py: - Add 'minimax-oauth' to _PROVIDER_PREFIXES frozenset - MiniMax-M2.7 context length (200_000) already covered by the existing 'minimax' substring match in DEFAULT_CONTEXT_LENGTHS	2026-04-29 09:53:42 -07:00
Adam Manning	9eb16025bd	feat(cli): add minimax-oauth provider with PKCE browser flow Add MiniMax OAuth (minimax-oauth) as a first-class provider using a PKCE device-code flow ported from openclaw/extensions/minimax/oauth.ts. Changes: - hermes_cli/auth.py: - Add 8 MINIMAX_OAUTH_* constants (client ID, scope, grant type, global/CN base URLs, inference URLs, refresh skew) - Add 'minimax-oauth' ProviderConfig to PROVIDER_REGISTRY (auth_type oauth_minimax) with global portal + inference base URLs and CN extras in the extra dict - Add provider aliases: minimax-portal, minimax-global, minimax_oauth - Implement _minimax_pkce_pair(), _minimax_request_user_code(), _minimax_poll_token(), _minimax_save_auth_state(), _minimax_oauth_login(), _refresh_minimax_oauth_state(), resolve_minimax_oauth_runtime_credentials(), get_minimax_oauth_auth_status(), _login_minimax_oauth() - Token refresh uses standard OAuth2 refresh_token grant; triggers relogin_required on invalid_grant / refresh_token_reused - hermes_cli/runtime_provider.py: - Add minimax-oauth branch (after qwen-oauth) that calls resolve_minimax_oauth_runtime_credentials() and returns api_mode='anthropic_messages' with the OAuth Bearer token - hermes_cli/auth_commands.py: - Add 'minimax-oauth' to _OAUTH_CAPABLE_PROVIDERS - Add auth_type auto-detection for oauth_minimax - Add provider == 'minimax-oauth' branch in auth_add_command - hermes_cli/doctor.py: - Import get_minimax_oauth_auth_status - Add MiniMax OAuth status check in the Auth Providers section	2026-04-29 09:53:42 -07:00
teknium1	b2820cd207	chore: add beenherebefore to AUTHOR_MAP	2026-04-29 08:24:48 -07:00
beenherebefore	e0c0167428	fix(cron): use last_run_at as croniter base for cron jobs compute_next_run() ignored the last_run_at parameter for cron-type schedules, always computing from _hermes_now() instead. This was inconsistent with interval jobs which DO use last_run_at as the anchor. After a crash or restart, cron jobs would compute next_run_at from the arbitrary restart time rather than the actual last execution time. While the stale detection in get_due_jobs() catches most cases, using last_run_at as the croniter base eliminates edge cases and makes the behavior consistent across schedule types. Salvaged from #9014 (authored by @beenherebefore) onto current main. The original PR branch was 2+ weeks stale and would have reverted substantial unrelated work (jobs_file_lock, workdir/context_from/ enabled_toolsets, issue #16265 state=error recovery). Kept just the 7-line substantive fix and the regression test.	2026-04-29 08:24:48 -07:00
teknium1	6d8423761b	chore: add yeyitech to AUTHOR_MAP	2026-04-29 08:21:04 -07:00
yeyitech	ec27f0a3fa	fix(cron): fall back gracefully when HERMES_CRON_TIMEOUT is invalid Bare `float(os.getenv("HERMES_CRON_TIMEOUT", 600))` in `run_job()` raises a `ValueError` when the env var is set to a non-numeric string (e.g. "abc"). Replace it with the same defensive try/except pattern already used by `_get_script_timeout()` for `HERMES_CRON_SCRIPT_TIMEOUT`: log a warning and fall back to the 600 s default instead of crashing. Also update the existing env-var tests to exercise the new code path and add two new tests — one for an invalid value, one for an empty string. Fixes #11319 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 08:21:04 -07:00
Teknium	8c8fc6c1ec	fix(skills): let skill_manage patch/edit/delete skills in external_dirs in place (#17512 ) Closes #4759, closes #4381. Mutating actions (patch, edit, write_file, remove_file, delete) used to refuse skills that lived under `skills.external_dirs` with 'Skill X is in an external directory and cannot be modified. Copy it to your local skills directory first.' Faced with that error, the agent would fall back to action='create', which always writes under ~/.hermes/skills/ — producing a silent duplicate of the external skill in the local store. Fix: drop the read-only gate. `skills.external_dirs` is configured by the user; if they pointed it at a directory, they already said 'these are my skills, treat them the same.' Filesystem permissions handle the genuine read-only case (write fails, agent sees the error). - New _containing_skills_root() resolves whichever dir actually contains the skill; _delete_skill uses it to bound empty-category cleanup so an external root is never rmdir'd. - _create_skill behavior is unchanged: new skills still land in local SKILLS_DIR only. Fewer moving parts. - Seven new TestExternalSkillMutations tests covering patch/edit/write_file/ remove_file/delete/create against a mocked two-root layout + a category rmdir-safety check.	2026-04-29 08:16:52 -07:00
Teknium	e120cd5941	fix(model_switch): dedup /model picker rows when custom provider endpoint matches a built-in (#16970 ) (#17511 ) When a user authenticates a built-in provider via env var (e.g. DASHSCOPE_API_KEY triggers the built-in 'alibaba' row) AND defines a custom_providers entry pointing at the same endpoint, the picker previously emitted two rows for one endpoint. The built-in row already carries the canonical slug, curated model list, and correct auth wiring, so the shadow custom entry is redundant. Adds a _builtin_endpoints set populated as sections 1/2/2b emit rows. Each entry is the provider's effective base URL (env override via base_url_env_var wins over the static inference_base_url, so DASHSCOPE_BASE_URL-overridden endpoints dedup correctly). Section 4 skips any grouped custom entry whose base_url matches. Intentionally does NOT repurpose model_catalog.enabled as a 'hide built-ins' flag. That config controls the remote curated-manifest fetch (documented on the model-catalog reference page) and overloading it would silently change behavior for users who disable it for network/privacy reasons. Three new tests: - shadow dedup fires when endpoint matches static inference_base_url - dedup does NOT hide custom entries on genuinely distinct endpoints - dedup honors the base_url_env_var override path	2026-04-29 08:11:05 -07:00
teknium1	fa3338c171	test(anthropic): regression guard for DeepSeek /anthropic thinking replay Covers the #16748 fix: - unsigned thinking blocks synthesised from reasoning_content survive replay - non-latest assistant turns keep their thinking (DeepSeek validates every turn) - signed Anthropic blocks are stripped (DeepSeek can't validate them) - cache_control is stripped from thinking blocks - OpenAI-compat base (api.deepseek.com without /anthropic) is NOT matched - non-DeepSeek third parties (minimax) keep the generic strip-all behaviour	2026-04-29 08:10:29 -07:00
vominh1919	fd5479a4fc	fix: preserve DeepSeek thinking blocks on Anthropic replay (#16748 ) DeepSeek's /anthropic endpoint requires thinking blocks to be replayed in multi-turn conversations for reasoning continuity. The existing code classified api.deepseek.com as a generic third-party endpoint and stripped ALL thinking blocks, causing HTTP 400 from DeepSeek. Fix: add _is_deepseek_anthropic_endpoint() detector (following the Kimi precedent) and a dedicated branch that strips only signed Anthropic blocks while preserving unsigned ones synthesised from reasoning_content. This follows the exact same pattern as the Kimi exemption (issue #13848) and does not change behavior for any other third-party endpoint (Azure, Bedrock, MiniMax, etc.). Fixes NousResearch/hermes-agent#16748	2026-04-29 08:10:29 -07:00
teknium1	fd7188a7c6	chore(release): map liuhao03@bilibili.com to @liuhao1024	2026-04-29 08:10:25 -07:00
刘昊	60c6b07128	fix(cron): keep SOUL.md identity when workdir is unset	2026-04-29 08:10:25 -07:00
teknium1	0a5ee01e48	fix(hindsight): route flush-on-switch through writer queue, not raw thread Follow-up to the cherry-picked PR #17447. The original flush spawned a bare threading.Thread for the buffer-flush path, overwriting self._sync_thread — which is aliased to the long-lived writer thread. Two consequences: 1. No serialization with the writer queue. If old-session retains were still queued in _retain_queue, the flush ran concurrently with the writer and both threads could call aretain_batch against the same document_id. 2. The pre-spawn 'self._sync_thread.join(timeout=5.0)' tried to join the long-lived writer, which never exits, so the join was a no-op that just timed out — never actually serialized anything. Fix: enqueue the flush closure on _retain_queue via _ensure_writer + put(). Natural FIFO ordering behind any pending retains, no new thread, no broken join. Shutdown-aware so it doesn't enqueue after teardown. Tests updated to drain via _retain_queue.join() instead of the stale _sync_thread.join(). Added regression guard test_flush_serializes_behind_pending_retains_via_writer_queue that blocks the writer mid-retain to prove the flush waits in FIFO behind the old retain. Also seeds _retain_queue / _shutting_down / stubbed _ensure_writer on the bare-object test helper in test_memory_session_switch.py so that path doesn't blow up under the new queue-enqueue. tests/plugins/memory/test_hindsight_provider.py + tests/agent/test_memory_session_switch.py: 103/103 passing.	2026-04-29 08:09:03 -07:00
Nicolò Boschi	c38dac742b	fix(hindsight): flush buffered turns and drop stale prefetch on session switch Two data-loss / leak gaps in HindsightMemoryProvider.on_session_switch introduced by #17409. 1. Buffered turns silently lost when retain_every_n_turns > 1. on_session_switch unconditionally cleared _session_turns without flushing. Users who batched every N>1 turns and switched mid-batch (/reset, /new, /resume, /branch, or context compression) had those buffered turns disappear. Same data-loss class as the shutdown race, different lifecycle event. Note commit_memory_session() -> on_session_end() runs before on_session_switch on /reset, but Hindsight doesn't implement on_session_end so the buffer survives that step and dies at clear time. /resume, /branch, and compression skip commit_memory_session entirely so an on_session_end impl wouldn't help them anyway. Fix: snapshot the old _session_id, _document_id, _parent_session_id, _turn_index, and _session_turns; spawn one final retain that lands under the OLD document_id; then rotate state. Metadata is built synchronously against the old self._* so session_id / lineage tags on the flushed item all reference the prior session consistently. 2. Stale _prefetch_result leaks across switch. If queue_prefetch ran in the old session and the result hadn't been consumed by prefetch() yet, on_session_switch left the cached recall text in place. The next session's first prefetch() call would return text mined from the prior session's bank/query. Fix: join any in-flight _prefetch_thread (3s bounded — matches shutdown()), then clear _prefetch_result under _prefetch_lock before rotating session_id. Tests ----- - tests/plugins/memory/test_hindsight_provider.py (TestSessionSwitchBufferFlush): - buffered turns flushed under OLD document_id with OLD lineage tags - empty buffer => no spurious retain - _prefetch_result cleared on switch - in-flight prefetch thread is awaited before clear (no race) - tests/agent/test_memory_session_switch.py: factory extended to seed the attrs the new flush path reads (_retain_source, _platform, _bank_id, prefetch state, etc.) and stub _run_hindsight_operation so existing switch-state assertions keep passing without network setup.	2026-04-29 08:09:03 -07:00
Teknium	1bedc836b5	docs(onboarding): lead OpenClaw residue banner with migrate, warn that cleanup breaks OpenClaw (#17507 ) The ~/.openclaw/ detection banner (#16327) had two problems flagged in #16629: 1. It only pitched 'hermes claw cleanup' (destructive archive) and never mentioned 'hermes claw migrate' — the actual non-destructive path that ports config/memory/skills into Hermes. 2. The copy anthropomorphized the bug ('the agent can still get confused', 'dutifully reads') and framed OpenClaw as a competitor to eliminate ('instead of Hermes's'). Rewrite so migrate leads, cleanup is a clearly-labelled follow-up with a warning that archiving breaks OpenClaw for users still running it. Closes #16629	2026-04-29 08:08:36 -07:00
briandevans	e0a03f3f40	fix(api-server): collapse tool start/lifecycle into a single SSE event Address Copilot review on PR #16666: 1. Duplicate event on every tool start — both ``tool_progress_callback`` and ``tool_start_callback`` fire side-by-side in ``run_agent.py``, so wiring both into chat completions emitted two ``hermes.tool.progress`` events per real tool call. Drop the legacy ``_on_tool_progress`` emit entirely; ``_on_tool_start`` now produces a single unified event that carries the legacy ``tool``/``emoji``/``label`` fields plus the new ``toolCallId``/``status`` correlation fields. Label is computed inline via ``build_tool_preview`` so callers do not need to pre-format it. 2. Weak per-event correlation in the regression test — the previous assertion checked that a ``toolCallId`` appeared somewhere in the aggregate, which would have passed even if ``running`` lacked the id. Collect ``(status, toolCallId)`` per event and assert each event carries the correct pair, plus exactly two events on the wire (no silent duplication regression). The two existing chat-completions tool-progress tests are updated to fire ``tool_start_callback`` instead of ``tool_progress_callback``, matching production reality where ``run_agent`` always pairs them. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 08:08:16 -07:00
kshitijk4poor	13c238327e	fix: address self-review findings for Vercel Sandbox salvage - Add vercel_sandbox to hardline blocklist container bypass test - Add vercel_sandbox to skills_tool remote backend parametrize test - Deduplicate runtime set: doctor.py and setup.py now import _SUPPORTED_VERCEL_RUNTIMES from terminal_tool.py - Add docstring to _run_bash explaining timeout/stdin_data discards - Always stop sandbox during cleanup (unconditional, matching Modal/Daytona) - Update security.md: container bypass text, production tip, comparison table - Update environment-variables.md: TERMINAL_ENV list, Vercel auth vars, TERMINAL_VERCEL_RUNTIME - Update inline comments in cli.py and config.py to include vercel_sandbox	2026-04-29 07:22:33 -07:00
Scott Trinh	5a1d4f6804	feat: add Vercel Sandbox backend Adds Vercel Sandbox as a supported Hermes terminal backend alongside existing providers (Local, Docker, Modal, SSH, Daytona, Singularity). Uses the Vercel Python SDK to create/manage cloud microVMs, supports snapshot-based filesystem persistence keyed by task_id, and integrates with the existing BaseEnvironment shell contract and FileSyncManager for credential/skill syncing. Based on #17127 by @scotttrinh, cherry-picked onto current main.	2026-04-29 07:22:33 -07:00
Magaav	810d98e892	feat(api_server): expose run status for external UIs (#17085 ) Adds two API server endpoints for external UIs and orchestrators: - GET /v1/capabilities — machine-readable feature discovery so clients can detect which Runs API / SSE / auth features this Hermes version supports before depending on them. - GET /v1/runs/{run_id} — pollable run status so dashboards can check queued/running/completed/failed/cancelled/stopping state without holding an SSE connection open. Also moves request validation ahead of run allocation so invalid payloads no longer leave orphaned entries in _run_streams waiting for the TTL sweep. task_id is intentionally kept as "default" for the Runs API to preserve the shared-sandbox model used by CLI, gateway, and the existing _run_agent_with_callbacks path. session_id is surfaced in run status for external-UI correlation only. Salvage of PR #17085 by @Magaav.	2026-04-29 06:38:10 -07:00
Teknium	83c288da01	fix(anthropic): broaden Kimi thinking-suppression to custom endpoints (#17455 ) The guard that drops Anthropic's `thinking` kwarg for Kimi endpoints was matched on `https://api.kimi.com/coding` only. Users configuring a custom Kimi-compatible gateway (or an official Moonshot host) with `api_mode: anthropic_messages` fall through to the generic third-party path, which strips thinking blocks AND still sends `thinking={enabled,...}` → upstream rejects with HTTP 400 "reasoning_content is missing in assistant tool call message at index N" on the next request after a tool call. Replace `_is_kimi_coding_endpoint` callers (history replay + thinking kwarg gate) with `_is_kimi_family_endpoint(base_url, model)` that also matches the `api.kimi.com` / `moonshot.ai` / `moonshot.cn` hosts and Kimi/Moonshot family model names (`kimi-`, `moonshot-`, `k1.`, `k2.`, …) for custom / proxied endpoints. Keeps the UA-header check in `build_anthropic_client` URL-only — the `claude-code/0.1.0` header is an official-Kimi contract. Plumbs optional `model` through `convert_messages_to_anthropic` so the unsigned reasoning_content→thinking block synthesised for Kimi's history validation survives the third-party signature-stripping pass on custom hosts too. Closes #17057.	2026-04-29 06:35:42 -07:00
Teknium	398945e7b1	fix(cron): accept list-form deliver values so deliver=['telegram'] works (#17456 ) The cron schema contracts deliver as a string ("local", "origin", "telegram", "telegram:chat_id[:thread_id]", or comma-separated combos), but MCP clients and scripts sometimes pass an array like ['telegram']. Before this change, the list was written to jobs.json verbatim, and the scheduler's str(deliver).split(',') then tried to resolve the literal string "['telegram']" as a platform — returning None and logging 'no delivery target resolved for deliver=[\'telegram\']'. Fix on both ends: - tools/cronjob_tools.py: normalize deliver at the API boundary on create and update, so storage is always a string. - cron/scheduler.py: normalize deliver in _resolve_delivery_targets, so existing jobs.json entries with list-form deliver are handled gracefully without requiring users to edit the file. Closes #17139	2026-04-29 06:35:34 -07:00
vominh1919	7141cda967	fix: narrow Anthropic adapter dot-mangling to Claude models only The normalize_model_name() function unconditionally converted dots to hyphens in all model names. This caused non-Anthropic models (e.g. gpt-5.4) to be mangled to gpt-5-4 when routed through the Anthropic adapter path, resulting in HTTP 404 from the backend. Now only applies dot-to-hyphen conversion for models starting with "claude-" or "anthropic/", which are the actual Anthropic model IDs. Fixes NousResearch/hermes-agent#17171 Related: #7421, #13061, #16417	2026-04-29 06:34:57 -07:00
Nicolò Boschi	0565497dcc	fix(hindsight): drain retain queue cleanly on shutdown The plugin used to spawn one daemon thread per sync_turn() to do the aretain_batch network write. On CLI exit, that pattern raced interpreter shutdown — the last retain could reach aiohttp after asyncio's "cannot schedule new futures" guard had fired, producing noisy logs and silently losing the final unsaved turn: WARNING ... Hindsight sync failed: cannot schedule new futures after interpreter shutdown ERROR asyncio: Unclosed client session client_session: <aiohttp.client.ClientSession object at 0x...> Switch to a single-writer model: each provider owns one long-lived writer thread plus a queue. sync_turn() snapshots state and enqueues a job; the writer drains sequentially. Once shutdown() is called: - new sync_turn() / queue_prefetch() calls are dropped, not enqueued - a sentinel wakes the writer so it finishes in-flight work - shutdown joins the writer (10s) before nulling the client Also register an idempotent atexit hook from the first sync_turn(), so exit paths that don't go through MemoryManager.shutdown_all() (Ctrl-C, abrupt exit) still get a chance to drain. Tests: keep _sync_thread as a legacy alias to the writer, swap join() calls to _retain_queue.join() (canonical wait-for-drain), add a new TestShutdownRace suite covering single-writer reuse, post-shutdown drop, queue draining, and shutdown idempotency.	2026-04-29 06:34:24 -07:00
teknium1	5662ac2afc	chore(release): map Kailigithub email to GitHub login	2026-04-29 06:34:13 -07:00
Kailigithub	cf83982da0	fix(gateway): handle wmic encoding errors on Windows non-English locales Pass encoding='utf-8', errors='ignore' and guard against result.stdout being None so _scan_gateway_pids() no longer crashes with UnicodeDecodeError + AttributeError on Windows systems whose default code page is not UTF-8 (e.g. cp936 on zh-CN). The parser only matches the ASCII prefixes CommandLine= and ProcessId=, so dropping undecodable bytes is safe. Closes #17049.	2026-04-29 06:34:13 -07:00
briandevans	835f9adec0	fix(update,test): clarify wmic comment; switch tests to monkeypatch sys.platform Two fix-ups for #17123: 1. Reword the inline comment in `_warn_stale_dashboard_processes` to accurately describe the failure mode (locale-dependent decoder, not a "default UTF-8 decoder") and identify `errors="ignore"` as the load-bearing protection. Per Copilot's review. 2. Switch `TestWindowsWmicEncoding` from `patch("hermes_cli.main.sys")` to `monkeypatch.setattr(sys, "platform", "win32")` — the codebase's canonical pattern (e.g. `tests/hermes_cli/test_auth_ssl_macos.py`). The MagicMock-replacement approach passed locally on Python 3.12 but the platform-equality check failed under CI's xdist+Python 3.11, leaving both new tests red despite the fix being present. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 06:34:13 -07:00
briandevans	b85fff9495	fix(update): protect dashboard wmic scan against UnicodeDecodeError on Windows non-UTF-8 locales (#17049 ) `hermes update` calls `_warn_stale_dashboard_processes()` to warn about dashboard processes still running the pre-update Python backend. On Windows, that scan shells out to `wmic process get ProcessId,CommandLine /FORMAT:LIST` with `text=True` and no explicit encoding. `wmic` emits text in the system code page (e.g. cp936 on zh-CN locales), not UTF-8. Without an explicit `encoding=`, Python's default UTF-8 decoder crashes the subprocess reader thread with `UnicodeDecodeError: 'utf-8' codec can't decode byte 0xd0 ...`. In Python 3.11 that crash is silently absorbed: `subprocess.run()` returns a `CompletedProcess` with `result.stdout = None`, the next line calls `result.stdout.split("\n")`, and `hermes update` aborts with the exact `AttributeError: 'NoneType' object has no attribute 'split'` trace reported in #17049. Fix: pass `encoding="utf-8", errors="ignore"` so undecodable bytes cannot take down the reader thread (the parsing only matches the ASCII prefixes `CommandLine=` and `ProcessId=`, so dropping non-UTF-8 bytes is safe), and short-circuit when `result.stdout is None` as a defensive guard for environments where the reader thread still fails for other reasons. This is the same root cause as #17074 (which patches `hermes_cli/gateway._scan_gateway_pids` for the `hermes setup` path). That PR does not touch `_warn_stale_dashboard_processes`, so `hermes update` remains broken on the same locales until this lands. Regression test in `tests/hermes_cli/test_update_stale_dashboard.py`: - `test_wmic_invoked_with_utf8_ignore_errors` asserts the explicit encoding/errors kwargs reach `subprocess.run`. - `test_wmic_returns_none_stdout_does_not_crash` simulates the reader-thread-crashed `result.stdout=None` aftermath and asserts the function returns silently instead of raising AttributeError. Both new tests fail against clean origin/main (`7d4648461`) reproducing the original AttributeError; both pass with this patch. The remaining 3 failures in `tests/hermes_cli/test_cmd_update.py` and `test_update_autostash.py` are pre-existing baselines on origin/main — they reproduce identically without this change and are unrelated to the wmic scan. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 06:34:13 -07:00
Teknium	f317325279	docs(weixin): clarify iLink bot identity limits and warn on group policy (#17433 ) QR-login connects an iLink bot identity (...@im.bot), not a scriptable personal WeChat account. iLink typically does not deliver ordinary WeChat group events to these bots, so WEIXIN_GROUP_POLICY / WEIXIN_GROUP_ALLOWED_USERS often have no effect regardless of value. - Setup wizard: print iLink-bot caveat before the group-policy prompt; relabel the allowlist input as 'group chat IDs (not member user IDs)'; note that 'open' / 'allowlist' only take effect if iLink delivers group events. - Adapter: log a WARNING at connect() when WEIXIN_GROUP_POLICY is non-disabled so the limitation is surfaced in gateway logs, not just docs. - Docs: add a top-of-page warning callout to weixin.md explaining the iLink bot identity, narrow the 'DM and group messaging' feature line to DM-only with a group caveat, tighten the Group Policy section and troubleshooting row, and clarify WEIXIN_GROUP_ALLOWED_USERS as group IDs (not user IDs) in weixin.md and environment-variables.md. Closes #17094	2026-04-29 06:26:10 -07:00
teknium1	9e63062b6c	fix(stt): resolve API keys from ~/.hermes/.env via get_env_value (#17140 ) Widen #17163 to the sibling file tools/transcription_tools.py, which had the same class of bug. STT provider call sites and the _get_provider selection gate called os.getenv(...) directly and missed keys that only lived in ~/.hermes/.env. Same pattern as tts_tool.py: one guarded top-level import of get_env_value (falls back to os.getenv on ImportError), then every API-key and paired-base-URL lookup swapped over. Call sites migrated: - _transcribe_groq — GROQ_API_KEY - _transcribe_mistral — MISTRAL_API_KEY - _transcribe_xai — XAI_API_KEY, XAI_STT_BASE_URL - _get_provider — GROQ/MISTRAL/XAI_API_KEY in explicit + auto branches Module-level defaults (DEFAULT_STT_MODEL, GROQ_BASE_URL, etc.) stay on os.getenv — they're import-time constants, not runtime config, and the dotenv fallback would add no value there. New regression tests in tests/tools/test_transcription_dotenv_fallback.py (8 cases) mirror briandevans' TTS tests: per-provider dotenv-key forwarding, selection-gate dotenv visibility, and an end-to-end probe that patches hermes_cli.config.load_env to simulate ~/.hermes/.env carrying the key while os.environ does not.	2026-04-29 06:25:20 -07:00
briandevans	33967b4e52	fix(tts): tolerate missing hermes_cli.config in tts_tool import Wrap the new top-level `from hermes_cli.config import get_env_value` in try/except ImportError and fall back to a thin os.getenv shim, so importing tools.tts_tool keeps working in environments where hermes_cli.config is unavailable. This matches the existing tolerance in `_load_tts_config()` (tools/tts_tool.py) and the same import-fallback pattern in tools/tool_backend_helpers.py::fal_key_is_configured. Also update the TestDotenvFallbackPerProvider docstring to accurately describe the mocking strategy: per-provider tests patch `tools.tts_tool.get_env_value` directly, while the regression-guard tests cover the lower-level `hermes_cli.config.load_env` integration. Addresses Copilot review on #17163. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 06:25:20 -07:00
briandevans	40d25e125b	fix(tts): resolve API keys from ~/.hermes/.env via get_env_value (#17140 ) TTS provider tools (elevenlabs, xai, minimax, mistral, gemini) called os.getenv("X_API_KEY") directly, which bypassed Hermes's dotenv bridge in hermes_cli.config. Users who keep their TTS keys only in ~/.hermes/.env saw "X_API_KEY not set" errors even though the rest of the stack (agent/credential_pool, hermes_cli/auth) already resolves keys through get_env_value() — same class of bug as #15914 fixed for those modules. Switch every TTS env-var lookup (API keys, base URLs, and check_tts_requirements gates) to get_env_value, which checks os.environ first and then ~/.hermes/.env. Behaviour for users with keys exported in the shell is unchanged; users with dotenv-only keys now succeed. The two diagnostics prints in __main__ are migrated for consistency. Regression test (tests/tools/test_tts_dotenv_fallback.py): - per-provider: each backend reads the dotenv key when only ~/.hermes/.env carries it (5 providers). - end-to-end: with hermes_cli.config.load_env returning the key and os.environ empty, _generate_minimax_tts and check_tts_requirements both succeed; reverting tools/tts_tool.py back to os.getenv makes all 7 tests fail with "MINIMAX_API_KEY not set" / similar. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 06:25:20 -07:00
Teknium	ff687c019e	fix(aux): skip kimi-coding in vision auto-detect (closes #17076 ) (#17451 ) * docs(anthropic): correct OAuth scope to Max plan + extra usage credits only The previous docs pass (#17399) overstated what Anthropic OAuth works with. In practice Hermes can only route against a Claude Max plan that has purchased extra usage credits — the base Max allowance is not consumed, and Claude Pro is not supported at all. Without Max + extra credits, users must fall back to an ANTHROPIC_API_KEY (pay-per-token). Updates the four pages touched in #17399: - integrations/providers.md - user-guide/features/credential-pools.md - reference/environment-variables.md - getting-started/quickstart.md * fix(aux): skip kimi-coding in vision auto-detect (closes #17076) Kimi Coding Plan's /coding endpoint (Anthropic Messages wire) has no image_in capability — Kimi's own docs confirm and suggest switching to a vision-capable model. Vision lives on the separate Kimi Platform (api.moonshot.ai, OpenAI-wire, pay-as-you-go). When the user has kimi-coding as main provider and auxiliary.vision.provider=auto, resolve_vision_provider_client was handing back an AnthropicAuxiliaryClient wrapped around /coding which 404'd on every vision request. Add a _PROVIDERS_WITHOUT_VISION frozenset ({kimi-coding, kimi-coding-cn}) and gate the main-provider vision branch on membership. On a skip the auto-detect falls through to OpenRouter → Nous like any other main-provider-unavailable case. Explicit per-task overrides (auxiliary.vision.provider=kimi-coding) are unaffected — the skip only applies when the caller is in auto mode. Tests: 4 new targeted tests in TestVisionAutoSkipsKimiCoding covering the skip path, CN variant, explicit-override passthrough, and a guard against accidental skip-list widening.	2026-04-29 06:10:23 -07:00
Teknium	aea72c0936	skills: adapt spike/sketch + 2 references from gsd-build/get-shit-done (MIT) (#17421 ) * skills: port spike, sketch, and gates/context-budget references from GSD Adds two new lightweight standalone skills and two reference docs adapted from gsd-build/get-shit-done (MIT © 2025 Lex Christopherson). All ports coexist cleanly with a full `npx get-shit-done-cc --hermes --global` install — GSD lives under `skills/gsd-/`, these ports live at their natural Hermes category paths, zero name collisions. New skills: - skills/software-development/spike/ — Lightweight "spike an idea with throwaway experiments" workflow: decompose into Given/When/Then questions, research per-spike, build comparable variants, close with VALIDATED/PARTIAL/INVALIDATED verdict. Standalone alternative to the full `gsd-spike` (which requires `.planning/spikes/` state machinery and the rest of GSD). - skills/creative/sketch/ — Lightweight "sketch 2-3 HTML design variants" workflow: intake (feel, references, core action), produce differentiated variants along a design axis, head-to-head comparison. Standalone alternative to the full `gsd-sketch`. New references under subagent-driven-development/: - references/context-budget-discipline.md — Four-tier context degradation model (PEAK/GOOD/DEGRADING/POOR at 0-30%/30-50%/50-70%/70%+) with read-depth rules that scale with context window size, plus early warning signs of silent degradation (silent partial completion, increasing vagueness, skipped protocol steps). - references/gates-taxonomy.md — Four canonical gate types for validation checkpoints: Pre-flight (precondition block), Revision (bounded retry loop with stall detection), Escalation (pause for human decision), Abort (terminate to prevent damage). Each ships with behavior, recovery, and examples. Collision guard: each port has explicit "If the user has the full GSD system installed" guidance directing the agent to prefer `gsd-spike` / `gsd-sketch` when the full workflow is available. Verified end-to-end with 86 GSD skills + these 2 Hermes ports installed in the same HERMES_HOME — 90 total skills, zero duplicate names, both counterparts appear in the system prompt with distinct descriptions. Attribution preserved in each SKILL.md footer per MIT notice requirement. Full GSD system now installable via `npx get-shit-done-cc --hermes --global` (gsd-build/get-shit-done#2845). skills/gsd-port: tighten descriptions, surface Hermes-native tools Review feedback adjustments to the spike/sketch ports from the previous commit on this branch: - description lengths trimmed to <=60 chars with trigger-first phrasing (spike: 55 chars 'Throwaway experiments to validate an idea before build.'; sketch: 55 chars 'Throwaway HTML mockups: 2-3 design variants to compare.') - author field credits gsd-build/get-shit-done explicitly - stale duplicate top-level `tags:` removed from sketch frontmatter (Hermes reads only metadata.hermes.tags — the top-level field was dead weight) - spike research step now shows concrete Hermes tool calls (web_search, web_extract with real URLs, terminal for venv inspection) instead of just naming the tool names - spike build step adds a worked tool-sequence example (terminal + write_file + terminal to run) and a delegate_task fan-out pattern for parallel comparison spikes (002a / 002b) - sketch build step adds browser_navigate + browser_vision verification step — visual spot-check that catches layout bugs pure source inspection misses - sketch Output section adds a worked tool-sequence example mirroring the spike pattern Descriptions now lead with 'Throwaway' (the pattern-match word that signals 'disposable / not production code') — gives the agent a clean activation signal in the system-prompt skill index.	2026-04-29 06:10:05 -07:00
vominh1919	fe6c86623f	fix: close file descriptor in LocalEnvironment._update_cwd _update_cwd() uses a bare open(self._cwd_file).read() that never closes the file descriptor. This method runs on every terminal command execution, so the fd leaks accumulate in long sessions. Use a with statement so the fd is released promptly. Fixes #15552 (standalone resubmission)	2026-04-29 05:46:52 -07:00
teknium1	258755a24f	test(weixin): cover _is_stale_session_ret helper (#17228 ) Regression test for the ret=-2 / errmsg='unknown error' disambiguation: - ret=-2 or errcode=-2 with 'unknown error' → stale session (True) - ret=-2 with 'freq limit' or other errmsg → rate limit (False) - ret=-14 → not matched here (handled by SESSION_EXPIRED_ERRCODE path) - Success codes and missing errmsg → False	2026-04-29 05:44:44 -07:00
vominh1919	e9b96fd050	fix: recognize ret=-2 as stale-session signal in Weixin adapter The Weixin adapter only recognized errcode=-14 as a session-expired signal. However, iLink also returns ret=-2 with errmsg="unknown error" for the same underlying condition (stale session). The adapter treated ret=-2 as a rate-limit, exhausting retries with the same stale context_token instead of refreshing the session. Added _is_stale_session_ret() helper that distinguishes ret=-2 with "unknown error" from genuine rate limits. Updated both the poll loop and _send_text_chunk to use the helper. Fixes NousResearch/hermes-agent#17228	2026-04-29 05:44:44 -07:00
teknium1	b0435cc164	fix(model_tools): cancel coroutine on timeout so worker thread exits + log full traceback _run_async() bridges sync tool handlers to async code. When the handler is invoked from inside a running event loop (gateway / nested async), it spawns a worker thread and blocks on future.result(timeout=300). Before this change, a coroutine that ran past 300s leaked its worker thread: - future.cancel() is a no-op on a running ThreadPoolExecutor future (cancel only works on not-yet-started work). - pool.shutdown(wait=False, cancel_futures=True) let the caller proceed but the worker kept running the coroutine until it returned on its own. Every tool timeout leaked one thread. In long-lived gateway / RL sessions this is cumulative. The fix replaces bare asyncio.run() with a worker wrapper that creates its own event loop. On timeout, _run_async schedules task.cancel() on that loop via call_soon_threadsafe, then shuts the pool down with wait=False so the caller returns immediately. The coroutine observes CancelledError at its next await and the worker thread exits cleanly. Also switches logger.error() to logger.exception() in the top-level handle_function_call() except block so tool failures produce full stack traces in errors.log instead of just the message. Related: #17420 (contributor flagged the leak; the original fix used pool.shutdown(wait=True) which would have converted the leak into a hang — caller blocks forever on the same stuck coroutine). Credit for identifying the leak goes to the contributor. Co-authored-by: 0z! <162235745+0z1-ghb@users.noreply.github.com>	2026-04-29 05:00:40 -07:00
teknium1	46437966cc	chore(release): map tmimmanuel email to GitHub login	2026-04-29 05:00:37 -07:00

1 2 3 4 5 ...

6602 commits