hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-25 00:51:20 +00:00

Author	SHA1	Message	Date
Brooklyn Nicholson	2e75460066	test(ui-tui): add log-update diff contract tests - steady-state diff skips unchanged rows - width change emits clearTerminal before repaint - drift repro: prev.screen desync from terminal leaves orphaned cells no code path can reach	2026-04-23 12:08:23 -05:00
kshitij	82a0ed1afb	feat: add Xiaomi MiMo v2.5-pro and v2.5 model support (#14635 ) ## Merged Adds MiMo v2.5-pro and v2.5 support to Xiaomi native provider, OpenCode Go, and setup wizard. ### Changes - Context lengths: added v2.5-pro (1M) and v2.5 (1M), corrected existing MiMo entries to exact values (262144) - Provider lists: xiaomi, opencode-go, setup wizard - Vision: upgraded from mimo-v2-omni to mimo-v2.5 (omnimodal) - Config description updated for XIAOMI_API_KEY - Tests updated for new vision model preference ### Verification - 4322 tests passed, 0 new regressions - Live API tested on Xiaomi portal: basic, reasoning, tool calling, multi-tool, file ops, system prompt, vision — all pass - Self-review found and fixed 2 issues (redundant vision check, stale HuggingFace context length)	2026-04-23 10:06:25 -07:00
Brooklyn Nicholson	071bdb5a3f	Revert "fix(ui-tui): force full xterm.js alt-screen repaints" This reverts commit `bc9518f660`.	2026-04-23 11:55:09 -05:00
Brooklyn Nicholson	bc9518f660	fix(ui-tui): force full xterm.js alt-screen repaints - force full alt-screen damage in xterm.js hosts to avoid stale glyph artifacts - skip incremental scroll optimization there and repaint from a cleared screen atomically	2026-04-23 11:44:27 -05:00
Teknium	ce089169d5	feat(skills-guard): gate agent-created scanner on config.skills.guard_agent_created (default off) Replaces the blanket 'always allow' change from the previous commit with an opt-in config flag so users who want belt-and-suspenders security can still get the keyword scan on skill_manage output. ## Default behavior (flag off) skill_manage(action='create'\|'edit'\|'patch') no longer runs the keyword scanner. The agent can write skills that mention risky keywords in prose (documenting what reviewers should watch for, describing cache-bust semantics in a PR-review skill, referencing AGENTS.md, etc.) without getting blocked. Rationale: the agent can already execute the same code paths via terminal() with no gate, so the scan adds friction without meaningful security against a compromised or malicious agent. ## Opt-in behavior (flag on) Set skills.guard_agent_created: true in config.yaml to get the original behavior back. Scanner runs on every skill_manage write; dangerous verdicts surface as a tool error the agent can react to (retry without the flagged content). ## External hub installs unaffected trusted/community sources (hermes skills install) always get scanned regardless of this flag. The gate is specifically for skill_manage, which only agents call. ## Changes - hermes_cli/config.py: add skills.guard_agent_created: False to DEFAULT_CONFIG - tools/skill_manager_tool.py: _guard_agent_created_enabled() reads the flag; _security_scan_skill() short-circuits to None when the flag is off - tools/skills_guard.py: restore INSTALL_POLICY['agent-created'] = ('allow', 'allow', 'ask') so the scan remains strict when it does run - tests/tools/test_skills_guard.py: restore original ask/force tests - tests/tools/test_skill_manager_tool.py: new TestSecurityScanGate class covering both flag states + config error handling ## Validation - tests/tools/test_skills_guard.py + test_skill_manager_tool.py: 115/115 pass - E2E: flagged-keyword skill creates with default config, blocks with flag on	2026-04-23 06:20:47 -07:00
Teknium	e3c0084140	fix(skills-guard): allow agent-created dangerous verdicts without confirmation The security scanner is meant to protect against hostile external skills pulled from GitHub via hermes skills install — trusted/community policies block or ask on dangerous verdicts accordingly. But agent-created skills (from skill_manage) run in the same process as the agent that wrote them. The agent can already execute the same code paths via terminal() with no gate, so the ask-on-dangerous policy adds friction without meaningful security. Concrete trigger: an agent writing a PR-review skill that describes cache-busting or persistence semantics in prose gets blocked because those words appear in the patterns list. The skill isn't actually doing anything dangerous — it's just documenting what reviewers should watch for in other PRs. Change: agent-created dangerous verdict maps to 'allow' instead of 'ask'. External hub installs (trusted/community) keep their stricter policies intact. Tests updated: renamed test_dangerous_agent_created_asks → test_dangerous_agent_created_allowed; renamed force-override test and updated assertion since force is now a no-op for agent-created (the allow branch returns first).	2026-04-23 05:18:44 -07:00
Teknium	5651a73331	fix(gateway): guard-match the finally-block _active_sessions delete Before this, _process_message_background's finally did an unconditional 'del self._active_sessions[session_key]' — even if a /stop/ /new command had already swapped in its own command_guard via _dispatch_active_session_command and cancelled us. The old task's unwind would clobber the newer guard, opening a race for follow-ups. Replace with _release_session_guard(session_key, guard=interrupt_event) so the delete only fires when the guard we captured is still the one installed. The sibling _session_tasks pop already had equivalent ownership matching via asyncio.current_task() identity; this closes the asymmetry. Adds two direct regressions in test_session_split_brain_11016: - stale guard reference must not clobber a newer guard by identity - guard=None default still releases unconditionally (for callers that don't have a captured guard to match against) Refs #11016	2026-04-23 05:15:52 -07:00
Teknium	81d925f2a5	chore(release): map dyxushuai and etcircle in AUTHOR_MAP Personal gmail and noreply pattern for the contributors whose commits are preserved on the salvage PR for issue #11016.	2026-04-23 05:15:52 -07:00
Teknium	ec02d905c9	test(gateway): regressions for issue #11016 split-brain session locks Covers all three layers of the salvaged fix: 1. Adapter-side cancellation: /stop, /new, /reset cancel the in-flight adapter task, release the guard, and let follow-up messages through; /new keeps the guard installed until the runner response lands, then drains the queued follow-up in order. 2. Adapter-side self-heal: a split-brain guard (done owner task, lock still live) is healed on the next inbound message and the user gets a reply instead of being trapped in infinite busy acks. A guard with no recorded owner task is NOT auto-healed (protects fixtures that install guards directly). 3. Runner-side generation guard: stale async runs whose generation was bumped by /stop or /new cannot clear a newer run's _running_agents slot on the way out. 11 tests, all green. Refs #11016	2026-04-23 05:15:52 -07:00
etcircle	b7bdf32d4e	fix(gateway): guard session slot ownership after stop/reset Closes the runner-side half of the split-brain described in issue #11016 by wiring the existing _session_run_generation counter through the session-slot promotion and release paths. Without this, an older async run could still: - promote itself from sentinel to real agent after /stop or /new invalidated its run generation - clear _running_agents on the way out, deleting a newer run's slot Both races leave _running_agents desynced from what the user actually has in flight, which is half of what shows up as 'No active task to stop' followed by late 'Interrupting current task...' acks. Changes: - track_agent() in _run_agent now calls _is_session_run_current() before writing the real agent into _running_agents[session_key]; if /stop or /new bumped the generation while the agent was spinning up, the slot is left alone (the newer run owns it). - _release_running_agent_state() gained an optional run_generation keyword. When provided, it only clears the slot if the generation is still current. The final cleanup at the tail of _run_agent passes the run's generation so an old unwind can't blow away a newer run's state. - Returns bool so callers can tell when a release was blocked. All the existing call sites that do NOT pass run_generation behave exactly as before — this is a strict additive guard. Refs #11016	2026-04-23 05:15:52 -07:00
dyxushuai	d72985b7ce	fix(gateway): serialize reset command handoff and heal stale session locks Closes the adapter-side half of the split-brain described in issue #11016 where _active_sessions stays live but nothing is processing, trapping the chat in repeated 'Interrupting current task...' while /stop reports no active task. Changes on BasePlatformAdapter: - Add _session_tasks: Dict[str, asyncio.Task] mapping session -> owner task so session-terminating commands can cancel the right task and old task finally blocks can't clobber a newer task's guard. - Add _release_session_guard(guard=...) that only releases if the guard Event still matches, preventing races where /stop or /new swaps in a temporary guard while the old task unwinds. - Add _session_task_is_stale() and _heal_stale_session_lock() for on-entry self-heal: when handle_message() sees an _active_sessions entry whose RECORDED owner task is done/cancelled, clear it and fall through to normal dispatch. No owner task recorded = not stale (some tests install guards directly and shouldn't be auto-healed). - Add cancel_session_processing() as the explicit adapter-side cancel API so /stop/ /new/ /reset can cleanly tear down in-flight work. - Route /stop, /new, /reset through _dispatch_active_session_command(): 1. install a temporary command guard so follow-ups stay queued 2. let the runner process the command 3. cancel the old adapter task AFTER the runner response is ready 4. release the command guard and drain the latest pending follow-up - _start_session_processing() replaces the inline create_task + guard setup in handle_message() so guard + owner-task entry land atomically. - cancel_background_tasks() also clears _session_tasks. Combined, this means: - /stop / /new / /reset actually cancel stuck work instead of leaving adapter state desynced from runner state. - A dead session lock self-heals on the next inbound message rather than persisting until gateway restart. - Follow-up messages after /new are processed in order, after the reset command's runner response lands. Refs #11016	2026-04-23 05:15:52 -07:00
Teknium	5a26938aa5	fix(terminal): auto-source ~/.profile and ~/.bash_profile so n/nvm PATH survives (#14534 ) The environment-snapshot login shell was auto-sourcing only ~/.bashrc when building the PATH snapshot. On Debian/Ubuntu the default ~/.bashrc starts with a non-interactive short-circuit: case $- in i) ;; *) return;; esac Sourcing it from a non-interactive shell returns before any PATH export below that guard runs. Node version managers like n and nvm append their PATH line under that guard, so Hermes was capturing a PATH without ~/n/bin — and the terminal tool saw 'node: command not found' even when node was on the user's interactive shell PATH. Expand the auto-source list (when auto_source_bashrc is on) to: ~/.profile → ~/.bash_profile → ~/.bashrc ~/.profile and ~/.bash_profile have no interactivity guard — installers that write their PATH there (n's n-install, nvm's curl installer on most setups) take effect. ~/.bashrc still runs last to preserve behaviour for users who put PATH logic there without the guard. Added two tests covering the new behaviour plus an E2E test that spins up a real LocalEnvironment with a guard-prefixed ~/.bashrc and a ~/.profile PATH export, and verifies the captured snapshot PATH contains the profile entry.	2026-04-23 05:15:37 -07:00
Teknium	d45c738a52	fix(gateway): preflight user D-Bus before systemctl --user start (#14531 ) On fresh RHEL/Debian SSH sessions without linger, `systemctl --user start hermes-gateway` fails with 'Failed to connect to bus: No medium found' because /run/user/$UID/bus doesn't exist. Setup previously showed a raw CalledProcessError and continued claiming success, so the gateway never actually started. systemd_start() and systemd_restart() now call _preflight_user_systemd() for the user scope first: - Bus socket already there → no-op (desktop / linger-enabled servers) - Linger off → try loginctl enable-linger (works when polkit permits, needs sudo otherwise), wait for socket - Still unreachable → raise UserSystemdUnavailableError with a clean remediation message pointing to sudo loginctl + hermes gateway run as the foreground fallback Setup's start/restart handlers and gateway_command() catch the new exception and render the multi-line guidance instead of a traceback.	2026-04-23 05:09:38 -07:00
Teknium	d50be05b1c	chore(release): map j0sephz in AUTHOR_MAP	2026-04-23 05:09:08 -07:00
Teknium	24e8a6e701	feat(skills_sync): surface collision with reset-hint When a newly-bundled skill's name collides with a pre-existing user skill, sync silently kept the user's copy. Users never learned that a bundled version shipped by that name. Now (on non-quiet sync only) print: ⚠ <name>: bundled version shipped but you already have a local skill by this name — yours was kept. Run `hermes skills reset <name>` to replace it with the bundled version. No behavior change to manifest writes or to the kept user copy — purely additive warning on the existing collision-skip path.	2026-04-23 05:09:08 -07:00
j0sephz	3a97fb3d47	fix(skills_sync): don't poison manifest on new-skill collision When a new bundled skill's name collided with a pre-existing user skill (from hub, custom, or leftover), sync_skills() recorded the bundled hash in the manifest even though the on-disk copy was unrelated to bundled. On the next sync, user_hash != origin_hash (bundled_hash) marked the skill as "user-modified" permanently, blocking all bundled updates for that skill until the user ran `hermes skills reset`. Fix: only baseline the manifest entry when the user's on-disk copy is byte-identical to bundled (safe to track — this is the reset re-sync or coincidentally-identical install case). Otherwise skip the manifest write entirely: the on-disk skill is unrelated to bundled and shouldn't be tracked as if it were. This preserves reset_bundled_skill()'s re-baseline flow (its post-delete sync still writes to the manifest when user copy matches bundled) while fixing the poisoning scenario for genuinely unrelated collisions. Adds two tests following the existing test_failed_copy_does_not_poison_manifest pattern: one verifying the manifest stays clean after a collision with differing content, one verifying no false user_modified flag on resync.	2026-04-23 05:09:08 -07:00
Siddharth Balyan	91d6ea07c8	chore(dev): add ruff linter to dev deps and configure in pyproject.toml (#14527 ) Adds ruff (fast Python linter from Astral) as a dev dependency and sets up initial config with all files excluded — ruff is entirely disabled for now, this just lands the config for slow rollout enabling it module-by-module in follow-up PRs.	2026-04-23 17:20:18 +05:30
Siddharth Balyan	fdcb3e9a4b	chore(dev): add ty type checker to dev deps and configure in pyproject.toml (#14525 ) Adds ty (Red Knot) as a dev dependency and sets up initial configuration with all files excluded — to be incrementally enabled per-module.	2026-04-23 17:15:57 +05:30
Teknium	627abbb1ea	chore(release): map davidvv in AUTHOR_MAP	2026-04-23 03:10:30 -07:00
David VV	39fcf1d127	fix(model_switch): group custom_providers by endpoint in /model picker (#9210 ) Multiple custom_providers entries sharing the same base_url + api_key are now grouped into a single picker row. A local Ollama host with per-model display names ("Ollama — GLM 5.1", "Ollama — Qwen3-coder", "Ollama — Kimi K2", "Ollama — MiniMax M2.7") previously produced four near-duplicate picker rows that differed only by suffix; now it appears as one "Ollama" row with four models. Key changes: - Grouping key changed from slug-by-name to (base_url, api_key). Names frequently differ per model while the endpoint stays the same. - When the grouped endpoint matches current_base_url, the row's slug is set to current_provider so picker-driven switches route through the live credential pipeline (no re-resolution needed). - Per-model suffix is stripped from the display name ("Ollama — X" → "Ollama") via em-dash / " - " separators. - Two groups with different api_keys at the same base_url (or otherwise colliding on cleaned name) are disambiguated with a numeric suffix (custom:openai, custom:openai-2) so both stay visible. - current_base_url parameter plumbed through both gateway call sites. Existing #8216, #11499, #13509 regressions covered (dict/list shapes of models:, section-3/section-4 dedup, normalized list-format entries). Salvaged from @davidvv's PR #9210 — the underlying code had diverged ~1400 commits since that PR was opened, so this is a reconstruction of the same approach on current main rather than a clean cherry-pick. Authorship preserved via --author on this commit. Closes #9210	2026-04-23 03:10:30 -07:00
Teknium	6172f95944	chore(release): map GuyCui in AUTHOR_MAP	2026-04-23 03:10:04 -07:00
GuyCui	b24d239ce1	Update permissions for config.yaml Fix config.yaml permission drift on startup	2026-04-23 03:10:04 -07:00
Teknium	cd9cd1b159	chore(release): map MikeFac in AUTHOR_MAP	2026-04-23 03:08:53 -07:00
MikeFac	78e213710c	fix: guard against None tirith path in security scanner When _resolve_tirith_path() returns None (e.g. install failed on unsupported platform or all resolution paths exhausted), the function passed None directly to subprocess.run(), causing a TypeError instead of respecting the fail_open config. Add a None check before the subprocess call that allows or blocks according to the configured fail_open policy, matching the existing error handling behavior for OSError and TimeoutExpired.	2026-04-23 03:08:53 -07:00
Teknium	4f4fd21149	chore(release): map vivganes in AUTHOR_MAP	2026-04-23 03:07:06 -07:00
Vivek Ganesan	7ca2f70055	fix(docs): Add links to Atropos and wandb in user guide fix #7724 The user guide has mention of atropos and wandb but no links. This PR adds links so that users dont have to search for them.	2026-04-23 03:07:06 -07:00
Teknium	dab36d9511	chore(release): map phpoh in AUTHOR_MAP	2026-04-23 03:05:49 -07:00
phpoh	4c02e4597e	fix(status): catch OSError in os.kill(pid, 0) for Windows compatibility On Windows, os.kill(nonexistent_pid, 0) raises OSError with WinError 87 ("The parameter is incorrect") instead of ProcessLookupError. Without catching OSError, the acquire_scoped_lock() and get_running_pid() paths crash on any invalid PID check — preventing gateway startup on Windows whenever a stale PID file survives from a prior run. Adapted @phpoh's fix in #12490 onto current main. The main file was refactored in the interim (get_running_pid now iterates over (primary_record, fallback_record) with a per-iteration try/except), so the OSError catch is added as a new except clause after PermissionError (which is a subclass of OSError, so order matters: PermissionError must match first). Co-authored-by: phpoh <1352808998@qq.com>	2026-04-23 03:05:49 -07:00
Aslaaen	51c1d2de16	fix(profiles): stage profile imports to prevent directory clobbering	2026-04-23 03:02:34 -07:00
Teknium	08cb345e24	chore(release): map Lind3ey in AUTHOR_MAP	2026-04-23 03:02:09 -07:00
Lind3ey	9dba75bc38	fix(feishu): issue where streaming edits in Feishu show extra leading newlines	2026-04-23 03:02:09 -07:00
Teknium	8f50f2834a	chore(release): add Wysie to AUTHOR_MAP	2026-04-23 03:01:18 -07:00
Wysie	be99feff1f	fix(image-gen): force-refresh plugin providers in long-lived sessions	2026-04-23 03:01:18 -07:00
Teknium	911f57ad97	chore(release): map TaroballzChen in AUTHOR_MAP	2026-04-23 02:37:15 -07:00
TaroballzChen	5d09474348	fix(tools): enforce ACP transport overrides in delegate_task child agents When override_acp_command was passed to _build_child_agent, it failed to override effective_provider to 'copilot-acp' and effective_api_mode to 'chat_completions'. This caused the child AIAgent to inherit the parent's native API configuration (e.g. Anthropic) and attempt real HTTP requests using the parent's API key, leading to HTTP 401 errors and completely bypassing the ACP subprocess. Ensure that if an ACP command override is provided, the child agent correctly routes through CopilotACPClient. Refs #2653	2026-04-23 02:37:15 -07:00
Teknium	33773ed5c6	chore(release): map DrStrangerUJN in AUTHOR_MAP	2026-04-23 02:37:07 -07:00
drstrangerujn	a5b0c7e2ec	fix(config): preserve list-format models in custom_providers normalize _normalize_custom_provider_entry silently drops the models field when it's a list. Hand-edited configs (and the shape used by older Hermes versions) still write models as a plain list of ids, so after the normalize pass the entry reaches list_authenticated_providers() with no models and /model shows the provider with (0) models — even though the underlying picker code handles lists fine. Convert list-format models into the empty-value dict shape the rest of the pipeline already expects. Dict-format entries keep passing through unchanged. Repro (before the fix): custom_providers: - name: acme base_url: https://api.example.com/v1 models: [foo, bar, baz] /model shows "acme (0)"; bypassing normalize in list_authenticated_providers returns three models, confirming the drop happens in normalize. Adds four unit tests covering list→dict conversion, dict pass-through, filtering of empty/non-string entries, and the empty-list case.	2026-04-23 02:37:07 -07:00
Teknium	c80cc8557e	chore(release): map RyanLee-Dev in AUTHOR_MAP	2026-04-23 02:35:13 -07:00
yuanhe	1df0c812c4	feat(skills): add MiniMax-AI/cli as default skill tap Adds MiniMax-AI/cli to the default taps list so the mmx-cli skill is discoverable and installable out of the box via /skills browse and /skills install. The skill definition lives upstream at github.com/MiniMax-AI/cli/skill/SKILL.md, keeping updates decoupled. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-23 02:35:13 -07:00
Teknium	b5ec6e8df7	chore(release): map sharziki in AUTHOR_MAP	2026-04-23 02:34:11 -07:00
sharziki	d7452af257	fix(pairing): handle null user_name in pairing list display When user_name is stored as None (e.g. Telegram users without a display name), dict.get('user_name', '') returns None because the key exists — the default is only used for missing keys. This causes a TypeError when the format specifier :<20 is applied to None. Use `or ''` to coerce None to an empty string. Fixes #7392 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-23 02:34:11 -07:00
Teknium	48923e5a3d	chore(release): map azhengbot in AUTHOR_MAP	2026-04-23 02:32:56 -07:00
azhengbot	f77da7de42	Rename _api_call_with_interrupt to _interruptible_api_call	2026-04-23 02:32:56 -07:00
azhengbot	36adcebe6c	Rename API call function to _interruptible_api_call	2026-04-23 02:32:56 -07:00
kshitijk4poor	43de1ca8c2	refactor: remove _nr_to_assistant_message shim + fix flush_memories guard NormalizedResponse and ToolCall now have backward-compat properties so the agent loop can read them directly without the shim: ToolCall: .type, .function (returns self), .call_id, .response_item_id NormalizedResponse: .reasoning_content, .reasoning_details, .codex_reasoning_items This eliminates the 35-line shim and its 4 call sites in run_agent.py. Also changes flush_memories guard from hasattr(response, 'choices') to self.api_mode in ('chat_completions', 'bedrock_converse') so it works with raw boto3 dicts too. WS1 items 3+4 of Cycle 2 (#14418).	2026-04-23 02:30:05 -07:00
kshitijk4poor	f4612785a4	refactor: collapse normalize_anthropic_response to return NormalizedResponse directly 3-layer chain (transport → v2 → v1) was collapsed to 2-layer in PR 7. This collapses the remaining 2-layer (transport → v1 → NR mapping in transport) to 1-layer: v1 now returns NormalizedResponse directly. Before: adapter returns (SimpleNamespace, finish_reason) tuple, transport unpacks and maps to NormalizedResponse (22 lines). After: adapter returns NormalizedResponse, transport is a 1-line passthrough. Also updates ToolCall construction — adapter now creates ToolCall dataclass directly instead of SimpleNamespace(id, type, function). WS1 item 1 of Cycle 2 (#14418).	2026-04-23 02:30:05 -07:00
kshitijk4poor	738d0900fd	refactor: migrate auxiliary_client Anthropic path to use transport Replace direct normalize_anthropic_response() call in _AnthropicCompletionsAdapter.create() with AnthropicTransport.normalize_response() via get_transport(). Before: auxiliary_client called adapter v1 directly, bypassing the transport layer entirely. After: auxiliary_client → get_transport('anthropic_messages') → transport.normalize_response() → adapter v1 → NormalizedResponse. The adapter v1 function (normalize_anthropic_response) now has zero callers outside agent/anthropic_adapter.py and the transport. This unblocks collapsing v1 to return NormalizedResponse directly in a follow-up (the remaining 2-layer chain becomes 1-layer). WS1 item 2 of Cycle 2 (#14418).	2026-04-23 02:30:05 -07:00
Teknium	1c532278ae	chore(release): map lvnilesh in AUTHOR_MAP	2026-04-23 02:30:00 -07:00
Nilesh	22afa066f8	fix(cron): guard against non-dict result from run_conversation When run_conversation returns a non-dict value (e.g. an int under error conditions), the subsequent result.get("final_response", "") raises an opaque "'int' object has no attribute 'get'" AttributeError. Add a type guard that converts this into a clear RuntimeError, which is properly caught by the outer except Exception handler that marks the job as failed and delivers the error message. Fixes NousResearch/hermes-agent#9433 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-23 02:30:00 -07:00
Teknium	5e76c650bb	chore(release): map yzx9 in AUTHOR_MAP	2026-04-23 02:06:16 -07:00

1 2 3 4 5 ...

5607 commits