hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-25 00:51:20 +00:00

Author	SHA1	Message	Date
alt-glitch	420c4d02e2	refactor(acp): rewrite imports and update infra for hermes_agent.acp Rewrite all acp_adapter imports to hermes_agent.acp in source, tests, and pyproject.toml. Convert relative imports to absolute per manifest convention. Strip sys.path hack from entry.py (redundant with editable install). Update pyproject.toml entry point and packages.find. Part of #14586, #14182	2026-04-23 21:02:03 +05:30
Teknium	ce089169d5	feat(skills-guard): gate agent-created scanner on config.skills.guard_agent_created (default off) Replaces the blanket 'always allow' change from the previous commit with an opt-in config flag so users who want belt-and-suspenders security can still get the keyword scan on skill_manage output. ## Default behavior (flag off) skill_manage(action='create'\|'edit'\|'patch') no longer runs the keyword scanner. The agent can write skills that mention risky keywords in prose (documenting what reviewers should watch for, describing cache-bust semantics in a PR-review skill, referencing AGENTS.md, etc.) without getting blocked. Rationale: the agent can already execute the same code paths via terminal() with no gate, so the scan adds friction without meaningful security against a compromised or malicious agent. ## Opt-in behavior (flag on) Set skills.guard_agent_created: true in config.yaml to get the original behavior back. Scanner runs on every skill_manage write; dangerous verdicts surface as a tool error the agent can react to (retry without the flagged content). ## External hub installs unaffected trusted/community sources (hermes skills install) always get scanned regardless of this flag. The gate is specifically for skill_manage, which only agents call. ## Changes - hermes_cli/config.py: add skills.guard_agent_created: False to DEFAULT_CONFIG - tools/skill_manager_tool.py: _guard_agent_created_enabled() reads the flag; _security_scan_skill() short-circuits to None when the flag is off - tools/skills_guard.py: restore INSTALL_POLICY['agent-created'] = ('allow', 'allow', 'ask') so the scan remains strict when it does run - tests/tools/test_skills_guard.py: restore original ask/force tests - tests/tools/test_skill_manager_tool.py: new TestSecurityScanGate class covering both flag states + config error handling ## Validation - tests/tools/test_skills_guard.py + test_skill_manager_tool.py: 115/115 pass - E2E: flagged-keyword skill creates with default config, blocks with flag on	2026-04-23 06:20:47 -07:00
Teknium	e3c0084140	fix(skills-guard): allow agent-created dangerous verdicts without confirmation The security scanner is meant to protect against hostile external skills pulled from GitHub via hermes skills install — trusted/community policies block or ask on dangerous verdicts accordingly. But agent-created skills (from skill_manage) run in the same process as the agent that wrote them. The agent can already execute the same code paths via terminal() with no gate, so the ask-on-dangerous policy adds friction without meaningful security. Concrete trigger: an agent writing a PR-review skill that describes cache-busting or persistence semantics in prose gets blocked because those words appear in the patterns list. The skill isn't actually doing anything dangerous — it's just documenting what reviewers should watch for in other PRs. Change: agent-created dangerous verdict maps to 'allow' instead of 'ask'. External hub installs (trusted/community) keep their stricter policies intact. Tests updated: renamed test_dangerous_agent_created_asks → test_dangerous_agent_created_allowed; renamed force-override test and updated assertion since force is now a no-op for agent-created (the allow branch returns first).	2026-04-23 05:18:44 -07:00
Teknium	5651a73331	fix(gateway): guard-match the finally-block _active_sessions delete Before this, _process_message_background's finally did an unconditional 'del self._active_sessions[session_key]' — even if a /stop/ /new command had already swapped in its own command_guard via _dispatch_active_session_command and cancelled us. The old task's unwind would clobber the newer guard, opening a race for follow-ups. Replace with _release_session_guard(session_key, guard=interrupt_event) so the delete only fires when the guard we captured is still the one installed. The sibling _session_tasks pop already had equivalent ownership matching via asyncio.current_task() identity; this closes the asymmetry. Adds two direct regressions in test_session_split_brain_11016: - stale guard reference must not clobber a newer guard by identity - guard=None default still releases unconditionally (for callers that don't have a captured guard to match against) Refs #11016	2026-04-23 05:15:52 -07:00
Teknium	ec02d905c9	test(gateway): regressions for issue #11016 split-brain session locks Covers all three layers of the salvaged fix: 1. Adapter-side cancellation: /stop, /new, /reset cancel the in-flight adapter task, release the guard, and let follow-up messages through; /new keeps the guard installed until the runner response lands, then drains the queued follow-up in order. 2. Adapter-side self-heal: a split-brain guard (done owner task, lock still live) is healed on the next inbound message and the user gets a reply instead of being trapped in infinite busy acks. A guard with no recorded owner task is NOT auto-healed (protects fixtures that install guards directly). 3. Runner-side generation guard: stale async runs whose generation was bumped by /stop or /new cannot clear a newer run's _running_agents slot on the way out. 11 tests, all green. Refs #11016	2026-04-23 05:15:52 -07:00
Teknium	5a26938aa5	fix(terminal): auto-source ~/.profile and ~/.bash_profile so n/nvm PATH survives (#14534 ) The environment-snapshot login shell was auto-sourcing only ~/.bashrc when building the PATH snapshot. On Debian/Ubuntu the default ~/.bashrc starts with a non-interactive short-circuit: case $- in i) ;; *) return;; esac Sourcing it from a non-interactive shell returns before any PATH export below that guard runs. Node version managers like n and nvm append their PATH line under that guard, so Hermes was capturing a PATH without ~/n/bin — and the terminal tool saw 'node: command not found' even when node was on the user's interactive shell PATH. Expand the auto-source list (when auto_source_bashrc is on) to: ~/.profile → ~/.bash_profile → ~/.bashrc ~/.profile and ~/.bash_profile have no interactivity guard — installers that write their PATH there (n's n-install, nvm's curl installer on most setups) take effect. ~/.bashrc still runs last to preserve behaviour for users who put PATH logic there without the guard. Added two tests covering the new behaviour plus an E2E test that spins up a real LocalEnvironment with a guard-prefixed ~/.bashrc and a ~/.profile PATH export, and verifies the captured snapshot PATH contains the profile entry.	2026-04-23 05:15:37 -07:00
Teknium	d45c738a52	fix(gateway): preflight user D-Bus before systemctl --user start (#14531 ) On fresh RHEL/Debian SSH sessions without linger, `systemctl --user start hermes-gateway` fails with 'Failed to connect to bus: No medium found' because /run/user/$UID/bus doesn't exist. Setup previously showed a raw CalledProcessError and continued claiming success, so the gateway never actually started. systemd_start() and systemd_restart() now call _preflight_user_systemd() for the user scope first: - Bus socket already there → no-op (desktop / linger-enabled servers) - Linger off → try loginctl enable-linger (works when polkit permits, needs sudo otherwise), wait for socket - Still unreachable → raise UserSystemdUnavailableError with a clean remediation message pointing to sudo loginctl + hermes gateway run as the foreground fallback Setup's start/restart handlers and gateway_command() catch the new exception and render the multi-line guidance instead of a traceback.	2026-04-23 05:09:38 -07:00
Teknium	24e8a6e701	feat(skills_sync): surface collision with reset-hint When a newly-bundled skill's name collides with a pre-existing user skill, sync silently kept the user's copy. Users never learned that a bundled version shipped by that name. Now (on non-quiet sync only) print: ⚠ <name>: bundled version shipped but you already have a local skill by this name — yours was kept. Run `hermes skills reset <name>` to replace it with the bundled version. No behavior change to manifest writes or to the kept user copy — purely additive warning on the existing collision-skip path.	2026-04-23 05:09:08 -07:00
j0sephz	3a97fb3d47	fix(skills_sync): don't poison manifest on new-skill collision When a new bundled skill's name collided with a pre-existing user skill (from hub, custom, or leftover), sync_skills() recorded the bundled hash in the manifest even though the on-disk copy was unrelated to bundled. On the next sync, user_hash != origin_hash (bundled_hash) marked the skill as "user-modified" permanently, blocking all bundled updates for that skill until the user ran `hermes skills reset`. Fix: only baseline the manifest entry when the user's on-disk copy is byte-identical to bundled (safe to track — this is the reset re-sync or coincidentally-identical install case). Otherwise skip the manifest write entirely: the on-disk skill is unrelated to bundled and shouldn't be tracked as if it were. This preserves reset_bundled_skill()'s re-baseline flow (its post-delete sync still writes to the manifest when user copy matches bundled) while fixing the poisoning scenario for genuinely unrelated collisions. Adds two tests following the existing test_failed_copy_does_not_poison_manifest pattern: one verifying the manifest stays clean after a collision with differing content, one verifying no false user_modified flag on resync.	2026-04-23 05:09:08 -07:00
David VV	39fcf1d127	fix(model_switch): group custom_providers by endpoint in /model picker (#9210 ) Multiple custom_providers entries sharing the same base_url + api_key are now grouped into a single picker row. A local Ollama host with per-model display names ("Ollama — GLM 5.1", "Ollama — Qwen3-coder", "Ollama — Kimi K2", "Ollama — MiniMax M2.7") previously produced four near-duplicate picker rows that differed only by suffix; now it appears as one "Ollama" row with four models. Key changes: - Grouping key changed from slug-by-name to (base_url, api_key). Names frequently differ per model while the endpoint stays the same. - When the grouped endpoint matches current_base_url, the row's slug is set to current_provider so picker-driven switches route through the live credential pipeline (no re-resolution needed). - Per-model suffix is stripped from the display name ("Ollama — X" → "Ollama") via em-dash / " - " separators. - Two groups with different api_keys at the same base_url (or otherwise colliding on cleaned name) are disambiguated with a numeric suffix (custom:openai, custom:openai-2) so both stay visible. - current_base_url parameter plumbed through both gateway call sites. Existing #8216, #11499, #13509 regressions covered (dict/list shapes of models:, section-3/section-4 dedup, normalized list-format entries). Salvaged from @davidvv's PR #9210 — the underlying code had diverged ~1400 commits since that PR was opened, so this is a reconstruction of the same approach on current main rather than a clean cherry-pick. Authorship preserved via --author on this commit. Closes #9210	2026-04-23 03:10:30 -07:00
Aslaaen	51c1d2de16	fix(profiles): stage profile imports to prevent directory clobbering	2026-04-23 03:02:34 -07:00
Wysie	be99feff1f	fix(image-gen): force-refresh plugin providers in long-lived sessions	2026-04-23 03:01:18 -07:00
drstrangerujn	a5b0c7e2ec	fix(config): preserve list-format models in custom_providers normalize _normalize_custom_provider_entry silently drops the models field when it's a list. Hand-edited configs (and the shape used by older Hermes versions) still write models as a plain list of ids, so after the normalize pass the entry reaches list_authenticated_providers() with no models and /model shows the provider with (0) models — even though the underlying picker code handles lists fine. Convert list-format models into the empty-value dict shape the rest of the pipeline already expects. Dict-format entries keep passing through unchanged. Repro (before the fix): custom_providers: - name: acme base_url: https://api.example.com/v1 models: [foo, bar, baz] /model shows "acme (0)"; bypassing normalize in list_authenticated_providers returns three models, confirming the drop happens in normalize. Adds four unit tests covering list→dict conversion, dict pass-through, filtering of empty/non-string entries, and the empty-list case.	2026-04-23 02:37:07 -07:00
kshitijk4poor	43de1ca8c2	refactor: remove _nr_to_assistant_message shim + fix flush_memories guard NormalizedResponse and ToolCall now have backward-compat properties so the agent loop can read them directly without the shim: ToolCall: .type, .function (returns self), .call_id, .response_item_id NormalizedResponse: .reasoning_content, .reasoning_details, .codex_reasoning_items This eliminates the 35-line shim and its 4 call sites in run_agent.py. Also changes flush_memories guard from hasattr(response, 'choices') to self.api_mode in ('chat_completions', 'bedrock_converse') so it works with raw boto3 dicts too. WS1 items 3+4 of Cycle 2 (#14418).	2026-04-23 02:30:05 -07:00
kshitijk4poor	f4612785a4	refactor: collapse normalize_anthropic_response to return NormalizedResponse directly 3-layer chain (transport → v2 → v1) was collapsed to 2-layer in PR 7. This collapses the remaining 2-layer (transport → v1 → NR mapping in transport) to 1-layer: v1 now returns NormalizedResponse directly. Before: adapter returns (SimpleNamespace, finish_reason) tuple, transport unpacks and maps to NormalizedResponse (22 lines). After: adapter returns NormalizedResponse, transport is a 1-line passthrough. Also updates ToolCall construction — adapter now creates ToolCall dataclass directly instead of SimpleNamespace(id, type, function). WS1 item 1 of Cycle 2 (#14418).	2026-04-23 02:30:05 -07:00
Julien Talbot	d8cc85dcdc	review(stt-xai): address cetej's nits - Replace hardcoded 'fr' default with DEFAULT_LOCAL_STT_LANGUAGE ('en') — removes locale leak, matches other providers - Drop redundant default=True on is_truthy_value (dict .get already defaults) - Update auto-detect comment to include 'xai' in the chain - Fix docstring: 21 languages (match PR body + actual xAI API) - Update test_sends_language_and_format to set HERMES_LOCAL_STT_LANGUAGE=fr explicitly, since default is no longer 'fr' All 18 xAI STT tests pass locally.	2026-04-23 01:57:33 -07:00
Julien Talbot	18b29b124a	test(stt): add unit tests for xAI Grok STT provider Covers: - _transcribe_xai: no key, successful transcription, whitespace stripping, API error (HTTP 400), empty transcript, permission error, network error, language/format params sent, custom base_url, diarize config - _get_provider xAI: key set, no key, auto-detect after mistral, mistral preferred over xai, no key returns none - transcribe_audio xAI dispatch: dispatch, default model (grok-stt), model override	2026-04-23 01:57:33 -07:00
helix4u	bace220d29	fix(image-gen): persist plugin provider on reconfigure	2026-04-23 01:56:09 -07:00
Siddharth Balyan	d1ce358646	feat(agent): add PLATFORM_HINTS for matrix, mattermost, and feishu (#14428 ) * feat(agent): add PLATFORM_HINTS for matrix, mattermost, and feishu These platform adapters fully support media delivery (send_image, send_document, send_voice, send_video) but were missing from PLATFORM_HINTS, leaving agents unaware of their platform context, markdown rendering, and MEDIA: tag support. Salvaged from PR #7370 by Rutimka — wecom excluded since main already has a more detailed version. Co-Authored-By: Marco Rutsch <marco@rutimka.de> * test: add missing Markdown assertion for feishu platform hint --------- Co-authored-by: Marco Rutsch <marco@rutimka.de>	2026-04-23 12:50:22 +05:30
Teknium	eda5ae5a5e	feat(image_gen): add openai-codex plugin (gpt-image-2 via Codex OAuth) (#14317 ) New built-in image_gen backend at plugins/image_gen/openai-codex/ that exposes the same gpt-image-2 low/medium/high tier catalog as the existing 'openai' plugin, but routes generation through the ChatGPT/ Codex Responses image_generation tool path. Available whenever the user has Codex OAuth signed in; no OPENAI_API_KEY required. The two plugins are independent — users select between them via 'hermes tools' → Image Generation, and image_gen.provider in config.yaml. The existing 'openai' (API-key) plugin is unchanged. Reuses _read_codex_access_token() and _codex_cloudflare_headers() from agent.auxiliary_client so token expiry / cred-pool / Cloudflare originator handling stays in one place. Inspired by #14047 by @Hygaard, but re-implemented as a separate plugin instead of an in-place fork of the openai plugin. Closes #11195	2026-04-22 20:43:21 -07:00
Teknium	a2a8092e90	feat(cli): add --ignore-user-config and --ignore-rules flags Port from openai/codex#18646. Adds two flags to 'hermes chat' that fully isolate a run from user-level configuration and rules: * --ignore-user-config: skip ~/.hermes/config.yaml and fall back to built-in defaults. Credentials in .env are still loaded so the agent can actually call a provider. * --ignore-rules: skip auto-injection of AGENTS.md, SOUL.md, .cursorrules, and persistent memory (maps to AIAgent(skip_context_files=True, skip_memory=True)). Primary use cases: - Reproducible CI runs that should not pick up developer-local config - Third-party integrations (e.g. Chronicle in Codex) that bring their own config and don't want user preferences leaking in - Bug-report reproduction without the reporter's personal overrides - Debugging: bisect 'was it my config?' vs 'real bug' in one command Both flags are registered on the parent parser AND the 'chat' subparser (with argparse.SUPPRESS on the subparser to avoid overwriting the parent value when the flag is placed before the subcommand, matching the existing --yolo/--worktree/--pass-session-id pattern). Env vars HERMES_IGNORE_USER_CONFIG=1 and HERMES_IGNORE_RULES=1 are set by cmd_chat BEFORE 'from cli import main' runs, which is critical because cli.py evaluates CLI_CONFIG = load_cli_config() at module import time. The cli.py / hermes_cli.config.load_cli_config() function checks the env var and skips ~/.hermes/config.yaml when set. Tests: 11 new tests in tests/hermes_cli/test_ignore_user_config_flags.py covering the env gate, constructor wiring, cmd_chat simulation, and argparse flag registration. All pass; existing hermes_cli + cli suites unaffected (3005 pass, 2 pre-existing unrelated failures).	2026-04-22 19:58:42 -07:00
kshitijk4poor	d30ee2e545	refactor: unify transport dispatch + collapse normalize shims Consolidate 4 per-transport lazy singleton helpers (_get_anthropic_transport, _get_codex_transport, _get_chat_completions_transport, _get_bedrock_transport) into one generic _get_transport(api_mode) with a shared dict cache. Collapse the 65-line main normalize block (3 api_mode branches, each with its own SimpleNamespace shim) into 7 lines: one _get_transport() call + one _nr_to_assistant_message() shared shim. The shim extracts provider_data fields (codex_reasoning_items, reasoning_details, call_id, response_item_id) into the SimpleNamespace shape downstream code expects. Wire chat_completions and bedrock_converse normalize through their transports for the first time — these were previously falling into the raw response.choices[0].message else branch. Remove 8 dead codex adapter imports that have zero callers after PRs 1-6. Transport lifecycle improvements: - Eagerly warm transport cache at __init__ (surfaces import errors early) - Invalidate transport cache on api_mode change (switch_model, fallback activation, fallback restore, transport recovery) — prevents stale transport after mid-session provider switch run_agent.py: -32 net lines (11,988 -> 11,956). PR 7 of the provider transport refactor.	2026-04-22 18:34:25 -07:00
Teknium	36730b90c4	fix(gateway): also clear session-scoped approval state on /new Follow-up to the /resume and /branch cleanup in the previous commit: /new is a conversation-boundary operation too, so session-scoped dangerous-command approvals and /yolo state must not survive it. Adds a scoped unit test for _clear_session_boundary_security_state that also covers the /new path (which calls the same helper).	2026-04-22 18:26:59 -07:00
Es1la	050aabe2d4	fix(gateway): reset approval and yolo state on session boundary	2026-04-22 18:26:59 -07:00
Ubuntu	a3014a4481	fix(docker): add SETUID/SETGID caps so gosu drop in entrypoint succeeds The Docker terminal backend runs containers with `--cap-drop ALL` and re-adds only DAC_OVERRIDE, CHOWN, FOWNER. Since commit `fee0e0d3` ("run as non-root user, use virtualenv") the image entrypoint drops from root to the `hermes` user via `gosu`, which requires CAP_SETUID and CAP_SETGID. Without them every sandbox container exits immediately with: Dropping root privileges error: failed switching to 'hermes': operation not permitted Breaking every terminal/file tool invocation in `terminal.backend: docker` mode. Fix: add SETUID and SETGID to the cap-add list. The `no-new-privileges` security-opt is kept, so gosu still cannot escalate back to root after the one-way drop — the hardening posture is preserved. Reproduction ------------ With any image whose ENTRYPOINT calls `gosu <user>`, the container exits immediately under the pre-fix cap set. Post-fix, the drop succeeds and the container proceeds normally. docker run --rm \ --cap-drop ALL \ --cap-add DAC_OVERRIDE --cap-add CHOWN --cap-add FOWNER \ --security-opt no-new-privileges \ --entrypoint /usr/local/bin/gosu \ hermes-claude:latest hermes id # -> error: failed switching to 'hermes': operation not permitted # Same command with SETUID+SETGID added: # -> uid=10000(hermes) gid=10000(hermes) groups=10000(hermes) Tests ----- Added `test_security_args_include_setuid_setgid_for_gosu_drop` that asserts both caps are present and the overall hardening posture (cap-drop ALL + no-new-privileges) is preserved. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-22 18:13:14 -07:00
Teknium	c345ec9a63	fix(display): strip standalone tool-call XML tags from visible text Port from openclaw/openclaw#67318. Some open models (notably Gemma variants served via OpenRouter) emit tool calls as XML blocks inside assistant content instead of via the structured tool_calls field: <function name="read_file"><parameter name="path">/tmp/x</parameter></function> <tool_call>{"name":"x"}</tool_call> <function_calls>[{...}]</function_calls> Left unstripped, this raw XML leaked to gateway users (Discord, Telegram, Matrix, Feishu, Signal, WhatsApp, etc.) and the CLI, since hermes-agent's existing reasoning-tag stripper handled only <think>/<thinking>/<thought> variants. Extend _strip_think_blocks (run_agent.py) and _strip_reasoning_tags (cli.py) to cover: * <tool_call>, <tool_calls>, <tool_result> * <function_call>, <function_calls> * <function name="..."> ... </function> (Gemma-style) The <function> variant is boundary-gated (only strips when the tag sits at start-of-line or after sentence punctuation AND carries a name="..." attribute) so prose mentions like 'Use <function> declarations in JS' are preserved. Dangling <function name="..."> with no close is intentionally left visible — matches OpenClaw's asymmetry so a truncated streaming tail still reaches the user. Tests: 9 new cases in TestStripThinkBlocks (run_agent) + 9 in new file tests/run_agent/test_strip_reasoning_tags_cli.py. Covers Qwen-style <tool_call>, Gemma-style <function name="...">, multi-line payloads, prose preservation, stray close tags, dangling open tags, and mixed reasoning+tool_call content. Note: this port covers the post-streaming final-text path, which is what gateway adapters and CLI display consume. Extending the per-delta stream filter in gateway/stream_consumer.py to hide these tags live as they stream is a separate follow-up; for now users may see raw XML briefly during a stream before the final cleaned text replaces it. Refs: openclaw/openclaw#67318	2026-04-22 18:12:42 -07:00
brooklyn!	64b61cc24b	Merge pull request #11887 from liftaris/fix/tui-provider-resolution fix(tui): resolve runtime provider in _make_agent	2026-04-22 20:11:21 -05:00
brooklyn!	e47537e99d	Merge pull request #14135 from helix4u/fix/tui-state-db-optional fix(tui): degrade gracefully when state.db init fails	2026-04-22 20:11:07 -05:00
Teknium	9bd1518425	fix(feishu): correct identity model docs and prefer tenant-scoped user_id Feishu's open_id is app-scoped (same user gets different open_ids per bot app), not a canonical identity. Functionally correct for single-bot mode but semantically misleading. - Add comprehensive Feishu identity model documentation to module docstring - Prefer user_id (tenant-scoped) over open_id (app-scoped) in _resolve_sender_profile when both are available - Document bot_open_id usage for @mention matching - Update user_id_alt comment in SessionSource to be platform-generic Ref: closes analysis from PR #8388 (closed as over-scoped)	2026-04-22 18:06:22 -07:00
Teknium	c9c6182839	fix(anthropic): guard max_tokens against non-positive values Port from openclaw/openclaw#66664. The build_anthropic_kwargs call site used 'max_tokens or _get_anthropic_max_output(model)', which correctly falls back when max_tokens is 0 or None (falsy) but lets negative ints (-1, -500), fractional floats (0.5, 8192.7), NaN, and infinity leak through to the Anthropic API. Anthropic rejects these with HTTP 400 ('max_tokens: must be greater than or equal to 1'), turning a local config error into a surprise mid-conversation failure. Add two resolver helpers matching OpenClaw's: _resolve_positive_anthropic_max_tokens — returns int(value) only if value is a finite positive number; excludes bools, strings, NaN, infinity, sub-one positives (floor to 0). _resolve_anthropic_messages_max_tokens — prefers a positive requested value, else falls back to the model's output ceiling; raises ValueError only if no positive budget can be resolved. The context-window clamp at the call site (max_tokens > context_length) is preserved unchanged — it handles oversized values; the new resolver handles non-positive values. These concerns are now cleanly separated. Tests: 17 new cases covering positive/zero/negative ints, fractional floats (both >1 and <1), NaN, infinity, booleans, strings, None, and integration via build_anthropic_kwargs. Refs: openclaw/openclaw#66664	2026-04-22 18:04:47 -07:00
Teknium	7d8b2eee63	fix(delegate): default inherit_mcp_toolsets=true, drop version bump Follow-up on helix4u's PR #14211: - Flip default to true: narrowing toolsets=['web','browser'] expresses 'I want these extras', not 'silently strip MCP'. Parent MCP tools (registered at runtime) should survive narrowing by default. - Drop _config_version bump (22->23); additive nested key under delegation.* is handled by _deep_merge, no migration needed. - Update tests to reflect new default behavior.	2026-04-22 17:45:48 -07:00
helix4u	3e96c87f37	fix(delegate): make MCP toolset inheritance configurable	2026-04-22 17:45:48 -07:00
Teknium	d74eaef5f9	fix(error_classifier): retry mid-stream SSL/TLS alert errors as transport Mid-stream SSL alerts (bad_record_mac, tls_alert_internal_error, handshake failures) previously fell through the classifier pipeline to the 'unknown' bucket because: - ssl.SSLError type names weren't in _TRANSPORT_ERROR_TYPES (the isinstance(OSError) catch picks up some but not all SDK-wrapped forms) - the message-pattern list had no SSL alert substrings The 'unknown' bucket is still retryable, but: (a) logs tell the user 'unknown' instead of identifying the cause, (b) it bypasses the transport-specific backoff/fallback logic, and (c) if the SSL error happens on a large session with a generic 'connection closed' wrapper, the existing disconnect-on-large-session heuristic would incorrectly trigger context compression — expensive, and never fixes a transport hiccup. Changes: - Add ssl.SSLError and its subclass type names to _TRANSPORT_ERROR_TYPES - New _SSL_TRANSIENT_PATTERNS list (separate from _SERVER_DISCONNECT_PATTERNS so SSL alerts route to timeout, not context_overflow+compress) - New step 5 in the classifier pipeline: SSL pattern check runs BEFORE the disconnect check to pre-empt the large-session-compress path Patterns cover both space-separated ('ssl alert', 'bad record mac') and underscore-separated ('ERR_SSL_SSL/TLS_ALERT_BAD_RECORD_MAC') forms. This is load-bearing because OpenSSL 3.x changed the error-code separator from underscore to slash (e.g. SSLV3_ALERT_BAD_RECORD_MAC → SSL/TLS_ALERT_BAD_RECORD_MAC) and will likely churn again — matching on stable alert reason substrings survives future format changes. Tests (8 new): - BAD_RECORD_MAC in Python ssl.c format - OpenSSL 3.x underscore format - TLSV1_ALERT_INTERNAL_ERROR - ssl handshake failure - [SSL: ...] prefix fallback - Real ssl.SSLError instance - REGRESSION GUARD: SSL on large session does NOT compress - REGRESSION GUARD: plain disconnect on large session STILL compresses	2026-04-22 17:44:50 -07:00
Teknium	b9463e32c6	fix(usage): read top-level Anthropic cache fields from OAI-compatible proxies Port from cline/cline#10266. When OpenAI-compatible proxies (OpenRouter, Vercel AI Gateway, Cline) route Claude models, they sometimes surface the Anthropic-native cache counters (`cache_read_input_tokens`, `cache_creation_input_tokens`) at the top level of the `usage` object instead of nesting them inside `prompt_tokens_details`. Our chat-completions branch of `normalize_usage()` only read the nested `prompt_tokens_details` fields, so those responses: - reported `cache_write_tokens = 0` even when the model actually did a prompt-cache write, - reported only some of the cache-read tokens when the proxy exposed them top-level only, - overstated `input_tokens` by the missed cache-write amount, which in turn made cost estimation and the status-bar cache-hit percentage wrong for Claude traffic going through these gateways. Now the chat-completions branch tries the OpenAI-standard `prompt_tokens_details` first and falls back to the top-level Anthropic-shape fields only if the nested values are absent/zero. The Anthropic and Codex Responses branches are unchanged. Regression guards added for three shapes: top-level write + nested read, top-level-only, and both-present (nested wins).	2026-04-22 17:40:49 -07:00
Teknium	9eb543cafe	feat(/model): merge models.dev entries for lesser-loved providers (#14221 ) New and newer models from models.dev now surface automatically in /model (both hermes model CLI and the gateway Telegram/Discord picker) for a curated set of secondary providers — no Hermes release required when the registry publishes a new model. Primary user-visible fix: on OpenCode Go, typing '/model mimo-v2.5-pro' no longer silently fuzzy-corrects to 'mimo-v2-pro'. The exact match against the merged models.dev catalog wins. Scope (opt-in frozenset _MODELS_DEV_PREFERRED in hermes_cli/models.py): opencode-go, opencode-zen, deepseek, kilocode, fireworks, mistral, togetherai, cohere, perplexity, groq, nvidia, huggingface, zai, gemini, google. Explicitly NOT merged: - openrouter and nous (never): curated list is already a hand-picked subset / Portal is source of truth. - xai, xiaomi, minimax, minimax-cn, kimi-coding, kimi-coding-cn, alibaba, qwen-oauth (per-project decision to keep curated-only). - providers with dedicated live-endpoint paths (copilot, anthropic, ai-gateway, ollama-cloud, custom, stepfun, openai-codex) — those paths already handle freshness themselves. Changes: - hermes_cli/models.py: add _MODELS_DEV_PREFERRED + _merge_with_models_dev helper. provider_model_ids() branches on the set at its curated-fallback return. Merge is models.dev-first, curated-only extras appended, case-insensitive dedup, graceful fallback when models.dev is offline. - hermes_cli/model_switch.py: list_authenticated_providers() calls the same merge in both its code paths (PROVIDER_TO_MODELS_DEV loop + HERMES_OVERLAYS loop). Picker AND validation-fallback both see fresh entries. - tests/hermes_cli/test_models_dev_preferred_merge.py (new): 13 tests — merge-helper unit tests (empty/raise/order/dedup), opencode-go/zen behavior, openrouter+nous explicitly guarded from merge. - tests/hermes_cli/test_opencode_go_in_model_list.py: converted from snapshot-style assertion to a behavior-based floor check, so it doesn't break when models.dev publishes additional opencode-go entries. Addresses a report from @pfanis via Telegram: newer Xiaomi variants on OpenCode Go weren't appearing in the /model picker, and /model was silently routing requests for new variants to older ones.	2026-04-22 17:33:42 -07:00
Teknium	402d048eb6	fix(gateway): also unlink stale PID + lock files on cleanup Follow-up for salvaged PR #14179. `_cleanup_invalid_pid_path` previously called `remove_pid_file()` for the default PID path, but that helper defensively refuses to delete a PID file whose pid field differs from `os.getpid()` (to protect --replace handoffs). Every realistic stale-PID scenario is exactly that case: a crashed/Ctrl+C'd gateway left behind a PID file owned by a now-dead foreign PID. Once `get_running_pid()` has confirmed the runtime lock is inactive, the on-disk metadata is known to belong to a dead process, so we can force-unlink both the PID file and the sibling `gateway.lock` directly instead of going through the defensive helper. Also adds a regression test with a dead foreign PID that would have failed against the previous cleanup logic.	2026-04-22 16:33:46 -07:00
helix4u	b52123eb15	fix(gateway): recover stale pid and planned restart state	2026-04-22 16:33:46 -07:00
Teknium	51ca575994	feat(gateway): expose plugin slash commands natively on all platforms + decision-capable command hook Plugin slash commands now surface as first-class commands in every gateway enumerator — Discord native slash picker, Telegram BotCommand menu, Slack /hermes subcommand map — without a separate per-platform plugin API. The existing 'command:<name>' gateway hook gains a decision protocol via HookRegistry.emit_collect(): handlers that return a dict with {'decision': 'deny'\|'handled'\|'rewrite'\|'allow'} can intercept slash command dispatch before core handling runs, unifying what would otherwise have been a parallel 'pre_gateway_command' hook surface. Changes: - gateway/hooks.py: add HookRegistry.emit_collect() that fires the same handler set as emit() but collects non-None return values. Backward compatible — fire-and-forget telemetry hooks still work via emit(). - hermes_cli/plugins.py: add optional 'args_hint' param to register_command() so plugins can opt into argument-aware native UI registration (Discord arg picker, future platforms). - hermes_cli/commands.py: add _iter_plugin_command_entries() helper and merge plugin commands into telegram_bot_commands() and slack_subcommand_map(). New is_gateway_known_command() recognizes both built-in and plugin commands so the gateway hook fires for either. - gateway/platforms/discord.py: extract _build_auto_slash_command helper from the COMMAND_REGISTRY auto-register loop and reuse it for plugin-registered commands. Built-in name conflicts are skipped. - gateway/run.py: before normal slash dispatch, call emit_collect on command:<canonical> and honor deny/handled/rewrite/allow decisions. Hook now fires for plugin commands too. - scripts/release.py: AUTHOR_MAP entry for @Magaav. - Tests: emit_collect semantics, plugin command surfacing per platform, decision protocol (deny/handled/rewrite/allow + non-dict tolerance), Discord plugin auto-registration + conflict skipping, is_gateway_known_command. Salvaged from #14131 (@Magaav). Original PR added a parallel 'pre_gateway_command' hook and a platform-keyed plugin command registry; this re-implementation reuses the existing 'command:<name>' hook and treats plugin commands as platform-agnostic so the same capability reaches Telegram and Slack without new API surface. Co-authored-by: Magaav <73175452+Magaav@users.noreply.github.com>	2026-04-22 16:23:21 -07:00
brooklyn!	a1d57292af	Merge pull request #14145 from NousResearch/bb/tui-polish fix(tui): input wrap, shift-tab yolo, statusline, clean boot	2026-04-22 16:48:37 -05:00
Yukipukii1	1e8254e599	fix(agent): guard context compressor against structured message content	2026-04-22 14:46:51 -07:00
ismell0992-afk	6513138f26	fix(agent): recognize Tailscale CGNAT (100.64.0.0/10) as local for Ollama timeouts `is_local_endpoint()` leaned on `ipaddress.is_private`, which classifies RFC-1918 ranges and link-local as private but deliberately excludes the RFC 6598 CGNAT block (100.64.0.0/10) — the range Tailscale uses for its mesh IPs. As a result, Ollama reached over Tailscale (e.g. `http://100.77.243.5:11434`) was treated as remote and missed the automatic stream-read / stale-stream timeout bumps, so cold model load plus long prefill would trip the 300 s watchdog before the first token. Add a module-level `_TAILSCALE_CGNAT = ipaddress.IPv4Network("100.64.0.0/10")` (built once) and extend `is_local_endpoint()` to match the block both via the parsed-`IPv4Address` path and the existing bare-string fallback (for symmetry with the 10/172/192 checks). Also hoist the previously function-local `import ipaddress` to module scope now that it's used by the constant. Extend `TestIsLocalEndpoint` with a CGNAT positive set (lower bound, representative host, MagicDNS anchor, upper bound) and a near-miss negative set (just below 100.64.0.0, just above 100.127.255.255, well outside the block, and first-octet-wrong).	2026-04-22 14:46:10 -07:00
Yukipukii1	44a16c5d9d	guard terminal_tool import-time env parsing	2026-04-22 14:45:50 -07:00
Roy-oss1	e86acad8f1	feat(feishu): preserve @mention context on inbound messages Resolve Feishu @_user_N / @_all placeholders into display names plus a structured [Mentioned: Name (open_id=...), ...] hint so agents can both reason about who was mentioned and call Feishu OpenAPI tools with stable open_ids. Strip bot self-mentions only at message edges (leading unconditionally, trailing only before whitespace/terminal punctuation) so commands parse cleanly while mid-text references are preserved. Covers both plain-text and rich-post payloads. Also fixes a pre-existing hydration bug: Client.request no longer accepts the 'method' kwarg on lark-oapi 1.5.3, so bot identity silently failed to hydrate and self-filtering never worked. Migrate to the BaseRequest.builder() pattern and accept the 'app_name' field the API actually returns. Tighten identity matching precedence so open_id is authoritative when present on both sides.	2026-04-22 14:44:07 -07:00
LeonSGP43	4ac1c959b2	fix(agent): resolve fallback provider key_env secrets	2026-04-22 14:42:48 -07:00
Aslaaen	76c454914a	fix(core): ensure non-blocking executor shutdown on async timeout	2026-04-22 14:42:32 -07:00
kshitijk4poor	d6ed35d047	feat(security): add global toggle to allow private/internal URL resolution Adds security.allow_private_urls / HERMES_ALLOW_PRIVATE_URLS toggle so users on OpenWrt routers, TUN-mode proxies (Clash/Mihomo/Sing-box), corporate split-tunnel VPNs, and Tailscale networks — where DNS resolves public domains to 198.18.0.0/15 or 100.64.0.0/10 — can use web_extract, browser, vision URL fetching, and gateway media downloads. Single toggle in tools/url_safety.py; all 23 is_safe_url() call sites inherit automatically. Cached for process lifetime. Cloud metadata endpoints stay ALWAYS blocked regardless of the toggle: 169.254.169.254 (AWS/GCP/Azure/DO/Oracle), 169.254.170.2 (AWS ECS task IAM creds), 169.254.169.253 (Azure IMDS wire server), 100.100.100.200 (Alibaba), fd00:ec2::254 (AWS IPv6), the entire 169.254.0.0/16 link-local range, and the metadata.google.internal / metadata.goog hostnames (checked pre-DNS so they can't be bypassed on networks where those names resolve to local IPs). Supersedes #3779 (narrower HERMES_ALLOW_RFC2544 for the same class of users). Co-authored-by: kshitijk4poor <82637225+kshitijk4poor@users.noreply.github.com>	2026-04-22 14:38:59 -07:00
bobashopcashier	b49a1b71a7	fix(agent): accept empty content with stop_reason=end_turn as valid anthropic response Anthropic's API can legitimately return content=[] with stop_reason="end_turn" when the model has nothing more to add after a turn that already delivered the user-facing text alongside a trivial tool call (e.g. memory write). The transport validator was treating that as an invalid response, triggering 3 retries that each returned the same valid-but-empty response, then failing the run with "Invalid API response after 3 retries." The downstream normalizer already handles empty content correctly (empty loop over response.content, content=None, finish_reason="stop"), so the only fix needed is at the validator boundary. Tests: - Empty content + stop_reason="end_turn" → valid (the fix) - Empty content + stop_reason="tool_use" → still invalid (regression guard) - Empty content without stop_reason → still invalid (existing behavior preserved)	2026-04-22 14:26:23 -07:00
Teknium	ea67e49574	fix(streaming): silent retry when stream dies mid tool-call (#14151 ) When the streaming connection dropped AFTER user-visible text was delivered but a tool call was in flight, we stubbed the turn with a '⚠ Stream stalled mid tool-call; Ask me to retry' warning — costing an iteration and breaking the flow. Users report this happening increasingly often on long SSE streams through flaky provider routes. Fix: in the existing inner stream-retry loop, relax the deltas_were_sent short-circuit. If a tool call was in flight (partial_tool_names populated) AND the error is a transient connection error (timeout, RemoteProtocolError, SSE 'connection lost', etc.), silently retry instead of bailing out. Fire a brief 'Connection dropped mid tool-call; reconnecting…' marker so the user understands the preamble is about to be re-streamed. Researched how Claude Code (tombstone + non-streaming fallback), OpenCode (blind Effect.retry wrapping whole stream), and Clawdbot (4-way gate: stopReason==error + output==0 + !hadPotentialSideEffects) handle this. Chose the narrow Clawdbot-style gate: retry only when (a) a tool call was actually in flight (otherwise the existing stub-with-recovered-text is correct for pure-text stalls) and (b) the error is transient. Side-effect safety is automatic — no tool has been dispatched within this single API call yet. UX trade-off: user sees preamble text twice on retry (OpenCode-style). Strictly better than a lost action with a 'retry manually' message. If retries exhaust, falls through to the existing stub-with-warning path so the user isn't left with zero signal. Tests: 3 new tests in TestSilentRetryMidToolCall covering (1) silent retry recovers tool call; (2) exhausted retries fall back to stub; (3) text-only stalls don't trigger retry. 30/30 pass.	2026-04-22 13:47:33 -07:00
Brooklyn Nicholson	b641639e42	fix(debug): distinguish empty-log from missing-log in report placeholder Copilot on #14138 flagged that the share report says '(file not found)' when the log exists but is empty (either because the primary is empty and no .1 rotation exists, or in the rare race where the file is truncated between _resolve_log_path() and stat()). - Split _primary_log_path() out of _resolve_log_path so both can share the LOG_FILES/home math without duplication. - _capture_log_snapshot now reports '(file empty)' when the primary path exists on disk with zero bytes, and keeps '(file not found)' for the truly-missing case. Tests: rename test_returns_none_for_empty → test_empty_primary_reports_file_empty with the new assertion, plus a race-path test that monkeypatches _resolve_log_path to exercise the size==0 branch directly.	2026-04-22 15:27:54 -05:00
Brooklyn Nicholson	6fb98f343a	fix(tui): address copilot review on #14103 - normalizeStatusBar: trim/lowercase + 'on' → 'top' alias so user-edited YAML variants (Top, " bottom ", on) coerce correctly - shift-tab yolo: no-op with sys note when no live session; success-gated echo and catch fallback so RPC failures don't report as 'yolo off' - tui_gateway config.set/get statusbar: isinstance(display, dict) guards mirroring the compact branch so a malformed display scalar in config.yaml can't raise Tests: +1 vitest for trim/case/on, +2 pytest for non-dict display survival.	2026-04-22 15:27:54 -05:00

1 2 3 4 5 ...

2354 commits