hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-25 00:51:20 +00:00

Author	SHA1	Message	Date
0xbyt4	98418afd5d	fix(tui): break TTS→STT feedback loop + colorize REC badge TTS feedback loop (hermes_cli/voice.py) The VAD loop kept the microphone live while speak_text played the agent's reply over the speakers, so the reply itself was picked up, transcribed, and submitted — the agent then replied to its own echo ("Ha, looks like we're in a loop"). Ported cli.py:_voice_tts_done synchronisation: - _tts_playing: threading.Event (initially set = "not playing"). - speak_text cancels the active recorder before opening the speakers, clears _tts_playing, and on exit waits 300 ms before re-starting the recorder — long enough for the OS audio device to settle so afplay and sounddevice don't race for it. - _continuous_on_silence now waits on _tts_playing (up to 60 s) before re-arming the mic with another 300 ms gap, mirroring cli.py:10619-10621. If the user flips voice off during the wait the loop exits cleanly instead of fighting for the device. Without both halves the loop races: if the silence callback fires before TTS starts it re-arms immediately; if TTS is already playing the pause-and-resume path catches it. Red REC badge (ui-tui appChrome + useMainApp) Classic CLI (cli.py:_get_voice_status_fragments) renders "● REC" in red and "◉ STT" in amber. TUI was showing a dim "REC" with no dot, making it hard to spot at a glance. voiceLabel now emits the same glyphs and appChrome colours them via t.color.error / t.color.warn, falling back to dim for the idle label.	2026-04-23 16:18:15 -07:00
0xbyt4	42ff785771	fix(tui): voice TTS speak-back + transcript-key bug + auto-submit Three issues surfaced during end-to-end testing of the CLI-parity voice loop and are fixed together because they all blocked "speak → agent responds → TTS reads it back" from working at all: 1. Wrong result key (hermes_cli/voice.py) transcribe_recording() returns {"success": bool, "transcript": str}, matching cli.py:_voice_stop_and_transcribe. The wrapper was reading result.get("text"), which is None, so every successful Groq / local STT response was thrown away and the 3-strikes halt fired after three silent-looking cycles. Fixed by reading "transcript" and also honouring "success" like the CLI does. Updated the loop simulation tests to return the correct shape. 2. TTS speak-back was missing (tui_gateway/server.py + hermes_cli/voice.py) The TUI had a voice.toggle "tts" subcommand but nothing downstream actually read the flag — agent replies never spoke. Mirrored cli.py:8747-8754's dispatch: on message.complete with status == "complete", if _voice_tts_enabled() is true, spawn a daemon thread running speak_text(response). Rewrote speak_text as a full port of cli.py:_voice_speak_response — same markdown-strip regex pipeline (code blocks, links, bold/italic, inline code, headers, list bullets, horizontal rules, excessive newlines), same 4000-char cap, same explicit mp3 output path, same MP3-over-OGG playback choice (afplay misbehaves on OGG), same cleanup of both extensions. Keeps TUI TTS audible output byte-for-byte identical to the classic CLI. 3. Auto-submit swallowed on non-empty composer (createGatewayEventHandler.ts) The voice.transcript handler branched on prev input via a setInput updater and fired submitRef.current inside the updater when prev was empty. React strict mode double-invokes state updaters, which would queue the submit twice; and when the composer had any content the transcript was merely appended — the agent never saw it. CLI _pending_input.put(transcript) unconditionally feeds the transcript as the next turn, so match that: always clear the composer and setTimeout(() => submitRef.current(text), 0) outside any updater. Side effect can't run twice this way, and a half-typed draft on the rare occasion is a fair trade vs. silently dropping the turn. Also added peak_rms to the rec.stop debug line so "recording too quiet" is diagnosable at a glance when HERMES_VOICE_DEBUG=1.	2026-04-23 16:18:15 -07:00
0xbyt4	04c489b587	feat(tui): match CLI's voice slash + VAD-continuous recording model The TUI had drifted from the CLI's voice model in two ways: - /voice on was lighting up the microphone immediately and Ctrl+B was interpreted as a mode toggle. The CLI separates the two: /voice on just flips the umbrella bit, recording only starts once the user presses Ctrl+B, which also sets _voice_continuous so the VAD loop auto-restarts until the user presses Ctrl+B again or three silent cycles pass. - /voice tts was missing entirely, so users couldn't turn agent reply speech on/off from inside the TUI. This commit brings the TUI to parity. Python - hermes_cli/voice.py: continuous-mode API (start_continuous, stop_continuous, is_continuous_active) layered on the existing PTT wrappers. The silence callback transcribes, fires on_transcript, tracks consecutive no-speech cycles, and auto-restarts — mirroring cli.py:_voice_stop_and_transcribe + _restart_recording. - tui_gateway/server.py: - voice.toggle now supports on / off / tts / status. The umbrella bit lives in HERMES_VOICE + display.voice_enabled; tts lives in HERMES_VOICE_TTS + display.voice_tts. /voice off also tears down any active continuous loop so a toggle-off really releases the microphone. - voice.record start/stop now drives start_continuous/stop_continuous. start is refused with a clear error when the mode is off, matching cli.py:handle_voice_record's early return on `not _voice_mode`. - New voice.transcript / voice.status events emit through _voice_emit (remembers the sid that last enabled the mode so events land in the right session). TypeScript - gatewayTypes.ts: voice.status + voice.transcript event discriminants; VoiceToggleResponse gains tts; VoiceRecordResponse gains status for the new "started/stopped" responses. - interfaces.ts: GatewayEventHandlerContext gains composer.setInput + submission.submitRef + voice.{setRecording, setProcessing, setVoiceEnabled}; InputHandlerContext.voice gains enabled + setVoiceEnabled for the mode-aware Ctrl+B handler. - createGatewayEventHandler.ts: voice.status drives REC/STT badges; voice.transcript auto-submits when the composer is empty (CLI _pending_input.put parity) and appends when a draft is in flight. no_speech_limit flips voice off + sys line. - useInputHandlers.ts: Ctrl+B now calls voice.record (start/stop), not voice.toggle, and nudges the user with a sys line when the mode is off instead of silently flipping it on. - useMainApp.ts: wires the new event-handler context fields. - slash/commands/session.ts: /voice handles on / off / tts / status with CLI-matching output ("voice: mode on · tts off"). Backward compat preserved for voice.record (was always PTT shape; gateway still honours start/stop with mode-gating added).	2026-04-23 16:18:15 -07:00
0xbyt4	0bb460b070	fix(tui): add missing hermes_cli.voice wrapper for gateway RPC tui_gateway/server.py:3486/3491/3509 imports start_recording, stop_and_transcribe, and speak_text from hermes_cli.voice, but the module never existed (not in git history — never shipped, never deleted). Every voice.record / voice.tts RPC call hit the ImportError branch and the TUI surfaced it as "voice module not available — install audio dependencies" even on boxes with sounddevice / faster-whisper / numpy installed. Adds a thin wrapper on top of tools.voice_mode (recording + transcription) and tools.tts_tool (text-to-speech): - start_recording() — idempotent; stores the active AudioRecorder in a module-global guarded by a Lock so repeat Ctrl+B presses don't fight over the mic. - stop_and_transcribe() — returns None for no-op / no-speech / Whisper-hallucination cases so the TUI's existing "no speech detected" path keeps working unchanged. - speak_text(text) — lazily imports tts_tool (optional provider SDKs stay unloaded until the first /voice tts call), parses the tool's JSON result, and plays the audio via play_audio_file. Paired with the Ctrl+B keybinding fix in the prior commit, the TUI voice pipeline now works end-to-end for the first time.	2026-04-23 16:18:15 -07:00
Teknium	50d97edbe1	feat(delegation): bump default child_timeout_seconds to 600s (#14809 ) The 300s default was too tight for high-reasoning models on non-trivial delegated tasks — e.g. gpt-5.5 xhigh reviewing 12 files would burn >5min on reasoning tokens before issuing its first tool call, tripping the hard wall-clock timeout with 0 api_calls logged. - tools/delegate_tool.py: DEFAULT_CHILD_TIMEOUT 300 -> 600 - hermes_cli/config.py: surface delegation.child_timeout_seconds in DEFAULT_CONFIG so it's discoverable (previously the key was read by _get_child_timeout() but absent from the default config schema) Users can still override via config.yaml delegation.child_timeout_seconds or DELEGATION_CHILD_TIMEOUT_SECONDS env var (floor 30s, no ceiling).	2026-04-23 16:14:55 -07:00
Teknium	ef5eaf8d87	feat(cron): honor `hermes tools` config for the cron platform (#14798 ) Cron now resolves its toolset from the same per-platform config the gateway uses — `_get_platform_tools(cfg, 'cron')` — instead of blindly loading every default toolset. Existing cron jobs without a per-job override automatically lose `moa`, `homeassistant`, and `rl` (the `_DEFAULT_OFF_TOOLSETS` set), which stops the "surprise $4.63 mixture_of_agents run" class of bug (Norbert, Discord). Precedence inside `run_job`: 1. per-job `enabled_toolsets` (PR #14767 / #6130) — wins if set 2. `_get_platform_tools(cfg, 'cron')` — new, the blanket gate 3. `None` fallback (legacy) — only on resolver exception Changes: - hermes_cli/platforms.py: register 'cron' with default_toolset 'hermes-cron' - toolsets.py: add 'hermes-cron' toolset (mirrors 'hermes-cli'; `_get_platform_tools` then filters via `_DEFAULT_OFF_TOOLSETS`) - cron/scheduler.py: add `_resolve_cron_enabled_toolsets(job, cfg)`, call it at the `AIAgent(...)` kwargs site - tests/cron/test_scheduler.py: replace the 'None when not set' test (outdated contract) with an invariant ('moa not in default cron toolset') + new per-job-wins precedence test - tests/hermes_cli/test_tools_config.py: mark 'cron' as non-messaging in the gateway-toolset-coverage test	2026-04-23 15:48:50 -07:00
Teknium	bf196a3fc0	chore: release v0.11.0 (2026.4.23) (#14791 ) The Interface release — new Ink-based TUI, pluggable transport architecture, native AWS Bedrock, five new inference paths (NVIDIA NIM, Arcee, Step Plan, Gemini CLI OAuth, ai-gateway), GPT-5.5 via Codex OAuth, QQBot (17th platform), expanded plugin surface, dashboard plugin system + live theme switching, /steer mid-run nudges, shell hooks, webhook direct-delivery, smarter delegation, and auxiliary models config UI. Also folds in the v0.10.0 deferred batch (v0.10.0 shipped only the Nous Tool Gateway). 1,556 commits · 761 PRs · 290 contributors since v0.9.0.	2026-04-23 15:31:59 -07:00
Teknium	f593c367be	feat(dashboard): reskin extension points for themes and plugins (#14776 ) Themes and plugins can now pull off arbitrary dashboard reskins (cockpit HUD, retro terminal, etc.) without touching core code. Themes gain four new fields: - layoutVariant: standard \| cockpit \| tiled — shell layout selector - assets: {bg, hero, logo, crest, sidebar, header, custom: {...}} — artwork URLs exposed as --theme-asset-* CSS vars - customCSS: raw CSS injected as a scoped <style> tag on theme apply (32 KiB cap, cleaned up on theme switch) - componentStyles: per-component CSS-var overrides (clipPath, borderImage, background, boxShadow, ...) for card/header/sidebar/ backdrop/tab/progress/badge/footer/page Plugin manifests gain three new fields: - tab.override: replaces a built-in route instead of adding a tab - tab.hidden: register component + slots without adding a nav entry - slots: declares shell slots the plugin populates 10 named shell slots: backdrop, header-left/right/banner, sidebar, pre-main, post-main, footer-left/right, overlay. Plugins register via window.__HERMES_PLUGINS__.registerSlot(name, slot, Component). A <PluginSlot> React helper is exported on the plugin SDK. Ships a full demo at plugins/strike-freedom-cockpit/ — theme YAML + slot-only plugin that reproduces a Gundam cockpit dashboard: MS-STATUS sidebar with live telemetry, COMPASS crest in header, notched card corners via componentStyles, scanline overlay via customCSS, gold/cyan palette, Orbitron typography. Validation: - 15 new tests in test_web_server.py covering every extended field - tests/hermes_cli/: 2615 passed (3 pre-existing unrelated failures) - tsc -b --noEmit: clean - vite build: 418 kB bundle, ~2 kB delta for slots/theme extensions Co-authored-by: Teknium <p@nousresearch.com>	2026-04-23 15:31:01 -07:00
sprmn24	78d1e252fa	fix(web_server): guard GATEWAY_HEALTH_TIMEOUT against invalid env values float(os.getenv(...)) at module level raises ValueError on any non-numeric value, crashing the web server at import before it starts. Wrap in try/except with a warning log and fallback to 3.0s.	2026-04-23 15:07:25 -07:00
helix4u	1cc0bdd5f3	fix(dashboard): avoid auth header collision with reverse proxies	2026-04-23 14:05:23 -07:00
Teknium	97b9b3d6a6	fix(gateway): drain-aware hermes update + faster still-working pings (#14736 ) cmd_update no longer SIGKILLs in-flight agent runs, and users get 'still working' status every 3 min instead of 10. Two long-standing sources of '@user — agent gives up mid-task' reports on Telegram and other gateways. Drain-aware update: - New helper hermes_cli.gateway._graceful_restart_via_sigusr1(pid, drain_timeout) sends SIGUSR1 to the gateway and polls os.kill(pid, 0) until the process exits or the budget expires. - cmd_update's systemd loop now reads MainPID via 'systemctl show --property=MainPID --value' and tries the graceful path first. The gateway's existing SIGUSR1 handler -> request_restart(via_service= True) -> drain -> exit(75) is wired in gateway/run.py and is respawned by systemd's Restart=on-failure (and the explicit RestartForceExitStatus=75 on newer units). - Falls back to 'systemctl restart' when MainPID is unknown, the drain budget elapses, or the unit doesn't respawn after exit (older units missing Restart=on-failure). Old install behavior preserved. - Drain budget = max(restart_drain_timeout, 30s) + 15s margin so the drain loop in run_agent + final exit have room before fallback fires. Composes with #14728's tool-subprocess reaping. Notification interval: - agent.gateway_notify_interval default 600 -> 180. - HERMES_AGENT_NOTIFY_INTERVAL env-var fallback in gateway/run.py matched. - 9-minute weak-model spinning runs now ping at 3 min and 6 min instead of 27 seconds before completion, removing the 'is the bot dead?' reflex that drives gateway-restart cycles. Tests: - Two new tests in tests/hermes_cli/test_update_gateway_restart.py: one asserts SIGUSR1 is sent and 'systemctl restart' is NOT called when MainPID is known and the helper succeeds; one asserts the fallback fires when the helper returns False. - E2E: spawned detached bash processes confirm the helper returns True on SIGUSR1-handling exit (~0.5s) and False on SIGUSR1-ignoring processes (timeout). Verified non-existent PID and pid=0 edge cases. - 41/41 in test_update_gateway_restart.py (was 39, +2 new). - 154/154 in shutdown-related suites including #14728's new tests. Reported by @GeoffWellman and @ANT_1515 on X.	2026-04-23 14:01:57 -07:00
Teknium	165b2e481a	feat(agent): make API retry count configurable via agent.api_max_retries (#14730 ) Closes #11616. The agent's API retry loop hardcoded max_retries = 3, so users with fallback providers on flaky primaries burned through ~3 × provider timeout (e.g. 3 × 180s = 9 minutes) before their fallback chain got a chance to kick in. Expose a new config key: agent: api_max_retries: 3 # default unchanged Set it to 1 for fast failover when you have fallback providers, or raise it if you prefer longer tolerance on a single provider. Values < 1 are clamped to 1 (single attempt, no retry); non-integer values fall back to the default. This wraps the Hermes-level retry loop only — the OpenAI SDK's own low-level retries (max_retries=2 default) still run beneath this for transient network errors. Changes: - hermes_cli/config.py: add agent.api_max_retries default 3 with comment. - run_agent.py: read self._api_max_retries in AIAgent.__init__; replace hardcoded max_retries = 3 in the retry loop with self._api_max_retries. - cli-config.yaml.example: documented example entry. - hermes_cli/tips.py: discoverable tip line. - tests/run_agent/test_api_max_retries_config.py: 4 tests covering default, override, clamp-to-one, and invalid-value fallback.	2026-04-23 13:59:32 -07:00
Teknium	327b57da91	fix(gateway): kill tool subprocesses before adapter disconnect on drain timeout (#14728 ) Closes #8202. Root cause: stop() reclaimed tool-call bash/sleep children only at the very end of the shutdown sequence — after a 60s drain, 5s interrupt grace, and per-adapter disconnect. Under systemd (TimeoutStopSec bounded by drain_timeout), that meant the cgroup SIGKILL escalation fired first, and systemd reaped the bash/sleep children instead of us. Fix: - Extract tool-subprocess cleanup into a local helper _kill_tool_subprocesses() in _stop_impl(). - Invoke it eagerly right after _interrupt_running_agents() on the drain-timeout path, before adapter disconnect. - Keep the existing catch-all call at the end for the graceful path and defense in depth against mid-teardown respawns. - Bump generated systemd unit TimeoutStopSec to drain_timeout + 30s so cleanup + disconnect + DB close has headroom above the drain budget, matching the 'subprocess timeout > TimeoutStopSec + margin' rule from the skill. Tests: - New: test_gateway_stop_kills_tool_subprocesses_before_adapter_disconnect_on_timeout asserts kill_all() runs before disconnect() when drain times out. - New: test_gateway_stop_kills_tool_subprocesses_on_graceful_path guards that the final catch-all still fires when drain succeeds (regression guard against accidental removal during refactor). - Updated: existing systemd unit generator tests expect TimeoutStopSec=90 (= 60s drain + 30s headroom) with explanatory comment.	2026-04-23 13:59:29 -07:00
Teknium	b5333abc30	fix(auth): refuse to touch real auth.json during pytest; delete sandbox-escaping test (#14729 ) A test in tests/agent/test_credential_pool.py (test_try_refresh_current_updates_only_current_entry) monkeypatched refresh_codex_oauth_pure() to return the literal fixture strings 'access-new'/'refresh-new', then executed the real production code path in agent/credential_pool.py::try_refresh_current which calls _sync_device_code_entry_to_auth_store → _save_provider_state → writes to `providers.openai-codex.tokens`. That writer resolves the target via get_hermes_home()/auth.json. If the test ran with HERMES_HOME unset (direct pytest invocation, IDE runner bypassing conftest discovery, or any other sandbox escape), it would overwrite the real user's auth store with the fixture strings. Observed in the wild: Teknium's ~/.hermes/auth.json providers.openai-codex.tokens held 'access-new'/'refresh-new' for five days. His CLI kept working because the credential_pool entries still held real JWTs, but `hermes model`'s live discovery path (which reads via resolve_codex_runtime_credentials → _read_codex_tokens → providers.tokens) was silently 401-ing. Fixes: - Delete test_try_refresh_current_updates_only_current_entry. It was the only test that exercised a writer hitting providers.openai-codex.tokens with literal stub tokens. The entry-level rotation behavior it asserted is still covered by test_mark_exhausted_and_rotate_persists_status above. - Add a seat belt in hermes_cli.auth._auth_file_path(): if PYTEST_CURRENT_TEST is set AND the resolved path equals the real ~/.hermes/auth.json, raise with a clear message. In production (no PYTEST_CURRENT_TEST), a single dict lookup. Any future test that forgets to monkeypatch HERMES_HOME fails loudly instead of corrupting the user's credentials. Validation: - production (no PYTEST_CURRENT_TEST): returns real path, unchanged behavior - pytest + HERMES_HOME unset (points at real home): raises with message - pytest + HERMES_HOME=/tmp/...: returns tmp path, tests pass normally	2026-04-23 13:50:21 -07:00
Teknium	255ba5bf26	feat(dashboard): expand themes to fonts, layout, density (#14725 ) Dashboard themes now control typography and layout, not just colors. Each built-in theme picks its own fonts, base size, radius, and density so switching produces visible changes beyond hue. Schema additions (per theme): - typography — fontSans, fontMono, fontDisplay, fontUrl, baseSize, lineHeight, letterSpacing. fontUrl is injected as <link> on switch so Google/Bunny/self-hosted stylesheets all work. - layout — radius (any CSS length) and density (compact \| comfortable \| spacious, multiplies Tailwind spacing). - colorOverrides (optional) — pin individual shadcn tokens that would otherwise derive from the palette. Built-in themes are now distinct beyond palette: - default — system stack, 15px, 0.5rem radius, comfortable - midnight — Inter + JetBrains Mono, 14px, 0.75rem, comfortable - ember — Spectral (serif) + IBM Plex Mono, 15px, 0.25rem - mono — IBM Plex Sans + Mono, 13px, 0 radius, compact - cyberpunk— Share Tech Mono everywhere, 14px, 0 radius, compact - rose — Fraunces (serif) + DM Mono, 16px, 1rem, spacious Also fixes two bugs: 1. Custom user themes silently fell back to default. ThemeProvider only applied BUILTIN_THEMES[name], so YAML files in ~/.hermes/dashboard-themes/ showed in the picker but did nothing. Server now ships the full normalised definition; client applies it. 2. Docs documented a 21-token flat colors schema that never matched the code (applyPalette reads a 3-layer palette). Rewrote the Themes section against the actual shape. Implementation: - web/src/themes/types.ts: extend DashboardTheme with typography, layout, colorOverrides; ThemeListEntry carries optional definition. - web/src/themes/presets.ts: 6 built-ins with distinct typography+layout. - web/src/themes/context.tsx: applyTheme() writes palette+typography+ layout+overrides as CSS vars, injects fontUrl stylesheet, fixes the fallback-to-default bug via resolveTheme(name). - web/src/index.css: html/body/code read the new theme-font vars; --radius-sm/md/lg/xl derive from --theme-radius; --spacing scales with --theme-spacing-mul so Tailwind utilities shift with density. - hermes_cli/web_server.py: _normalise_theme_definition() parses loose YAML (bare hex strings, partial blocks) into the canonical wire shape; /api/dashboard/themes ships full definitions for user themes. - tests/hermes_cli/test_web_server.py: 16 new tests covering the normaliser and discovery (rejection cases, clamping, defaults). - website/docs/user-guide/features/web-dashboard.md: rewrite Themes section with real schema, per-model tables, full YAML example.	2026-04-23 13:49:51 -07:00
Teknium	8f5fee3e3e	feat(codex): add gpt-5.5 and wire live model discovery into picker (#14720 ) OpenAI launched GPT-5.5 on Codex today (Apr 23 2026). Adds it to the static catalog and pipes the user's OAuth access token into the openai-codex path of provider_model_ids() so /model mid-session and the gateway picker hit the live ChatGPT codex/models endpoint — new models appear for each user according to what ChatGPT actually lists for their account, without a Hermes release. Verified live: 'gpt-5.5' returns priority 0 (featured) from the endpoint, 400k context per OpenAI's launch article. 'hermes chat --provider openai-codex --model gpt-5.5' completes end-to-end. Changes: - hermes_cli/codex_models.py: add gpt-5.5 to DEFAULT_CODEX_MODELS + forward-compat - agent/model_metadata.py: 400k context length entry - hermes_cli/models.py: resolve codex OAuth token before calling get_codex_model_ids() in provider_model_ids('openai-codex')	2026-04-23 13:32:43 -07:00
kshitijk4poor	e91be4d7dc	fix: resolve_alias prefers highest version + merges static catalog Three bugs fixed in model alias resolution: 1. resolve_alias() returned the FIRST catalog match with no version preference. '/model mimo' picked mimo-v2-omni (index 0 in dict) instead of mimo-v2.5-pro. Now collects all prefix matches, sorts by version descending with pro/max ranked above bare names, and returns the highest. 2. models.dev registry missing newly added models (e.g. v2.5 for native xiaomi). resolve_alias() now merges static _PROVIDER_MODELS entries into the catalog so models resolve immediately without waiting for models.dev to sync. 3. hermes model picker showed only models.dev results (3 xiaomi models), hiding curated entries (5 total). The picker now merges curated models into the models.dev list so all models appear. Also fixes a trailing-dot float parsing edge case in _model_sort_key where '5.4.' failed float() and multi-dot versions like '5.4.1' weren't parsed correctly.	2026-04-23 23:18:33 +05:30
kshitij	82a0ed1afb	feat: add Xiaomi MiMo v2.5-pro and v2.5 model support (#14635 ) ## Merged Adds MiMo v2.5-pro and v2.5 support to Xiaomi native provider, OpenCode Go, and setup wizard. ### Changes - Context lengths: added v2.5-pro (1M) and v2.5 (1M), corrected existing MiMo entries to exact values (262144) - Provider lists: xiaomi, opencode-go, setup wizard - Vision: upgraded from mimo-v2-omni to mimo-v2.5 (omnimodal) - Config description updated for XIAOMI_API_KEY - Tests updated for new vision model preference ### Verification - 4322 tests passed, 0 new regressions - Live API tested on Xiaomi portal: basic, reasoning, tool calling, multi-tool, file ops, system prompt, vision — all pass - Self-review found and fixed 2 issues (redundant vision check, stale HuggingFace context length)	2026-04-23 10:06:25 -07:00
Teknium	ce089169d5	feat(skills-guard): gate agent-created scanner on config.skills.guard_agent_created (default off) Replaces the blanket 'always allow' change from the previous commit with an opt-in config flag so users who want belt-and-suspenders security can still get the keyword scan on skill_manage output. ## Default behavior (flag off) skill_manage(action='create'\|'edit'\|'patch') no longer runs the keyword scanner. The agent can write skills that mention risky keywords in prose (documenting what reviewers should watch for, describing cache-bust semantics in a PR-review skill, referencing AGENTS.md, etc.) without getting blocked. Rationale: the agent can already execute the same code paths via terminal() with no gate, so the scan adds friction without meaningful security against a compromised or malicious agent. ## Opt-in behavior (flag on) Set skills.guard_agent_created: true in config.yaml to get the original behavior back. Scanner runs on every skill_manage write; dangerous verdicts surface as a tool error the agent can react to (retry without the flagged content). ## External hub installs unaffected trusted/community sources (hermes skills install) always get scanned regardless of this flag. The gate is specifically for skill_manage, which only agents call. ## Changes - hermes_cli/config.py: add skills.guard_agent_created: False to DEFAULT_CONFIG - tools/skill_manager_tool.py: _guard_agent_created_enabled() reads the flag; _security_scan_skill() short-circuits to None when the flag is off - tools/skills_guard.py: restore INSTALL_POLICY['agent-created'] = ('allow', 'allow', 'ask') so the scan remains strict when it does run - tests/tools/test_skills_guard.py: restore original ask/force tests - tests/tools/test_skill_manager_tool.py: new TestSecurityScanGate class covering both flag states + config error handling ## Validation - tests/tools/test_skills_guard.py + test_skill_manager_tool.py: 115/115 pass - E2E: flagged-keyword skill creates with default config, blocks with flag on	2026-04-23 06:20:47 -07:00
Teknium	5a26938aa5	fix(terminal): auto-source ~/.profile and ~/.bash_profile so n/nvm PATH survives (#14534 ) The environment-snapshot login shell was auto-sourcing only ~/.bashrc when building the PATH snapshot. On Debian/Ubuntu the default ~/.bashrc starts with a non-interactive short-circuit: case $- in i) ;; *) return;; esac Sourcing it from a non-interactive shell returns before any PATH export below that guard runs. Node version managers like n and nvm append their PATH line under that guard, so Hermes was capturing a PATH without ~/n/bin — and the terminal tool saw 'node: command not found' even when node was on the user's interactive shell PATH. Expand the auto-source list (when auto_source_bashrc is on) to: ~/.profile → ~/.bash_profile → ~/.bashrc ~/.profile and ~/.bash_profile have no interactivity guard — installers that write their PATH there (n's n-install, nvm's curl installer on most setups) take effect. ~/.bashrc still runs last to preserve behaviour for users who put PATH logic there without the guard. Added two tests covering the new behaviour plus an E2E test that spins up a real LocalEnvironment with a guard-prefixed ~/.bashrc and a ~/.profile PATH export, and verifies the captured snapshot PATH contains the profile entry.	2026-04-23 05:15:37 -07:00
Teknium	d45c738a52	fix(gateway): preflight user D-Bus before systemctl --user start (#14531 ) On fresh RHEL/Debian SSH sessions without linger, `systemctl --user start hermes-gateway` fails with 'Failed to connect to bus: No medium found' because /run/user/$UID/bus doesn't exist. Setup previously showed a raw CalledProcessError and continued claiming success, so the gateway never actually started. systemd_start() and systemd_restart() now call _preflight_user_systemd() for the user scope first: - Bus socket already there → no-op (desktop / linger-enabled servers) - Linger off → try loginctl enable-linger (works when polkit permits, needs sudo otherwise), wait for socket - Still unreachable → raise UserSystemdUnavailableError with a clean remediation message pointing to sudo loginctl + hermes gateway run as the foreground fallback Setup's start/restart handlers and gateway_command() catch the new exception and render the multi-line guidance instead of a traceback.	2026-04-23 05:09:38 -07:00
David VV	39fcf1d127	fix(model_switch): group custom_providers by endpoint in /model picker (#9210 ) Multiple custom_providers entries sharing the same base_url + api_key are now grouped into a single picker row. A local Ollama host with per-model display names ("Ollama — GLM 5.1", "Ollama — Qwen3-coder", "Ollama — Kimi K2", "Ollama — MiniMax M2.7") previously produced four near-duplicate picker rows that differed only by suffix; now it appears as one "Ollama" row with four models. Key changes: - Grouping key changed from slug-by-name to (base_url, api_key). Names frequently differ per model while the endpoint stays the same. - When the grouped endpoint matches current_base_url, the row's slug is set to current_provider so picker-driven switches route through the live credential pipeline (no re-resolution needed). - Per-model suffix is stripped from the display name ("Ollama — X" → "Ollama") via em-dash / " - " separators. - Two groups with different api_keys at the same base_url (or otherwise colliding on cleaned name) are disambiguated with a numeric suffix (custom:openai, custom:openai-2) so both stay visible. - current_base_url parameter plumbed through both gateway call sites. Existing #8216, #11499, #13509 regressions covered (dict/list shapes of models:, section-3/section-4 dedup, normalized list-format entries). Salvaged from @davidvv's PR #9210 — the underlying code had diverged ~1400 commits since that PR was opened, so this is a reconstruction of the same approach on current main rather than a clean cherry-pick. Authorship preserved via --author on this commit. Closes #9210	2026-04-23 03:10:30 -07:00
Aslaaen	51c1d2de16	fix(profiles): stage profile imports to prevent directory clobbering	2026-04-23 03:02:34 -07:00
Wysie	be99feff1f	fix(image-gen): force-refresh plugin providers in long-lived sessions	2026-04-23 03:01:18 -07:00
drstrangerujn	a5b0c7e2ec	fix(config): preserve list-format models in custom_providers normalize _normalize_custom_provider_entry silently drops the models field when it's a list. Hand-edited configs (and the shape used by older Hermes versions) still write models as a plain list of ids, so after the normalize pass the entry reaches list_authenticated_providers() with no models and /model shows the provider with (0) models — even though the underlying picker code handles lists fine. Convert list-format models into the empty-value dict shape the rest of the pipeline already expects. Dict-format entries keep passing through unchanged. Repro (before the fix): custom_providers: - name: acme base_url: https://api.example.com/v1 models: [foo, bar, baz] /model shows "acme (0)"; bypassing normalize in list_authenticated_providers returns three models, confirming the drop happens in normalize. Adds four unit tests covering list→dict conversion, dict pass-through, filtering of empty/non-string entries, and the empty-list case.	2026-04-23 02:37:07 -07:00
sharziki	d7452af257	fix(pairing): handle null user_name in pairing list display When user_name is stored as None (e.g. Telegram users without a display name), dict.get('user_name', '') returns None because the key exists — the default is only used for missing keys. This causes a TypeError when the format specifier :<20 is applied to None. Use `or ''` to coerce None to an empty string. Fixes #7392 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-23 02:34:11 -07:00
helix4u	bace220d29	fix(image-gen): persist plugin provider on reconfigure	2026-04-23 01:56:09 -07:00
xinpengdr	5eefdd9c02	fix: skip non-API-key auth providers in env-var credential detection In list_authenticated_providers(), providers like qwen-oauth that use OAuth authentication were incorrectly flagged as authenticated because the env-var check fell back to models.dev provider env vars (e.g. DASHSCOPE_API_KEY for alibaba). Any user with an alibaba API key would see a ghost qwen-oauth entry in /model picker with 0 models listed. Fix: skip providers whose auth_type is not api_key in the env-var detection section (step 1). OAuth/external-process providers are properly handled in step 2 (HERMES_OVERLAYS) which checks the auth store.	2026-04-22 21:18:28 -07:00
Teknium	a2a8092e90	feat(cli): add --ignore-user-config and --ignore-rules flags Port from openai/codex#18646. Adds two flags to 'hermes chat' that fully isolate a run from user-level configuration and rules: * --ignore-user-config: skip ~/.hermes/config.yaml and fall back to built-in defaults. Credentials in .env are still loaded so the agent can actually call a provider. * --ignore-rules: skip auto-injection of AGENTS.md, SOUL.md, .cursorrules, and persistent memory (maps to AIAgent(skip_context_files=True, skip_memory=True)). Primary use cases: - Reproducible CI runs that should not pick up developer-local config - Third-party integrations (e.g. Chronicle in Codex) that bring their own config and don't want user preferences leaking in - Bug-report reproduction without the reporter's personal overrides - Debugging: bisect 'was it my config?' vs 'real bug' in one command Both flags are registered on the parent parser AND the 'chat' subparser (with argparse.SUPPRESS on the subparser to avoid overwriting the parent value when the flag is placed before the subcommand, matching the existing --yolo/--worktree/--pass-session-id pattern). Env vars HERMES_IGNORE_USER_CONFIG=1 and HERMES_IGNORE_RULES=1 are set by cmd_chat BEFORE 'from cli import main' runs, which is critical because cli.py evaluates CLI_CONFIG = load_cli_config() at module import time. The cli.py / hermes_cli.config.load_cli_config() function checks the env var and skips ~/.hermes/config.yaml when set. Tests: 11 new tests in tests/hermes_cli/test_ignore_user_config_flags.py covering the env gate, constructor wiring, cmd_chat simulation, and argparse flag registration. All pass; existing hermes_cli + cli suites unaffected (3005 pass, 2 pre-existing unrelated failures).	2026-04-22 19:58:42 -07:00
Dev-Mriganka	a14fb3ab1a	fix(cli): guard fallback_model list format in save_config_value When a user manually sets fallback_model as a YAML list instead of a dict, save_config_value() crashes with: AttributeError: 'list' object has no attribute 'get' at the fb.get('provider') call on hermes_cli/config.py. The fix adds isinstance(fb, dict) so list-format values are treated as unconfigured — the fallback_model comment block is appended to guide correct usage — instead of crashing. Fixes #4091 Co-authored-by: [AI-assisted — Claude Sonnet 4.6 via Milo/Hermes]	2026-04-22 18:16:49 -07:00
Abdelmajid NIDNASSER	ce4214ec94	Normalize claw workspace paths for Windows	2026-04-22 18:15:27 -07:00
li0near	82cce3d26c	fix: add base_url_env_var to Anthropic ProviderConfig The Anthropic provider entry in PROVIDER_REGISTRY is the only standard API-key provider missing a base_url_env_var. This causes the credential pool to hardcode base_url to https://api.anthropic.com, ignoring ANTHROPIC_BASE_URL from the environment. When using a proxy (e.g. LiteLLM, custom gateway), subagent delegation fails with 401 because: 1. _seed_from_env() creates pool entries with the hardcoded base_url 2. On error recovery, _swap_credential() overwrites the child agent's proxy URL with the pool entry's api.anthropic.com 3. The proxy API key is sent to real Anthropic → authentication_error Adding base_url_env_var="ANTHROPIC_BASE_URL" aligns Anthropic with the 20+ other providers that already have this field set (alibaba, gemini, deepseek, xai, etc.).	2026-04-22 17:56:14 -07:00
Teknium	7d8b2eee63	fix(delegate): default inherit_mcp_toolsets=true, drop version bump Follow-up on helix4u's PR #14211: - Flip default to true: narrowing toolsets=['web','browser'] expresses 'I want these extras', not 'silently strip MCP'. Parent MCP tools (registered at runtime) should survive narrowing by default. - Drop _config_version bump (22->23); additive nested key under delegation.* is handled by _deep_merge, no migration needed. - Update tests to reflect new default behavior.	2026-04-22 17:45:48 -07:00
helix4u	3e96c87f37	fix(delegate): make MCP toolset inheritance configurable	2026-04-22 17:45:48 -07:00
Teknium	9eb543cafe	feat(/model): merge models.dev entries for lesser-loved providers (#14221 ) New and newer models from models.dev now surface automatically in /model (both hermes model CLI and the gateway Telegram/Discord picker) for a curated set of secondary providers — no Hermes release required when the registry publishes a new model. Primary user-visible fix: on OpenCode Go, typing '/model mimo-v2.5-pro' no longer silently fuzzy-corrects to 'mimo-v2-pro'. The exact match against the merged models.dev catalog wins. Scope (opt-in frozenset _MODELS_DEV_PREFERRED in hermes_cli/models.py): opencode-go, opencode-zen, deepseek, kilocode, fireworks, mistral, togetherai, cohere, perplexity, groq, nvidia, huggingface, zai, gemini, google. Explicitly NOT merged: - openrouter and nous (never): curated list is already a hand-picked subset / Portal is source of truth. - xai, xiaomi, minimax, minimax-cn, kimi-coding, kimi-coding-cn, alibaba, qwen-oauth (per-project decision to keep curated-only). - providers with dedicated live-endpoint paths (copilot, anthropic, ai-gateway, ollama-cloud, custom, stepfun, openai-codex) — those paths already handle freshness themselves. Changes: - hermes_cli/models.py: add _MODELS_DEV_PREFERRED + _merge_with_models_dev helper. provider_model_ids() branches on the set at its curated-fallback return. Merge is models.dev-first, curated-only extras appended, case-insensitive dedup, graceful fallback when models.dev is offline. - hermes_cli/model_switch.py: list_authenticated_providers() calls the same merge in both its code paths (PROVIDER_TO_MODELS_DEV loop + HERMES_OVERLAYS loop). Picker AND validation-fallback both see fresh entries. - tests/hermes_cli/test_models_dev_preferred_merge.py (new): 13 tests — merge-helper unit tests (empty/raise/order/dedup), opencode-go/zen behavior, openrouter+nous explicitly guarded from merge. - tests/hermes_cli/test_opencode_go_in_model_list.py: converted from snapshot-style assertion to a behavior-based floor check, so it doesn't break when models.dev publishes additional opencode-go entries. Addresses a report from @pfanis via Telegram: newer Xiaomi variants on OpenCode Go weren't appearing in the /model picker, and /model was silently routing requests for new variants to older ones.	2026-04-22 17:33:42 -07:00
helix4u	b52123eb15	fix(gateway): recover stale pid and planned restart state	2026-04-22 16:33:46 -07:00
Teknium	51ca575994	feat(gateway): expose plugin slash commands natively on all platforms + decision-capable command hook Plugin slash commands now surface as first-class commands in every gateway enumerator — Discord native slash picker, Telegram BotCommand menu, Slack /hermes subcommand map — without a separate per-platform plugin API. The existing 'command:<name>' gateway hook gains a decision protocol via HookRegistry.emit_collect(): handlers that return a dict with {'decision': 'deny'\|'handled'\|'rewrite'\|'allow'} can intercept slash command dispatch before core handling runs, unifying what would otherwise have been a parallel 'pre_gateway_command' hook surface. Changes: - gateway/hooks.py: add HookRegistry.emit_collect() that fires the same handler set as emit() but collects non-None return values. Backward compatible — fire-and-forget telemetry hooks still work via emit(). - hermes_cli/plugins.py: add optional 'args_hint' param to register_command() so plugins can opt into argument-aware native UI registration (Discord arg picker, future platforms). - hermes_cli/commands.py: add _iter_plugin_command_entries() helper and merge plugin commands into telegram_bot_commands() and slack_subcommand_map(). New is_gateway_known_command() recognizes both built-in and plugin commands so the gateway hook fires for either. - gateway/platforms/discord.py: extract _build_auto_slash_command helper from the COMMAND_REGISTRY auto-register loop and reuse it for plugin-registered commands. Built-in name conflicts are skipped. - gateway/run.py: before normal slash dispatch, call emit_collect on command:<canonical> and honor deny/handled/rewrite/allow decisions. Hook now fires for plugin commands too. - scripts/release.py: AUTHOR_MAP entry for @Magaav. - Tests: emit_collect semantics, plugin command surfacing per platform, decision protocol (deny/handled/rewrite/allow + non-dict tolerance), Discord plugin auto-registration + conflict skipping, is_gateway_known_command. Salvaged from #14131 (@Magaav). Original PR added a parallel 'pre_gateway_command' hook and a platform-keyed plugin command registry; this re-implementation reuses the existing 'command:<name>' hook and treats plugin commands as platform-agnostic so the same capability reaches Telegram and Slack without new API surface. Co-authored-by: Magaav <73175452+Magaav@users.noreply.github.com>	2026-04-22 16:23:21 -07:00
Teknium	c96a548bde	feat(models): add xiaomi/mimo-v2.5-pro and mimo-v2.5 to openrouter + nous (#14184 ) Replace xiaomi/mimo-v2-pro with xiaomi/mimo-v2.5-pro and xiaomi/mimo-v2.5 in the OpenRouter fallback catalog and the nous provider model list. Add matching DEFAULT_CONTEXT_LENGTHS entries (1M tokens each).	2026-04-22 16:12:39 -07:00
brooklyn!	a1d57292af	Merge pull request #14145 from NousResearch/bb/tui-polish fix(tui): input wrap, shift-tab yolo, statusline, clean boot	2026-04-22 16:48:37 -05:00
kshitijk4poor	d6ed35d047	feat(security): add global toggle to allow private/internal URL resolution Adds security.allow_private_urls / HERMES_ALLOW_PRIVATE_URLS toggle so users on OpenWrt routers, TUN-mode proxies (Clash/Mihomo/Sing-box), corporate split-tunnel VPNs, and Tailscale networks — where DNS resolves public domains to 198.18.0.0/15 or 100.64.0.0/10 — can use web_extract, browser, vision URL fetching, and gateway media downloads. Single toggle in tools/url_safety.py; all 23 is_safe_url() call sites inherit automatically. Cached for process lifetime. Cloud metadata endpoints stay ALWAYS blocked regardless of the toggle: 169.254.169.254 (AWS/GCP/Azure/DO/Oracle), 169.254.170.2 (AWS ECS task IAM creds), 169.254.169.253 (Azure IMDS wire server), 100.100.100.200 (Alibaba), fd00:ec2::254 (AWS IPv6), the entire 169.254.0.0/16 link-local range, and the metadata.google.internal / metadata.goog hostnames (checked pre-DNS so they can't be bypassed on networks where those names resolve to local IPs). Supersedes #3779 (narrower HERMES_ALLOW_RFC2544 for the same class of users). Co-authored-by: kshitijk4poor <82637225+kshitijk4poor@users.noreply.github.com>	2026-04-22 14:38:59 -07:00
Brooklyn Nicholson	4107538da8	style(debug): add missing blank line between LogSnapshot and helpers Copilot on #14145 flagged PEP 8 / Black convention — two blank lines between top-level class and next top-level function.	2026-04-22 16:34:05 -05:00
Brooklyn Nicholson	b641639e42	fix(debug): distinguish empty-log from missing-log in report placeholder Copilot on #14138 flagged that the share report says '(file not found)' when the log exists but is empty (either because the primary is empty and no .1 rotation exists, or in the rare race where the file is truncated between _resolve_log_path() and stat()). - Split _primary_log_path() out of _resolve_log_path so both can share the LOG_FILES/home math without duplication. - _capture_log_snapshot now reports '(file empty)' when the primary path exists on disk with zero bytes, and keeps '(file not found)' for the truly-missing case. Tests: rename test_returns_none_for_empty → test_empty_primary_reports_file_empty with the new assertion, plus a race-path test that monkeypatches _resolve_log_path to exercise the size==0 branch directly.	2026-04-22 15:27:54 -05:00
Teknium	88564ad8bc	fix(skins): don't inherit status_bar_* into light-mode skins The salvaged status-bar skin keys were seeded on the default skin, but _build_skin_config merges default.colors into every skin — so daylight and warm-lightmode silently inherited silver status_bar_text (#C0C0C0) on their light backgrounds, rendering as low-contrast gray on gray. Drop the seven status_bar_{text,strong,dim,good,warn,bad,critical} entries from the default skin's colors and let get_prompt_toolkit_style _overrides fall back to banner_text / banner_title / banner_dim / ui_ok / ui_warn / ui_error. Dark skins keep their explicit overrides and render identically; light skins now inherit their own dark banner colors for readable status-bar text.	2026-04-22 13:20:02 -07:00
kshitij	81a504a4a0	fix: align status bar skin tests with upstream main Drop rebased test assumptions about theme-mode helpers removed on main and keep the status bar skin integration aligned with the current skin engine model.	2026-04-22 13:20:02 -07:00
kshitij	c323217188	fix: make CLI status bar skin-aware Route prompt_toolkit status bar colors through the skin engine so /skin updates the status bar alongside the rest of the interactive TUI. Add regression coverage for the new status bar style override keys and CLI style composition.	2026-04-22 13:20:02 -07:00
kshitijk4poor	de849c410d	refactor(debug): remove dead _read_log_tail/_read_full_log wrappers These thin wrappers around _capture_log_snapshot had zero production callers after the snapshot refactor — run_debug_share uses snapshots directly and collect_debug_report captures internally. The wrappers also caused a performance regression: _read_log_tail read up to 512KB and built full_text just to return tail_text. Remove both wrappers and migrate TestReadFullLog → TestCaptureLogSnapshot to test _capture_log_snapshot directly. Same coverage, tests the real API instead of dead indirection.	2026-04-22 11:59:39 -07:00
Junass1	61d0a99c11	fix(debug): sweep expired pending pastes on slash debug paths	2026-04-22 11:59:39 -07:00
kshitijk4poor	921133cfa5	fix(debug): preserve full line at truncation boundary and cap memory Adapt the byte-boundary-safe truncation fix from PR #14040 by taosiyuan163 into the new _capture_log_snapshot() code path: when the truncation cut lands exactly on a line boundary, keep the first retained line instead of unconditionally dropping it. Also add a 2x max_bytes safety cap to the backward-reading loop to prevent unbounded memory consumption when log files contain very long lines (e.g. JSON blobs) with few newlines. Based on #14040 by @taosiyuan163.	2026-04-22 11:59:39 -07:00
helix4u	fc3862bdd6	fix(debug): snapshot logs once for debug share	2026-04-22 11:59:39 -07:00
Teknium	3e652f75b2	fix(plugins+nous): auto-coerce memory plugins; actionable Nous 401 diagnostic (#14005 ) * fix(plugins): auto-coerce user-installed memory plugins to kind=exclusive User-installed memory provider plugins at $HERMES_HOME/plugins/<name>/ were being dispatched to the general PluginManager, which has no register_memory_provider method on PluginContext. Every startup logged: Failed to load plugin 'mempalace': 'PluginContext' object has no attribute 'register_memory_provider' Bundled memory providers were already skipped via skip_names={memory, context_engine} in discover_and_load, but user-installed ones weren't. Fix: _parse_manifest now scans the plugin's __init__.py source for 'register_memory_provider' or 'MemoryProvider' (same heuristic as plugins/memory/__init__.py:_is_memory_provider_dir) and auto-coerces kind to 'exclusive' when the manifest didn't declare one explicitly. This routes the plugin to plugins/memory discovery instead of the general loader. The escape hatch: if a manifest explicitly declares kind: standalone, the heuristic doesn't override it. Reported by Uncle HODL on Discord. * fix(nous): actionable CLI message when Nous 401 refresh fails Mirrors the Anthropic 401 diagnostic pattern. When Nous returns 401 and the credential refresh (_try_refresh_nous_client_credentials) also fails, the user used to see only the raw APIError. Now prints: 🔐 Nous 401 — Portal authentication failed. Response: <truncated body> Most likely: Portal OAuth expired, account out of credits, or agent key revoked. Troubleshooting: • Re-authenticate: hermes login --provider nous • Check credits / billing: https://portal.nousresearch.com • Verify stored credentials: $HERMES_HOME/auth.json • Switch providers temporarily: /model <model> --provider openrouter Addresses the common 'my hermes model hangs' pattern where the user's Portal OAuth expired and the CLI gave no hint about the next step.	2026-04-22 05:54:11 -07:00

1 2 3 4 5 ...

1370 commits