hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-08 03:01:47 +00:00

Author	SHA1	Message	Date
Teknium	dd7921d514	fix(honcho): isolate session routing for multi-user gateway (#1500 ) Salvaged from PR #1470 by adavyas. Core fix: Honcho tool calls in a multi-session gateway could route to the wrong session because honcho_tools.py relied on process-global state. Now threads session context through the call chain: AIAgent._invoke_tool() → handle_function_call() → registry.dispatch() → handler **kw → _resolve_session_context() Changes: - Add _resolve_session_context() to prefer per-call context over globals - Plumb honcho_manager + honcho_session_key through handle_function_call - Add sync_honcho=False to run_conversation() for synthetic flush turns - Pass honcho_session_key through gateway memory flush lifecycle - Harden gateway PID detection when /proc cmdline is unreadable - Make interrupt test scripts import-safe for pytest-xdist - Wrap BibTeX examples in Jekyll raw blocks for docs build - Fix thread-order-dependent assertion in client lifecycle test - Expand Honcho docs: session isolation, lifecycle, routing internals Dropped from original PR: - Indentation change in _create_request_openai_client that would move client creation inside the lock (causes unnecessary contention) Co-authored-by: adavyas <adavyas@users.noreply.github.com>	2026-03-16 00:23:47 -07:00
Teknium	eb4f0348e1	fix: persist CLI token counts to session DB for /insights Token usage was tracked in-memory during CLI sessions (session_prompt_tokens, session_completion_tokens) but never written to the SQLite session DB. The gateway persisted tokens via session_store.update_session(), but CLI sessions always showed 0 tokens in /insights. Now run_agent.py persists token deltas to the DB after each API call for CLI sessions. Gateway sessions continue to use their existing persist path to avoid double-counting.	2026-03-16 00:23:13 -07:00
teknium1	38b4fd3737	fix(gateway): make group session isolation configurable default group and channel sessions to per-user isolation, allow opting back into shared room sessions via config.yaml, and document Discord gateway routing and session behavior.	2026-03-16 00:22:23 -07:00
ygd58	36dd7a3e8d	fix(setup): defer config.yaml write until after model selection _update_config_for_provider() was called immediately after provider selection for zai, kimi-coding, minimax, minimax-cn, and anthropic — before model selection happened. Since the gateway re-reads config.yaml per-message, this created a race where the gateway would pick up the new provider but still use the old (incompatible) model name. Capture selected_base_url in each provider block, then call _update_config_for_provider() once, after model selection completes, right before save_config(). The in-memory _set_model_provider() calls stay in place so the config object remains consistent during setup. Closes #1182	2026-03-16 00:18:30 -07:00
Teknium	dd698f6d5d	fix(gateway): SSL certificate auto-detection for NixOS and non-standard systems (#1494 ) fix(gateway): SSL certificate auto-detection for NixOS and non-standard systems	2026-03-16 00:14:13 -07:00
teknium1	06a7d19f98	fix(gateway): isolate group sessions per user Include participant identifiers in non-DM session keys when available so group and channel conversations no longer share one transcript across every active user in the chat.	2026-03-15 23:08:56 -07:00
teknium1	3801532bd3	fix(gateway): SSL certificate auto-detection for NixOS and non-standard systems Add _ensure_ssl_certs() that discovers CA certificate bundles before any HTTP library is imported. Resolution order: 1. Python's ssl.get_default_verify_paths() 2. certifi (if installed) 3. Common distro/macOS paths Only sets SSL_CERT_FILE if not already present in the environment. Wrapped in a function (called immediately) to avoid polluting module namespace. Based on PR #1151 by sylvesterroos.	2026-03-15 23:04:34 -07:00
Teknium	aaacab7de7	docs: explain checkpoints, /rollback, and git worktrees * docs: explain checkpoints, rollback, and git worktrees * fix: correct hermes -w description — auto-creates worktree, takes no path arg --------- Co-authored-by: aydnOktay <xaydinoktay@gmail.com>	2026-03-15 23:04:07 -07:00
Teknium	4298c6fd9a	fix: route background process watcher notifications to Telegram forum topics (#1481 ) Salvaged from PR #1146 by spanishflu-est1918. Background process progress/completion messages were sent with only chat_id, landing in the general topic instead of the originating forum topic. Thread the thread_id from HERMES_SESSION_THREAD_ID through the watcher payload and pass it as metadata to adapter.send() so Telegram routes notifications to the correct topic. The env var export (HERMES_SESSION_THREAD_ID in _set_session_env / _clear_session_env) already existed on main — this commit adds the missing watcher plumbing. Co-authored-by: spanishflu-est1918 <spanishflu-est1918@users.noreply.github.com>	2026-03-15 23:01:57 -07:00
Teknium	c30505dddd	feat: add OSS Security Forensics skill (Skills Hub) (#1482 ) * feat: add OSS Security Forensics skill (Skills Hub) Salvaged from PR #1066 by zagiscoming. Adds a 7-phase multi-agent investigation framework for GitHub supply chain attack forensics. Skill contents (optional-skills/security/oss-forensics/): - SKILL.md: 420-line investigation framework with 8 anti-hallucination guardrails, 5 specialist investigators, ethical use guidelines, and API rate limiting guidance - evidence-store.py: CLI evidence manager with add/list/verify/query/ export/summary + SHA-256 integrity + chain of custody - references/: evidence types, GH Archive BigQuery guide (expanded with 12 event types and 6 query templates), recovery techniques (4 methods), investigation templates (5 attack patterns) - templates/: forensic report template (151 lines), malicious package report template Changes from original PR: - Dropped unrelated core tool changes (delegate_tool.py role parameter, AGENTS.md, README.md modifications) - Removed duplicate skills/security/oss-forensics/ placement - Fixed github-archive-guide.md (missing from optional-skills/, expanded from 33 to 160+ lines with all 12 event types and query templates) - Added ethical use guidelines and API rate limiting sections - Rewrote tests to match the v2 evidence store API (12 tests, all pass) Closes #384 * fix: use python3 and SKILL_DIR paths throughout oss-forensics skill - Replace all 'python' invocations with 'python3' for portability (Ubuntu doesn't ship 'python' by default) - Replace relative '../scripts/' and '../templates/' paths with SKILL_DIR/scripts/ and SKILL_DIR/templates/ convention - Add path convention note before Phase 0 explaining SKILL_DIR - Fix double --- separator (cosmetic) - Applies to SKILL.md, evidence-store.py docstring, recovery-techniques.md, and forensic-report.md template --------- Co-authored-by: zagiscoming <zagiscoming@users.noreply.github.com>	2026-03-15 21:59:53 -07:00
Teknium	70e24d77a1	Merge pull request #1490 from NousResearch/fix/1033-telegram-voice-fallback fix: restore local STT fallback for gateway voice notes	2026-03-15 21:58:32 -07:00
Teknium	fa3db2671a	docs(readme): add CLI vs messaging quick reference Co-authored-by: Frank <97429702+tsubasakong@users.noreply.github.com>	2026-03-15 21:58:11 -07:00
Teknium	6fd9f2a0c5	fix(gateway): null-coalesce mode in SessionResetPolicy.from_dict (#1488 ) fix(gateway): null-coalesce mode in SessionResetPolicy.from_dict	2026-03-15 21:57:31 -07:00
teknium1	1f72ce71b7	fix: restore local STT fallback for gateway voice notes Restore local STT command fallback for voice transcription, detect whisper and ffmpeg in common local install paths, and avoid bogus no-provider messaging when only a backend-specific key is missing.	2026-03-15 21:51:40 -07:00
teknium1	102a255575	fix(gateway): null-coalesce mode in SessionResetPolicy.from_dict Complete the YAML null handling for all three SessionResetPolicy fields. at_hour and idle_minutes already had null coalescing; mode was still using data.get('mode', 'both') which returns None when the key exists with an explicit null value. Add regression test covering all-null input. Based on PR #1120 by stablegenius49.	2026-03-15 21:40:22 -07:00
Teknium	5beb681c70	fix(cli): prefer curses over simple_term_menu in setup.py (#1487 )	2026-03-15 21:16:21 -07:00
Teknium	c9a9db318e	feat(tools): persistent shell mode for local and SSH backends (#1483 ) feat(tools): persistent shell mode for local and SSH backends	2026-03-15 21:14:01 -07:00
teknium1	01e62c067b	merge: resolve conflicts with origin/main (SSH preflight check)	2026-03-15 21:13:40 -07:00
Teknium	ceb970c559	fix(terminal): add SSH preflight check (#1486 )	2026-03-15 21:09:07 -07:00
teknium1	6894358fe1	docs: add persistent shell section to configuration and env-vars reference Documents terminal.persistent_shell config option, per-backend env var overrides, precedence table, and what state persists across commands.	2026-03-15 21:01:50 -07:00
Teknium	3f0f4a04a9	fix(agent): skip reasoning extra_body for unsupported OpenRouter models (#1485 ) * fix(agent): skip reasoning extra_body for models that don't support it Sending reasoning config to models like MiniMax or Nvidia via OpenRouter causes a 400 BadRequestError. Previously, reasoning extra_body was sent to all OpenRouter and Nous models unconditionally. Fix: only send reasoning extra_body when the model slug starts with a known reasoning-capable prefix (deepseek/, anthropic/, openai/, x-ai/, google/gemini-2, qwen/qwen3) or when using Nous Portal directly. Applies to both the main API call path (_build_api_kwargs) and the conversation summary path. Fixes #1083 * test(agent): cover reasoning extra_body gating --------- Co-authored-by: ygd58 <buraysandro9@gmail.com>	2026-03-15 20:42:07 -07:00
Teknium	c564e1c3dc	feat(tools): centralize tool emoji metadata in registry + skin integration (#1484 ) feat(tools): centralize tool emoji metadata in registry + skin integration	2026-03-15 20:35:24 -07:00
teknium1	210d5ade1e	feat(tools): centralize tool emoji metadata in registry + skin integration - Add 'emoji' field to ToolEntry and 'get_emoji()' to ToolRegistry - Add emoji= to all 50+ registry.register() calls across tool files - Add get_tool_emoji() helper in agent/display.py with 3-tier resolution: skin override → registry default → hardcoded fallback - Replace hardcoded emoji maps in run_agent.py, delegate_tool.py, and gateway/run.py with centralized get_tool_emoji() calls - Add 'tool_emojis' field to SkinConfig so skins can override per-tool emojis (e.g. ares skin could use swords instead of wrenches) - Add 11 tests (5 registry emoji, 6 display/skin integration) - Update AGENTS.md skin docs table Based on the approach from PR #1061 by ForgingAlex (emoji centralization in registry). This salvage fixes several issues from the original: - Does NOT split the cronjob tool (which would crash on missing schemas) - Does NOT change image_generate toolset/requires_env/is_async - Does NOT delete existing tests - Completes the centralization (gateway/run.py was missed) - Hooks into the skin system for full customizability	2026-03-15 20:21:21 -07:00
teknium1	33ebedc76d	feat: enable persistent shell by default for SSH, add config option SSH persistent shell now defaults to true — non-local backends benefit most from state persistence across execute() calls. Local backend remains opt-in via TERMINAL_LOCAL_PERSISTENT env var. New config.yaml option: terminal.persistent_shell (default: true) Controls the default for non-local backends. Users can disable with: hermes config set terminal.persistent_shell false Precedence: per-backend env var > TERMINAL_PERSISTENT_SHELL > default. Wired through cli.py, gateway/run.py, and hermes_cli/config.py so the config.yaml value reaches terminal_tool via env var bridge.	2026-03-15 20:17:13 -07:00
teknium1	5b80654198	feat(tools): add persistent shell mode to local and SSH backends Cherry-picked from PR #1067 by alt-glitch. Adds PersistentShellMixin with file-based IPC protocol for long-lived bash shells. LocalEnvironment and SSHEnvironment gain persistent=True option. Controlled via TERMINAL_LOCAL_PERSISTENT / TERMINAL_SSH_PERSISTENT env vars. Fixes latent stderr pipe buffer deadlock. Co-authored-by: alt-glitch <balyan.sid@gmail.com>	2026-03-15 20:13:02 -07:00
Teknium	25e53f3c1a	fix(custom-endpoint): verify /models and suggest working /v1 base URL (#1480 )	2026-03-15 20:09:50 -07:00
Teknium	103f7b1ebc	fix: verbose mode shows full untruncated output * fix(cli): silence tirith prefetch install warnings at startup * fix: verbose mode now shows full untruncated tool args, results, content, and think blocks When tool progress is set to 'verbose' (via /verbose or config), the display was still truncating tool arguments to 100 chars, tool results to 100-200 chars, assistant content to 100 chars, and think blocks to 5 lines. This defeated the purpose of verbose mode. Changes: - Tool args: show full JSON args (not truncated to log_prefix_chars) - Tool results: show full result content in both display and debug logs - Assistant content: show full content during tool-call loops - Think blocks: show full reasoning text (not truncated to 5 lines/100 chars) - Auto-enable reasoning display when verbose mode is active - Fix initial agent creation to respect verbose config (was always quiet_mode=True) - Updated verbose label to mention think blocks	2026-03-15 20:03:37 -07:00
Teknium	a56937735e	fix(telegram): escape chunk indicators in MarkdownV2 (#1478 )	2026-03-15 19:27:15 -07:00
Teknium	7148534401	fix(gateway): make /status report live state and tokens (#1476 )	2026-03-15 19:18:58 -07:00
Teknium	4e91b0240b	fix(honcho): correct seed_ai_identity to use session.add_messages() (#1475 ) The seed_ai_identity method was calling assistant_peer.add_message() which doesn't exist on the Honcho SDK's Peer class. Fixed to use the correct pattern: session.add_messages([peer.message(content)]), matching the existing message sync code at line 294. Discovered and fixed by Yuqi (Hermes Agent), Angello's AI companion. Co-authored-by: Angello Picasso <angello.picasso@devsu.com>	2026-03-15 19:07:57 -07:00
Teknium	5e92a4ce5a	fix: auto-reload MCP tools when mcp_servers config changes without restart (#1474 ) Fixes #1036 After adding an MCP server to config.yaml, users had to restart Hermes before the new tools became visible — even though /reload-mcp existed. Add _check_config_mcp_changes() called from process_loop every 5s: - stat() config.yaml for mtime changes (fast path, no YAML parse) - On mtime change, parse and compare mcp_servers section - If mcp_servers changed, auto-trigger _reload_mcp() and notify user - Skip check while agent is running to avoid interrupting tool calls - Throttled to CONFIG_WATCH_INTERVAL=5s to avoid busy-polling /reload-mcp still works for manual force-reload. Tests: 6 new tests in TestMCPConfigWatch, all passed Co-authored-by: teyrebaz33 <hakanerten02@hotmail.com>	2026-03-15 19:03:34 -07:00
Teknium	471c663fdf	fix(cli): silence tirith prefetch install warnings at startup (#1452 )	2026-03-15 18:07:03 -07:00
Teknium	64d333204b	Merge pull request #1242 from NousResearch/fix/file-tool-log-noise fix: reduce file tool log noise	2026-03-15 11:11:18 -07:00
Teknium	c44af43840	Merge pull request #1401 from NousResearch/hermes/hermes-eca4a640 test: protect atomic temp cleanup on interrupts	2026-03-15 11:10:41 -07:00
alt-glitch	4511322f56	Merge origin/main into sid/persistent-backend Resolve conflict in local.py: keep refactored _make_run_env helper over inline _sanitize_subprocess_env logic.	2026-03-15 21:08:11 +05:30
Teknium	934fc9df22	Merge pull request #1440 from NousResearch/fix/1071-dict-tool-args fix: handle dict tool call arguments from local backends	2026-03-15 08:04:09 -07:00
teknium1	5847c180c6	test: restore vllm integration coverage and add dict-args regression Restore the existing vLLM integration test module that was accidentally replaced during development and add a focused agent-loop regression test for dict tool-call arguments from OpenAI-compatible local backends.	2026-03-15 08:02:29 -07:00
teknium1	93a0c0cddd	fix: handle dict tool call arguments from local backends Normalize tool call arguments when OpenAI-compatible backends return parsed dict/list payloads instead of JSON strings. This prevents the .strip() crash during tool-call validation for llama.cpp and similar servers, while preserving existing empty-string and invalid-JSON handling. Adds a focused regression test for dict arguments in the agent loop.	2026-03-15 08:00:19 -07:00
Teknium	23e8fdd167	feat(discord): auto-thread on @mention + skip mention in bot threads Two changes to align Discord behavior with Slack: 1. Auto-thread on @mention (default: true) - When someone @mentions the bot in a server channel, a thread is automatically created from their message and the response goes there. - Each thread gets its own isolated session (like Slack). - Configurable via discord.auto_thread in config.yaml (default: true) or DISCORD_AUTO_THREAD env var (env takes precedence). - DMs and existing threads are unaffected. 2. Skip @mention in bot-participated threads - Once the bot has responded in a thread (auto-created or manually entered), subsequent messages in that thread no longer require @mention. Users can just type normally. - Tracked via in-memory set (_bot_participated_threads). After a gateway restart, users need to @mention once to re-establish. - Threads the bot hasn't participated in still require @mention. Config change: discord: auto_thread: true # new, added to DEFAULT_CONFIG Tests: 7 new tests covering auto-thread default, disable, bot thread participation tracking, and mention skip logic. All 903 gateway tests pass.	2026-03-15 07:59:55 -07:00
Teknium	3268b98779	Merge pull request #1437 from NousResearch/fix/1219-cron-thread-context fix: preserve thread context for cronjob deliver=origin	2026-03-15 06:58:37 -07:00
teyrebaz33	20f381cfb6	fix: preserve thread context for cronjob deliver=origin When a cronjob is created from within a Telegram or Slack thread, deliver=origin was posting to the parent channel instead of the thread. Root cause: the gateway never set HERMES_SESSION_THREAD_ID in the session environment, so cronjob_tools.py could not capture thread_id into the job's origin metadata — even though the scheduler already reads origin.get('thread_id'). Fix: - gateway/run.py: set HERMES_SESSION_THREAD_ID when thread_id is present on the session context, and clear it in _clear_session_env - tools/cronjob_tools.py: read HERMES_SESSION_THREAD_ID into origin Closes #1219	2026-03-15 06:57:00 -07:00
Teknium	77bfa252b9	Merge pull request #1434 from NousResearch/fix/1244-env-override fix(config): reload .env over stale shell overrides	2026-03-15 06:47:40 -07:00
teknium1	f24c00a5bf	fix(config): reload .env over stale shell overrides Hermes startup entrypoints now load ~/.hermes/.env and project fallback env files with user config taking precedence over stale shell-exported values. This makes model/provider/base URL changes in .env actually take effect after restarting Hermes. Adds a shared env loader plus regression coverage, and reproduces the original bug case where OPENAI_BASE_URL and HERMES_INFERENCE_PROVIDER remained stuck on old shell values before import.	2026-03-15 06:46:28 -07:00
Teknium	463239ed85	docs: fallback providers + /background command documentation * docs: comprehensive fallback providers documentation - New dedicated page: user-guide/features/fallback-providers.md covering both primary model fallback and auxiliary task fallback systems - Updated configuration.md with fallback_model config section - Updated environment-variables.md noting fallback is config-only - Fleshed out developer-guide/provider-runtime.md fallback section with internal architecture details (trigger points, activation flow, config flow) - Added cross-reference from provider-routing.md distinguishing OpenRouter sub-provider routing from Hermes-level model fallback - Added new page to sidebar under Integrations * docs: comprehensive /background command documentation - Added Background Sessions section to cli.md covering how it works (daemon threads, isolated sessions, config inheritance, Rich panel output, bell notification, concurrent tasks) - Added Background Sessions section to messaging/index.md covering messaging-specific behavior (async execution, result delivery back to same chat, fire-and-forget pattern) - Documented background_process_notifications config (all/result/error/off) in messaging docs and configuration.md - Added HERMES_BACKGROUND_NOTIFICATIONS env var to reference page - Fixed inconsistency in slash-commands.md: /background was listed as messaging-only but works in both CLI and messaging. Moved it to the 'both surfaces' note. - Expanded one-liner table descriptions with detail and cross-references	2026-03-15 06:24:28 -07:00
Teknium	60cce9ca6d	Merge pull request #1429 from NousResearch/fix/1336-discord-voice-reliability fix(voice): Discord voice channel reliability fixes	2026-03-15 05:25:45 -07:00
teknium1	2d57946ee9	test(voice): clarify install guidance and local skips Add an explicit messaging-extra install hint to the missing PyNaCl/davey error path, cover it with a voice-channel join regression test, and skip the low-level NaCl packet tests when PyNaCl is not installed locally.	2026-03-15 05:24:34 -07:00
0xbyt4	5f32fd8b6d	feat(voice): add discord-voice-doctor diagnostic script Checks the full voice environment and reports what's missing: - Python packages: discord.py, PyNaCl, davey, STT/TTS providers - System tools: Opus codec (macOS + Linux paths), ffmpeg - Environment: bot token, allowed users (resolved to usernames), API keys - Configuration: STT/TTS provider, voice mode state - Bot permissions: live Discord API check for Connect, Speak, VAD, etc. All sensitive values are masked. Gracefully handles missing deps, invalid tokens, API timeouts, and unreachable Discord API.	2026-03-15 05:20:17 -07:00
0xbyt4	3ea039684e	test(voice): add integration tests with real NaCl crypto and Opus codec End-to-end voice channel tests using real crypto (no mocks): NaCl decrypt (5): valid packet, wrong key, bot SSRC, multi-packet, multi-SSRC DAVE passthrough (3): unknown SSRC, Unencrypted error, real error drop Full flow (5): utterance lifecycle, auto-map, pause/resume, corruption, cleanup SPEAKING hook (4): hook installed, map/overwrite, mapped audio processed Auth filtering (3): allowed user, rejected user, empty allowlist Rejoin flow (3): clean state, new SSRC, missing SPEAKING auto-map Multi-guild (2): independent receivers, stop isolation Echo prevention (2): paused audio ignored, resumed audio processed	2026-03-15 05:20:17 -07:00
0xbyt4	63f0ec96ec	test(voice): add comprehensive flow tests for voice channel fixes Tests cover the actual code paths changed in voice fixes: _on_packet DAVE passthrough (8 tests): - Known SSRC + DAVE decrypt success → buffered - Unknown SSRC + DAVE → skip DAVE, passthrough to Opus - DAVE "Unencrypted" error → passthrough, not dropped - DAVE other error → packet dropped - No DAVE session → direct decode - Bot's own SSRC → ignored (echo prevention) - Multiple SSRCs → separate buffers SSRC auto-mapping (6 tests): - Single allowed user → auto-mapped - Multiple allowed users → no auto-map - No allowlist → sole non-bot member inferred - Unallowed user → rejected - Only bot in channel → no map - Auto-map persists across checks Buffer lifecycle (4 tests): - Known SSRC completed utterance - Short buffer ignored - Recent audio waits - Stale unknown buffer discarded TTS playback (10 tests): - play_tts calls play_in_voice_channel in VC - play_tts falls through when not in VC - play_tts wrong channel no match - Voice input dedup (runner skips) - Text + voice_mode combinations - Error/empty response skipped - Agent TTS tool dedup UDP keepalive (2 tests): - Interval within bounds - Silence frame actually sent via send_packet	2026-03-15 05:20:17 -07:00
0xbyt4	1cacaccca6	fix(voice): show clear error when voice dependencies are missing When PyNaCl or davey is not installed, joining a voice channel fails with a raw exception. Now shows a human-readable message pointing the user to reinstall with voice support. Closes #1336	2026-03-15 05:20:17 -07:00

... 2 3 4 5 6 ...

2125 commits