hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-16 09:31:37 +00:00

Author	SHA1	Message	Date
Teknium	95c0bee7f8	Merge pull request #1299 from NousResearch/hermes/hermes-f5fb1d3b fix: salvage PR #327 voice mode onto current main	2026-03-14 06:45:20 -07:00
Teknium	c1cca65168	Merge pull request #1302 from NousResearch/hermes/hermes-315847fd feat(mcp): salvage selective tool loading with utility policies	2026-03-14 06:40:45 -07:00
teknium1	67e80def53	docs(mcp): add comprehensive Hermes MCP docs Expand the MCP feature docs with filtering and capability-aware registration details, add a practical 'Use MCP with Hermes' tutorial, add a config reference page, and wire the new docs into the sidebar and landing page.	2026-03-14 06:36:01 -07:00
Teknium	63309065b6	Merge pull request #1305 from NousResearch/hermes/hermes-2ba57c8a fix: email adapter IMAP UID tracking and SMTP TLS verification	2026-03-14 06:32:35 -07:00
teknium1	71cffbfa4f	fix: verify SMTP TLS in send_message_tool Add regression coverage for the standalone email send path and pass an explicit default SSL context to STARTTLS for certificate verification, matching the gateway email adapter hardening salvaged from PR #994.	2026-03-14 06:31:52 -07:00
teknium1	9633ddd8d8	fix: initialize CLI voice state for single-query mode - initialize voice and interrupt runtime state in HermesCLI.__init__ - prevent chat -q from crashing before run() has executed - add regression coverage for single-query state initialization	2026-03-14 06:31:32 -07:00
Himess	344adc72a1	fix: update email test mocks to use imap.uid() instead of imap.search/fetch Tests were still mocking imap.search() and imap.fetch() but the implementation was changed to use imap.uid("search", ...) and imap.uid("fetch", ...) for proper UID-based IMAP operations.	2026-03-14 06:29:00 -07:00
Himess	fa72f4ff55	fix: email adapter IMAP UID tracking and SMTP TLS verification - Use imap.uid() for search and fetch instead of imap.search/fetch. Sequence numbers shift when messages are deleted, causing the adapter to skip new messages or reprocess old ones. UIDs are stable. - Pass ssl.create_default_context() to starttls() so the server certificate is actually verified. Without it smtplib uses ssl._create_stdlib_context() which skips verification.	2026-03-14 06:29:00 -07:00
Teknium	914bb12035	Merge pull request #1301 from NousResearch/hermes/hermes-2ba57c8a feat: add Parallel CLI research skill	2026-03-14 06:24:16 -07:00
teknium1	04e151714f	feat(mcp): make selective tool loading capability-aware Extend the salvaged MCP filtering work so utility tools are also governed by policy and server capabilities. Store the registered tool subset per server so rediscovery and status reporting stay accurate after filtering.	2026-03-14 06:22:02 -07:00
Teknium	2ff03ebafe	fix: use non-greedy regex in DeepSeek V3 parser for multi-tool calls (#1300 ) The greedy `.` captures with `re.DOTALL` cause `findall()` to merge multiple tool calls into a single match — silently dropping all but the last tool call. Switching to `.?` (non-greedy) fixes extraction when models return multiple tool calls in one response. Adds test coverage for the DeepSeek V3 parser including a multi-tool call regression test. Co-authored-by: Himess <semihcvlk53@gmail.com>	2026-03-14 06:19:28 -07:00
teknium1	d2869de477	docs: tighten Parallel CLI skill guidance Clarify that Parallel is an optional paid vendor workflow, add headless auth and context-chaining guidance, and align command examples more closely with upstream docs before salvaging PR #985.	2026-03-14 06:18:04 -07:00
kshitij	8d61ebe183	feat: add Parallel CLI research skill	2026-03-14 06:15:16 -07:00
teknium1	7b10881b9e	fix: persist clean voice transcripts and /voice off state - keep CLI voice prefixes API-local while storing the original user text - persist explicit gateway off state and restore adapter auto-TTS suppression on restart - add regression coverage for both behaviors	2026-03-14 06:14:22 -07:00
Teknium	a0f0f4fe52	Merge pull request #1297 from NousResearch/hermes/hermes-5556ee7e docs: salvage #980 terminal backend and Windows troubleshooting	2026-03-14 06:14:03 -07:00
teyrebaz33	3198cc8fd9	feat(mcp): per-server tool filtering via include/exclude and enabled flag Add optional config keys under each mcp_servers entry: - tools.include: whitelist, only listed tools are registered - tools.exclude: blacklist, all tools except listed are registered - enabled: false: skip server entirely, no connection attempt Backward-compatible: no config keys = all tools registered as before. Tests: TestMCPSelectiveToolLoading (4 tests), 134 passed total.	2026-03-14 06:12:17 -07:00
Teknium	fb3c163612	fix(gateway): surface missing linger in status and doctor (#1296 ) * fix(gateway): surface missing linger in status and doctor Warn when a systemd user gateway service has linger disabled so users can spot the common 'gateway sleeps after logout' deployment issue from both hermes doctor and hermes gateway status. * fix(gateway): check linger status after install After installing the systemd user service, report whether linger is already enabled instead of always printing the generic hint. This makes post-install guidance match the user's actual deployment state.	2026-03-14 06:11:33 -07:00
Teknium	6fa197f973	Merge pull request #1298 from NousResearch/hermes/hermes-aa653753 fix: clearer terminal backend requirement errors	2026-03-14 06:05:58 -07:00
Oktay Aydin	00a0f18544	fix: clearer terminal backend requirement errors Salvaged from PR #979 onto current main. Preserve the current terminal backend checks while surfacing actionable preflight errors for unknown TERMINAL_ENV values, missing SSH host/user configuration, and missing Modal credentials/config. Tighten the modal regression test so it deterministically exercises the config-missing path.	2026-03-14 06:04:39 -07:00
teknium1	523a1b6faf	merge: salvage PR #327 voice mode branch Merge contributor branch feature/voice-mode onto current main for follow-up fixes.	2026-03-14 06:03:07 -07:00
teknium1	dd6a5732e7	docs: fix salvaged PR #980 troubleshooting details Correct the PowerShell UTF-8 snippet in the new Windows encoding tip and soften the Docker CLI wording to match Hermes' actual lookup behavior.	2026-03-14 06:02:57 -07:00
aydnOktay	767b5463f9	docs: add terminal backend and windows troubleshooting	2026-03-14 06:01:22 -07:00
Teknium	acc669645f	Merge pull request #1294 from NousResearch/hermes/hermes-315847fd fix(update): salvage autostash update flow from PR #978	2026-03-14 05:59:03 -07:00
teknium1	42c778b5eb	fix(update): warn and prompt before restoring autostash Add a restore prompt for interactive updates, keep the stash when the user declines, and print a post-restore warning that local changes were reapplied on top of updated code.	2026-03-14 05:50:18 -07:00
smillunchick	f764c7135d	fix: auto-stash local changes during updates	2026-03-14 05:44:48 -07:00
Teknium	b646440ca0	fix(mcp): resolve npx stdio connection failures (#1291 ) Salvaged from PR #977 onto current main. Preserves the MCP stdio command resolution and improved error diagnostics, with deterministic regression tests for the npx/node PATH cases. Co-authored-by: kshitij <82637225+kshitijk4poor@users.noreply.github.com>	2026-03-14 05:44:00 -07:00
0xbyt4	92c14ec4b0	fix(test): add missing voice state attrs to CLI stub in skin tests The rebase added voice prompt checks to _get_tui_prompt_fragments but the test stub was missing _voice_recording, _voice_processing and _voice_mode attributes, causing AttributeError.	2026-03-14 15:00:45 +03:00
0xbyt4	eb34c0b09a	fix: voice pipeline hardening — 7 bug fixes with tests 1. Anthropic + ElevenLabs TTS silence: forward full response to TTS callback for non-streaming providers (choices first, then native content blocks fallback). 2. Subprocess timeout kill: play_audio_file now kills the process on TimeoutExpired instead of leaving zombie processes. 3. Discord disconnect cleanup: leave all voice channels before closing the client to prevent leaked state. 4. Audio stream leak: close InputStream if stream.start() fails. 5. Race condition: read/write _on_silence_stop under lock in audio callback thread. 6. _vprint force=True: show API error, retry, and truncation messages even during streaming TTS. 7. _refresh_level lock: read _voice_recording under _voice_lock.	2026-03-14 14:27:21 +03:00
0xbyt4	7a24168080	fix: add missing choices/Choice to discord mock in test_discord_free_response The mock's app_commands SimpleNamespace lacked choices and Choice attrs, causing xdist test ordering failures when this mock loaded before test_discord_slash_commands.	2026-03-14 14:27:21 +03:00
0xbyt4	cc0a453476	fix: address PR review round 5 — streaming guard, VC auth, history prefix, auto-TTS control 1. Gate _streaming_api_call to chat_completions mode only — Anthropic and Codex fall back to _interruptible_api_call. Preserve Anthropic base_url across all client rebuild paths (interrupt, fallback, 401 refresh). 2. Discord VC synthetic events now use chat_type="channel" instead of defaulting to "dm" — prevents session bleed into DM context. Authorization runs before echoing transcript. Sanitize @everyone/@here in voice transcripts. 3. CLI voice prefix ("[Voice input...]") is now API-call-local only — stripped from returned history so it never persists to session DB or resumed sessions. 4. /voice off now disables base adapter auto-TTS via _auto_tts_disabled_chats set — voice input no longer triggers TTS when voice mode is off.	2026-03-14 14:27:21 +03:00
0xbyt4	35748a2fb0	fix: address PR review round 4 — remove web UI, fix audio/import/interface issues Remove web UI gateway (web.py, tests, docs, toolset, env vars, Platform.WEB enum) per maintainer request — Nous is building their own official chat UI. Fix 1: Replace sd.wait() with polling pattern in play_audio_file() to prevent indefinite hang when audio device stalls (consistent with play_beep()). Fix 2: Use importlib.util.find_spec() for faster_whisper/openai availability checks instead of module-level imports that trigger heavy native library loading (CUDA/cuDNN) at import time. Fix 3: Remove inspect.signature() hack in _send_voice_reply() — add **kwargs to Telegram send_voice() so all adapters accept metadata uniformly. Fix 4: Make session loading resilient to removed platform enum values — skip entries with unknown platforms instead of crashing the entire gateway.	2026-03-14 14:27:21 +03:00
0xbyt4	1ad5e0ed15	feat: add voice channel awareness — inject participant and speaking state into agent context	2026-03-14 14:27:21 +03:00
0xbyt4	49f3f0fc62	fix: add choices/Choice to discord mock for /voice slash command test	2026-03-14 14:27:21 +03:00
0xbyt4	e3126aeb40	fix: STT consistency — web.py model param, error matching, local provider key - web.py: pass stt_model from config like discord.py and run.py do - run.py: match new error messages (No STT provider / not set) - _transcribe_local: add missing "provider": "local" to return dict	2026-03-14 14:27:21 +03:00
0xbyt4	41162e0aca	fix: prevent shutdown deadlock and unblockable Ctrl+C on exit Move stream close outside the lock in shutdown() to prevent deadlock when audio callback tries to acquire the same lock. Replace single t.join(timeout) with a polling loop (0.1s intervals) so KeyboardInterrupt is not blocked during stream cleanup.	2026-03-14 14:27:21 +03:00
0xbyt4	69cb373864	fix: update /voice status to show correct STT provider Voice status was hardcoded to check API keys only. Now uses the actual provider resolution (local/groq/openai) so it correctly shows "local faster-whisper" when installed instead of "Groq" or "MISSING".	2026-03-14 14:27:21 +03:00
0xbyt4	eb052b1b42	fix: add explicit metadata param to Discord send_voice signature	2026-03-14 14:27:21 +03:00
0xbyt4	b8f8d3ef9e	feat: integrate faster-whisper local STT with three-provider fallback Merge main's faster-whisper (local, free) with our Groq support into a unified three-provider STT pipeline: local > groq > openai. Provider priority ensures free options are tried first. Each provider has its own transcriber function with model auto-correction, env- overridable endpoints, and proper error handling. 74 tests cover the full provider matrix, fallback chains, model correction, config loading, validation edge cases, and dispatch.	2026-03-14 14:27:21 +03:00
0xbyt4	c433c89d7d	fix: demote RTP debug logs to DEBUG and isolate web sessions - Change RTP packet logging from INFO to DEBUG level to reduce noise (SPEAKING events remain at INFO as they are important lifecycle events) - Use per-session chat_id (web_{session_id}) instead of shared "web" to isolate conversation context between simultaneous web users	2026-03-14 14:27:21 +03:00
0xbyt4	fa2c825e2f	fix: isolate WEB_UI_HOST env var in test and handle empty string - Patch WEB_UI_HOST in test_web_defaults to avoid env leak - Handle empty WEB_UI_HOST string in config (fall back to 127.0.0.1)	2026-03-14 14:27:21 +03:00
0xbyt4	5b47b87c42	fix: show only reachable URLs in Web UI startup message When bound to 127.0.0.1, only show localhost URL instead of listing unreachable network interfaces. Add hint about WEB_UI_HOST=0.0.0.0 for phone/tablet access. Add VPN/multi-interface and token exposure tests (11 new tests).	2026-03-14 14:27:21 +03:00
0xbyt4	a21f518c0b	fix: hide configured token value in Web UI startup log Only print the access token when auto-generated (user needs it to log in). When set via WEB_UI_TOKEN env var, just confirm it is set without exposing the value in console output.	2026-03-14 14:27:21 +03:00
0xbyt4	44abe852fb	fix: add macOS Homebrew Opus fallback and fix shutdown dict iteration - Add Homebrew library path fallback when ctypes.util.find_library fails on macOS (Apple Silicon + Intel paths, guarded by platform check) - Fix RuntimeError in gateway stop() by iterating over dict copy - Update Opus tests to verify find_library-first + conditional fallback	2026-03-14 14:27:21 +03:00
0xbyt4	c797314fcf	test: add security and hardening tests for voice mode fixes - Path traversal sanitization (Path.name strips ../) - Media endpoint authentication (401 without token, 404 on traversal) - hmac.compare_digest usage verification (no == for tokens) - DOMPurify XSS prevention in HTML template - Default bind 127.0.0.1 (adapter and config) - /remote-control token hiding in group chats - Opus find_library instead of hardcoded paths - Opus decode error logging (no silent swallow) - Interrupt _vprint force=True on all 6 calls - Anthropic interrupt handler in both API call paths - Update test_web_defaults for new 127.0.0.1 default	2026-03-14 14:27:21 +03:00
0xbyt4	0ff1b4ade2	fix: harden web gateway security and fix error swallowing - Use hmac.compare_digest for timing-safe token comparison (3 endpoints) - Default bind to 127.0.0.1 instead of 0.0.0.0 - Sanitize upload filenames with Path.name to prevent path traversal - Add DOMPurify to sanitize marked.parse() output against XSS - Replace add_static with authenticated media handler - Hide token in group chats for /remote-control command - Use ctypes.util.find_library for Opus instead of hardcoded paths - Add force=True to 5 interrupt _vprint calls for visibility - Log Opus decode errors and voice restart failures instead of swallowing	2026-03-14 14:27:21 +03:00
0xbyt4	d646442692	fix: restore Anthropic interrupt handler in _interruptible_api_call Rebase auto-merge silently overwrote main's Anthropic-aware interrupt handler with the older OpenAI-only version. Without this fix, interrupting an Anthropic API call closes the wrong client and leaves token generation running on the Anthropic side.	2026-03-14 14:27:21 +03:00
0xbyt4	0a8985acf9	fix: add missing load_config import in _show_voice_status	2026-03-14 14:27:21 +03:00
0xbyt4	2c84979d77	refactor: extract get_stt_model_from_config helper to eliminate DRY violation Duplicated YAML config parsing for stt.model existed in gateway/run.py and gateway/platforms/discord.py. Moved to a single helper in transcription_tools.py and added 5 tests covering all edge cases.	2026-03-14 14:27:21 +03:00
0xbyt4	3260413cc7	docs: add STT override env vars to .env.example	2026-03-14 14:27:20 +03:00
0xbyt4	238a431545	fix: make STT config env-overridable and fix doc issues Code fixes: - STT model, Groq base URL, and OpenAI STT base URL are now configurable via env vars (STT_GROQ_MODEL, STT_OPENAI_MODEL, GROQ_BASE_URL, STT_OPENAI_BASE_URL) instead of hardcoded - Gateway and Discord VC now read stt.model from config.yaml (previously only CLI did this — gateway always used defaults) Doc fixes: - voice-mode.md: move Web UI troubleshooting to web.md (was duplicated) - voice-mode.md: simplify "How It Works" for end users (remove NaCl, DAVE, RTP internals) - voice-mode.md: clarify STT priority (OpenAI used first if both keys set, Groq recommended for free tier) - voice-mode.md: document new STT env overrides in config reference - web.md: remove duplicate Quick Start / Step 1-3 sections - web.md: add mobile HTTPS mic workarounds (moved from voice-mode.md) - web.md: clarify STT fallback order	2026-03-14 14:27:20 +03:00

1 2 3 4 5 ...

1720 commits