hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-26 01:01:40 +00:00

Author	SHA1	Message	Date
0xbyt4	b859dfab16	fix: address voice mode review feedback 1. Fully lazy imports: sounddevice, numpy, elevenlabs, edge_tts, and openai are never imported at module level. Each is imported only when the feature is explicitly activated, preventing crashes in headless environments (SSH, Docker, WSL, no PortAudio). 2. No core agent loop changes: streaming TTS path extracted from _interruptible_api_call() into separate _streaming_api_call() method. The original method is restored to its upstream form. 3. Configurable key binding: push-to-talk key changed from Ctrl+R (conflicts with readline reverse-search) to Ctrl+B by default. Configurable via voice.push_to_talk_key in config.yaml. 4. Environment detection: new detect_audio_environment() function checks for SSH, Docker, WSL, and missing audio devices before enabling voice mode. Auto-disables with clear warnings in incompatible environments. 5. Graceful degradation: every audio touchpoint (sd.play, sd.InputStream, sd.OutputStream) wrapped in try/except with ImportError/OSError handling. Failures produce warnings, not crashes.	2026-03-14 14:27:20 +03:00
0xbyt4	143cc68946	fix(test): add /voice to EXPECTED_COMMANDS set in test_commands.py	2026-03-14 14:27:20 +03:00
0xbyt4	46db7aeffd	fix: streaming tool call parsing, error handling, and fake HA state mutation - Fix Gemini streaming tool call merge bug: multiple tool calls with same index but different IDs are now parsed as separate calls instead of concatenating names (e.g. ha_call_serviceha_call_service) - Handle partial results in voice mode: show error and stop continuous mode when agent returns partial/failed results with empty response - Fix error display during streaming TTS: error messages are shown in full response box even when streaming box was already opened - Add duplicate sentence filter in TTS: skip near-duplicate sentences from LLM repetition - Fix fake HA server state mutation: turn_on/turn_off/set_temperature correctly update entity states; temperature sensor simulates change when thermostat is adjusted	2026-03-14 14:27:20 +03:00
0xbyt4	dad865e920	fix: fix silence detection bugs and add Phase 4 voice mode features Fix 3 critical bugs in silence detection: - Micro-pause tolerance now tracks dip duration (not time since speech start) - Peak RMS check in stop() prevents discarding recordings with real speech - Reduced min_speech_duration from 0.5s to 0.3s for reliable speech confirmation Phase 4 features: configurable silence params, visual audio level indicator, voice system prompt, tool call audio cues, TTS interrupt, continuous mode auto-restart, interruptable playback via Popen tracking.	2026-03-14 14:26:30 +03:00
0xbyt4	32b033c11c	feat: add silence filter, hallucination guard, and continuous mode control - Skip silent recordings before STT call (RMS check in AudioRecorder.stop) - Filter known Whisper hallucinations ("Thank you.", "Bye." etc.) - Continuous mode: Ctrl+R starts loop, Ctrl+R during recording exits it - Wait for TTS to finish before auto-restart to avoid recording speaker - Silence timeout increased to 3s for natural pauses - Tests: hallucination filter, silent recording skip, real speech passthrough	2026-03-14 14:25:28 +03:00
0xbyt4	bfd9c97705	feat: add Phase 4 low-latency features for voice mode - Audio cues: beep on record start (880Hz), double beep on stop (660Hz) - Silence detection: auto-stop recording after 3s of silence (RMS-based) - Continuous mode: auto-restart recording after agent responds - Ctrl+R starts continuous mode, Ctrl+R during recording exits it - Waits for TTS to finish before restarting to avoid recording speaker - Tests: 7 new tests for beep generation and silence detection	2026-03-14 14:25:28 +03:00
0xbyt4	a69bd55b5a	fix: isolate GROQ_API_KEY in test_missing_stt_key test The test was failing because GROQ_API_KEY leaked from the environment. Now both VOICE_TOOLS_OPENAI_KEY and GROQ_API_KEY are removed to properly test the "no STT key" scenario.	2026-03-14 14:25:28 +03:00
0xbyt4	c23928d089	fix: improve voice mode robustness and add integration tests - Show TTS errors to user instead of silently logging - Improve markdown stripping: code blocks, URLs, links, horizontal rules - Fix stripping order: process markdown links before removing URLs - Add threading.Lock for voice state variables (cross-thread safety) - Add 14 CLI integration tests (markdown stripping, command parsing, thread safety) - Total: 47 voice-related tests	2026-03-14 14:25:28 +03:00
0xbyt4	37b01ab964	test: add transcription_tools tests for multi-provider STT - Provider resolution: OpenAI priority, Groq fallback, no keys - Model auto-correction: Groq corrects OpenAI models and vice versa - Success path: transcription, API errors, whitespace stripping - 12 new tests, 33 total voice-related tests	2026-03-14 14:25:28 +03:00
0xbyt4	1a6fbef8a9	feat: add voice mode with push-to-talk and TTS output for CLI Implements Issue #314 Phase 2 & 3: - /voice command to toggle voice mode (on/off/tts/status) - Ctrl+Space push-to-talk recording via sounddevice - Whisper STT transcription via existing transcription_tools - Optional TTS response playback via existing tts_tool - Visual indicators in prompt (recording/transcribing/voice) - 21 unit tests, all mocked (no real mic/API) - Optional deps: sounddevice, numpy (pip install hermes-agent[voice])	2026-03-14 14:25:28 +03:00
Teknium	1a857123b3	feat(skills): add optional telephony skill with Twilio, SMS, and AI calls (#1289 ) * feat: improve context compaction handoff summaries Adapt PR #916 onto current main by replacing the old context summary marker with a clearer handoff wrapper, updating the summarization prompt for resume-oriented summaries, and preserving the current call_llm-based compression path. * fix: clearer error when docker backend is unavailable * fix: preserve docker discovery in backend preflight Follow up on salvaged PR #940 by reusing find_docker() during the new availability check so non-PATH Docker Desktop installs still work. Add a regression test covering the resolved executable path. * test: make gateway async tests xdist-safe Replace sync test usage of asyncio.get_event_loop().run_until_complete() with asyncio.run() so tests do not depend on an ambient current event loop. Also create the email disconnect poll task inside a running loop. This fixes xdist/CI failures where workers have no current loop in MainThread. * feat(skills): add phone-calls skill for outbound AI voice calls Reformulated from core tool (PR #847 feedback) into a skill with a standalone helper script. No new dependencies — uses only Python stdlib. Two providers supported: - Bland.ai (default): simple setup, one API key - Vapi: flexible, better voice quality via ElevenLabs/Deepgram + Twilio Includes: - SKILL.md with full procedure, safety rules, provider docs, pitfalls - scripts/phone_call.py CLI helper (call, status, diagnose commands) * feat(skills): expand phone-calls into optional telephony skill Follow up on salvaged PR #965 by moving the capability into optional-skills and broadening it from outbound AI calling to a full telephony skill. Add Twilio number provisioning, env/state persistence, SMS/MMS, inbound SMS polling, Vapi import helpers, and a provider decision tree while keeping telephony out of core runtime code. * docs(skills): clarify Hermes TTS telephony workflow --------- Co-authored-by: aydnOktay <xaydinoktay@gmail.com> Co-authored-by: mormio <morganemoss@gmai.com>	2026-03-14 04:16:48 -07:00
Teknium	02752c83b4	Merge pull request #1287 from NousResearch/hermes/hermes-cc060dd9 fix(gateway): avoid slash-command crash with GatewayConfig	2026-03-14 04:13:56 -07:00
Teknium	a48ebc68f4	Merge pull request #1288 from NousResearch/hermes/hermes-de3d4e49-pr976 fix: reliably notify gateway users when updates finish	2026-03-14 04:13:13 -07:00
teknium1	5c9a84219d	fix: complete send_message MEDIA delivery salvage - prevent raw MEDIA tag leakage outside the gateway pipeline - make extract_media handle quoted/backticked paths and optional whitespace - send Telegram media natively with explicit error/warning handling - add regression tests for Telegram media dispatch and MEDIA parsing	2026-03-14 04:02:03 -07:00
clabbe-bot	3126c60885	fix: notify gateway users when updates finish or fail	2026-03-14 03:59:05 -07:00
Teknium	cac238c2a3	Merge pull request #1286 from NousResearch/hermes/hermes-315847fd fix(patch): avoid corrupting pipe chars in v4a patch apply	2026-03-14 03:58:27 -07:00
teknium1	7e52e8eb54	fix(gateway): bridge quick commands into GatewayConfig runtime Follow-up on salvaged PR #975. Bridge quick_commands from config.yaml into load_gateway_config(), normalize non-dict quick command config at runtime, and add coverage for GatewayConfig round-trips plus config.yaml bridging. This makes the GatewayConfig quick-command fix complete for the real user-facing config path implicated by issue #973.	2026-03-14 03:57:25 -07:00
teknium1	96c250e538	test: cover pipe characters in v4a patch apply Add a regression test for apply_v4a_operations when read content contains a literal pipe character outside a line-number prefix.	2026-03-14 03:54:46 -07:00
stablegenius49	ce56b45514	fix(gateway): support quick commands from GatewayConfig	2026-03-14 03:51:28 -07:00
Teknium	cf3dceafe1	Merge pull request #1284 from NousResearch/hermes/hermes-de3d4e49-pr964 fix: show effective model and provider in status	2026-03-14 03:42:16 -07:00
teknium1	b5a7e807d0	test: cover provider label formatting	2026-03-14 03:39:12 -07:00
luisv-1	c2c37ef158	Show configured model and provider in status output Made-with: Cursor	2026-03-14 03:35:37 -07:00
Teknium	2f8dbe4e77	Merge pull request #1283 from NousResearch/hermes/hermes-f48b210a fix(setup): salvage keep-current provider handling from #951	2026-03-14 03:26:44 -07:00
Teknium	95d49401ee	Merge pull request #1282 from NousResearch/hermes/hermes-cc060dd9 fix(cli): make TUI prompt and accent output skin-aware	2026-03-14 03:24:24 -07:00
StefanIsMe	26f8b790c9	fix(setup): persist provider when switching model endpoints	2026-03-14 03:21:46 -07:00
Teknium	7901d863dd	Merge pull request #1280 from NousResearch/hermes/hermes-de3d4e49-pr944 fix: make session log writes reuse shared atomic JSON helper	2026-03-14 03:15:52 -07:00
teknium1	e9a7441c9b	test: restore default event loop for sync tests	2026-03-14 03:14:34 -07:00
Wayne	41f22de20f	fix(cli): make TUI prompt and accent output skin-aware Salvaged from PR #932 by Wayne onto current main. Apply skin-aware prompt symbols and live prompt_toolkit color refresh, replace lingering hardcoded accent output with active-skin colors, keep ANSI-safe response rendering, preserve secret-capture and approval-prompt state handling, and add integration coverage for prompt state and style refresh behavior.	2026-03-14 03:12:52 -07:00
Teknium	b91cac7b4b	test: make gateway async tests xdist-safe (#1281 ) * feat: improve context compaction handoff summaries Adapt PR #916 onto current main by replacing the old context summary marker with a clearer handoff wrapper, updating the summarization prompt for resume-oriented summaries, and preserving the current call_llm-based compression path. * fix: clearer error when docker backend is unavailable * fix: preserve docker discovery in backend preflight Follow up on salvaged PR #940 by reusing find_docker() during the new availability check so non-PATH Docker Desktop installs still work. Add a regression test covering the resolved executable path. * test: make gateway async tests xdist-safe Replace sync test usage of asyncio.get_event_loop().run_until_complete() with asyncio.run() so tests do not depend on an ambient current event loop. Also create the email disconnect poll task inside a running loop. This fixes xdist/CI failures where workers have no current loop in MainThread. --------- Co-authored-by: aydnOktay <xaydinoktay@gmail.com>	2026-03-14 03:12:15 -07:00
kshitij	0bb7ed1d95	refactor: salvage adapter and CLI cleanup from PR #939 Salvaged from PR #939 by kshitij. - deduplicate Discord slash command dispatch and local file send helpers - deduplicate Slack file uploads while preserving thread metadata - extract shared CLI session relative-time formatting - hoist browser PATH cleanup constants and throttle screenshot pruning - tidy small type and import cleanups	2026-03-14 03:07:11 -07:00
Teknium	f279bb004f	Merge pull request #1278 from NousResearch/hermes/hermes-f48b210a test: fix gateway async tests without implicit event loop	2026-03-14 02:57:47 -07:00
teknium1	cbbba87099	fix: reuse shared atomic session log helper	2026-03-14 02:56:13 -07:00
Teknium	6036793f60	fix: clearer docker backend preflight errors (#1276 ) * feat: improve context compaction handoff summaries Adapt PR #916 onto current main by replacing the old context summary marker with a clearer handoff wrapper, updating the summarization prompt for resume-oriented summaries, and preserving the current call_llm-based compression path. * fix: clearer error when docker backend is unavailable * fix: preserve docker discovery in backend preflight Follow up on salvaged PR #940 by reusing find_docker() during the new availability check so non-PATH Docker Desktop installs still work. Add a regression test covering the resolved executable path. --------- Co-authored-by: aydnOktay <xaydinoktay@gmail.com>	2026-03-14 02:53:02 -07:00
teknium1	115dd17b3c	test: fix gateway async test event loop usage Use asyncio.run in sync tests that were relying on an implicit current event loop. This makes the gateway send-image and Slack connect tests pass reliably under Python 3.11+ and xdist workers.	2026-03-14 02:52:47 -07:00
Teknium	486cb772b8	Merge pull request #1275 from NousResearch/hermes/hermes-f48b210a feat(gateway): salvage reasoning hot reload from #938	2026-03-14 02:47:11 -07:00
Teknium	11e6775f98	Merge pull request #1274 from NousResearch/hermes/hermes-de3d4e49-pr920 fix: handle headless setup flows end-to-end	2026-03-14 02:45:19 -07:00
Verne	52ba940c9b	feat(gateway): add reasoning hot reload Add a /reasoning command across gateway adapters so users can inspect or change reasoning effort without editing config by hand. Reload reasoning settings from config.yaml before each agent run, including background tasks, so the next message picks up the new value consistently.	2026-03-14 02:42:47 -07:00
teknium1	9492f42aa7	fix: cover headless first-run setup flow	2026-03-14 02:37:29 -07:00
Teknium	5c479eedf1	feat: improve context compaction handoff summaries (#1273 ) Adapt PR #916 onto current main by replacing the old context summary marker with a clearer handoff wrapper, updating the summarization prompt for resume-oriented summaries, and preserving the current call_llm-based compression path.	2026-03-14 02:33:31 -07:00
teyrebaz33	4aa94ae7cc	fix: detect non-interactive TTY in setup wizard to prevent hang hermes setup hung indefinitely on headless SSH sessions, Docker containers, and CI/CD environments because the interactive provider selection menu could not receive input. Two-layer fix: 1. sys.stdin.isatty() check — auto-detects non-interactive environments 2. --non-interactive flag support — already in CLI parser, now honored In both cases the wizard exits immediately with helpful guidance pointing users to 'hermes config set' commands. Closes #905	2026-03-14 02:32:46 -07:00
teknium1	1e23d14568	fix: log prompt builder skill parsing fallbacks	2026-03-14 02:22:17 -07:00
Teknium	1117a21065	Merge pull request #1271 from NousResearch/hermes/hermes-de3d4e49 fix: guard init-time stdio writes	2026-03-14 02:21:39 -07:00
teknium1	936040d8f7	fix: guard init-time stdio writes	2026-03-14 02:19:46 -07:00
Teknium	1869e88169	Merge pull request #1256 from NousResearch/hermes/hermes-720acdad feat(security): add tirith pre-exec command scanning	2026-03-14 00:24:56 -07:00
teknium1	6f1889b0fa	fix: preserve current approval semantics for tirith guard Restore gateway/run.py to current main behavior while keeping tirith startup and pattern_keys replay, preserve yolo and non-interactive bypass semantics in the combined guard, and add regression tests for yolo and view-full flows.	2026-03-14 00:17:04 -07:00
teknium1	f5cf1f8a45	fix(cron): tag persisted cron sessions and test wiring - store cron-run sessions with source=cron instead of falling back to cli - close the per-run SessionDB after completion - add regression coverage for cron session_db/platform wiring	2026-03-14 00:12:34 -07:00
sheeki003	375ce8a881	feat(security): add tirith pre-exec command scanning Integrate tirith as a pre-execution security scanner that detects homograph URLs, pipe-to-interpreter patterns, terminal injection, zero-width Unicode, and environment variable manipulation — threats the existing 50-pattern dangerous command detector doesn't cover. Architecture: gather-then-decide — both tirith and the dangerous command detector run before any approval prompt, preventing gateway force=True replay from bypassing one check when only the other was shown to the user. New files: - tools/tirith_security.py: subprocess wrapper with auto-installer, mandatory cosign provenance verification, non-blocking background download, disk-persistent failure markers with retryable-cause tracking (cosign_missing auto-clears when cosign appears on PATH) - tests/tools/test_tirith_security.py: 62 tests covering exit code mapping, fail_open, cosign verification, background install, HERMES_HOME isolation, and failure recovery - tests/tools/test_command_guards.py: 21 integration tests for the combined guard orchestration Modified files: - tools/approval.py: add check_all_command_guards() orchestrator, add allow_permanent parameter to prompt_dangerous_approval() - tools/terminal_tool.py: replace _check_dangerous_command with consolidated check_all_command_guards - cli.py: update _approval_callback for allow_permanent kwarg, call ensure_installed() at startup - gateway/run.py: iterate pattern_keys list on replay approval, call ensure_installed() at startup - hermes_cli/config.py: add security config defaults, split commented sections for independent fallback - cli-config.yaml.example: document tirith security config	2026-03-14 00:11:27 -07:00
Teknium	29176f302e	fix: sanitize chat payloads and provider precedence (#1253 ) fix: sanitize chat payloads and provider precedence	2026-03-14 00:09:14 -07:00
Teknium	25481d4286	feat: restore ACP server implementation from PR #949 (#1254 ) Restore the ACP editor-integration implementation that was present on the original PR branch but did not actually land in main. Includes: - acp_adapter/ server, session manager, event bridge, auth, permissions, and tool helpers - hermes acp subcommand and hermes-acp entry point - hermes-acp curated toolset - ACP registry manifest, setup guide, and ACP test suite - jupyter-live-kernel data science skill from the original branch Also updates the revived ACP code for current main by: - resolving runtime providers through the modern shared provider router - binding ACP sessions to per-session cwd task overrides - tracking duplicate same-name tool calls with FIFO IDs - restoring terminal approval callbacks after prompts - normalizing supporting docs/skill metadata Validated with tests/acp and the full pytest suite (-n0).	2026-03-14 00:09:05 -07:00
Teknium	2fe853bcc9	Merge pull request #1251 from NousResearch/hermes/hermes-f7e92273 fix: prevent logging handler accumulation in gateway mode	2026-03-14 00:04:13 -07:00

1 2 3 4 5 ...

599 commits