mirror of
https://github.com/NousResearch/hermes-agent.git
synced 2026-04-25 00:51:20 +00:00
feat: add voice mode with push-to-talk and TTS output for CLI
Implements Issue #314 Phase 2 & 3: - /voice command to toggle voice mode (on/off/tts/status) - Ctrl+Space push-to-talk recording via sounddevice - Whisper STT transcription via existing transcription_tools - Optional TTS response playback via existing tts_tool - Visual indicators in prompt (recording/transcribing/voice) - 21 unit tests, all mocked (no real mic/API) - Optional deps: sounddevice, numpy (pip install hermes-agent[voice])
This commit is contained in:
parent
cf3dceafe1
commit
1a6fbef8a9
6 changed files with 977 additions and 1 deletions
|
|
@ -202,6 +202,12 @@ DEFAULT_CONFIG = {
|
|||
"model": "whisper-1", # whisper-1, gpt-4o-mini-transcribe, gpt-4o-transcribe
|
||||
},
|
||||
},
|
||||
|
||||
"voice": {
|
||||
"record_key": "ctrl+space",
|
||||
"max_recording_seconds": 120,
|
||||
"auto_tts": False,
|
||||
},
|
||||
|
||||
"human_delay": {
|
||||
"mode": "off",
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue