hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-25 17:18:11 +00:00

History

Teknium 8a9ded5b21 feat(discord): voice-channel mixer — ambient idle bed + verbal acks that overlap TTS (#39659 ) * feat(discord): voice-channel mixer — ambient idle bed + verbal acks that overlap TTS Discord voice mode can now feel conversational: the bot speaks a short acknowledgement before it starts working, and a subtle ambient 'thinking' bed plays underneath while tools run, ducking under speech and swelling back — the Grok-voice-mode feel. discord.py plays only one audio stream per voice connection, so this adds a software mixer (VoiceMixer, a discord.AudioSource) installed once per guild on join. It sums an ambient loop, verbal acks, and TTS replies into that single 20ms/48kHz/stereo stream (numpy int16 add + clip), so they overlap instead of stop-and-swap. Speech ducks the ambient gain down and releases it smoothly. - plugins/platforms/discord/voice_mixer.py: VoiceMixer + MixerChild (gain, loop, fade, duck/release), decode_to_pcm (ffmpeg), synth_ambient_pcm (no asset needed — synthesised pad). - adapter: install mixer on join, tear down on leave, route play_in_voice_channel through the mixer (legacy one-shot path kept as fallback), play_ack_in_voice, voice_mixer_active. Defensive getattr for the object.__new__ test helpers. - gateway/run.py: tool_start_callback fires a one-time verbal ack on the first tool call of a turn when in a voice channel (independent of the text tool-progress gate). No system-prompt or message-flow changes. - config: discord.voice_fx.* (OFF by default; ambient/duck/speech gains, ack phrases). All in config.yaml, not .env. - docs + tests (mixer unit + adapter integration). Verified: 19 new tests pass, existing voice suite green (2 pre-existing davey-module env failures unchanged), and a real-mixer E2E confirms ambient streams, TTS overlaps it, acks layer in, and teardown is clean. * fix(discord): make voice mixer numpy import lazy (numpy is voice-extra-only) numpy ships in the optional 'voice' extra, not [all,dev], so a module-level 'import numpy' broke CI test collection (and would break the always-imported Discord adapter on any install without the voice extra). Defer numpy to the functions that actually mix audio via _require_numpy(); guard the test module with pytest.importorskip('numpy').		2026-06-05 03:10:40 -07:00
..
_category_.json	feat: add documentation website (Docusaurus)	2026-03-05 05:24:55 -08:00
bluebubbles.md	feat(bluebubbles): support group mention gating	2026-06-01 18:52:05 -07:00
dingtalk.md	docs: resync reference, user-guide, developer-guide, and messaging pages against code (#17738 )	2026-04-29 20:55:59 -07:00
discord.md	feat(discord): voice-channel mixer — ambient idle bed + verbal acks that overlap TTS (#39659 )	2026-06-05 03:10:40 -07:00
email.md	docs(email): clarify gateway vs Himalaya setup	2026-05-28 05:42:09 -07:00
feishu.md	feat(gateway): handle Feishu meeting invitations	2026-06-04 06:15:23 -07:00
google_chat.md	revert: keep Google Chat OAuth secret + active_provider profile-scoped (#39398 )	2026-06-04 16:54:40 -07:00
homeassistant.md	docs: 30-day overhaul — correctness audit, PR coverage, Nous Portal weave, sidebar reorg (#33782 )	2026-05-28 02:41:36 -07:00
index.md	docs: 30-day overhaul — correctness audit, PR coverage, Nous Portal weave, sidebar reorg (#33782 )	2026-05-28 02:41:36 -07:00
line.md	docs: 30-day overhaul — correctness audit, PR coverage, Nous Portal weave, sidebar reorg (#33782 )	2026-05-28 02:41:36 -07:00
matrix.md	feat(matrix): support bang command aliases	2026-06-03 17:19:27 +05:30
mattermost.md	docs: comprehensive 2-week sweep of feature/PR coverage gaps (#28497 )	2026-05-18 23:55:25 -07:00
msgraph-webhook.md	docs: 30-day overhaul — correctness audit, PR coverage, Nous Portal weave, sidebar reorg (#33782 )	2026-05-28 02:41:36 -07:00
ntfy.md	docs: 30-day overhaul — correctness audit, PR coverage, Nous Portal weave, sidebar reorg (#33782 )	2026-05-28 02:41:36 -07:00
open-webui.md	fix(website): cross-locale doc links + drop empty ko locale (#31895 )	2026-05-24 23:16:20 -07:00
qqbot.md	docs: round 2 audit — messaging, developer-guide, guides, integrations (#22858 )	2026-05-09 15:00:24 -07:00
signal.md	feat(gateway/signal): add support for multiple images sending	2026-04-30 04:28:08 -07:00
simplex.md	Correct URL format for simplex-chat download	2026-05-29 12:06:22 -07:00
slack.md	docs: comprehensive 2-week sweep of feature/PR coverage gaps (#28497 )	2026-05-18 23:55:25 -07:00
sms.md	fix(website): cross-locale doc links + drop empty ko locale (#31895 )	2026-05-24 23:16:20 -07:00
teams-meetings.md	docs: 30-day overhaul — correctness audit, PR coverage, Nous Portal weave, sidebar reorg (#33782 )	2026-05-28 02:41:36 -07:00
teams.md	docs: 30-day overhaul — correctness audit, PR coverage, Nous Portal weave, sidebar reorg (#33782 )	2026-05-28 02:41:36 -07:00
telegram.md	docs: 30-day overhaul — correctness audit, PR coverage, Nous Portal weave, sidebar reorg (#33782 )	2026-05-28 02:41:36 -07:00
webhooks.md	fix(webhook): widen INSECURE_NO_AUTH loopback check + tests + docs	2026-05-07 07:38:43 -07:00
wecom-callback.md	docs: 30-day overhaul — correctness audit, PR coverage, Nous Portal weave, sidebar reorg (#33782 )	2026-05-28 02:41:36 -07:00
wecom.md	docs(wecom): stop implying live streaming and typing support (#38990 )	2026-06-04 05:57:01 -07:00
weixin.md	fix(gateway): config.yaml path for WhatsApp/Weixin text-batch delays	2026-05-30 07:33:15 -07:00
whatsapp.md	fix(gateway): config.yaml path for WhatsApp/Weixin text-batch delays	2026-05-30 07:33:15 -07:00
yuanbao.md	fix(website): cross-locale doc links + drop empty ko locale (#31895 )	2026-05-24 23:16:20 -07:00