hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-26 17:38:36 +00:00

Teknium 8a9ded5b21 feat(discord): voice-channel mixer — ambient idle bed + verbal acks that overlap TTS (#39659)	2026-06-05 03:10:40 -07:00

* feat(discord): voice-channel mixer — ambient idle bed + verbal acks that overlap TTS

  Discord voice mode can now feel conversational: the bot speaks a short acknowledgement before it starts working, and a subtle ambient 'thinking' bed plays underneath while tools run, ducking under speech and swelling back (the Grok-voice-mode feel).

  discord.py plays only one audio stream per voice connection, so this adds a software mixer (VoiceMixer, a discord.AudioSource) installed once per guild on join. It sums an ambient loop, verbal acks, and TTS replies into that single 20ms/48kHz/stereo stream (numpy int16 add + clip), so they overlap instead of stop-and-swap. Speech ducks the ambient gain down and releases it smoothly.

  - plugins/platforms/discord/voice_mixer.py: VoiceMixer + MixerChild (gain, loop, fade, duck/release), decode_to_pcm (ffmpeg), synth_ambient_pcm (no asset needed; the pad is synthesised).
  - adapter: install mixer on join, tear down on leave, route play_in_voice_channel through the mixer (legacy one-shot path kept as fallback), play_ack_in_voice, voice_mixer_active. Defensive getattr for the object.__new__ test helpers.
  - gateway/run.py: tool_start_callback fires a one-time verbal ack on the first tool call of a turn when in a voice channel (independent of the text tool-progress gate). No system-prompt or message-flow changes.
  - config: discord.voice_fx.* (OFF by default; ambient/duck/speech gains, ack phrases). All in config.yaml, not .env.
  - docs + tests (mixer unit + adapter integration).

  Verified: 19 new tests pass, existing voice suite green (2 pre-existing davey-module env failures unchanged), and a real-mixer E2E confirms ambient streams, TTS overlaps it, acks layer in, and teardown is clean.

* fix(discord): make voice mixer numpy import lazy (numpy is voice-extra-only)

  numpy ships in the optional 'voice' extra, not [all,dev], so a module-level 'import numpy' broke CI test collection (and would break the always-imported Discord adapter on any install without the voice extra). Defer numpy to the functions that actually mix audio via _require_numpy(); guard the test module with pytest.importorskip('numpy').
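
The mixing step described in the commit message above (several PCM sources summed into one 20ms/48kHz/stereo frame with an int16 add and clip, plus ambient ducking under speech) can be illustrated roughly as follows. This is a minimal sketch under stated assumptions, not the code in voice_mixer.py: it uses the standard-library array module in place of numpy so it runs without the voice extra, and the names mix_frames and next_ambient_gain, along with every gain value, are hypothetical.

```python
from array import array

SAMPLE_RATE = 48_000
CHANNELS = 2
FRAME_MS = 20
# int16 values in one 20 ms stereo frame: 960 samples per channel * 2 channels
SAMPLES_PER_FRAME = SAMPLE_RATE * FRAME_MS // 1000 * CHANNELS
FRAME_BYTES = SAMPLES_PER_FRAME * 2  # 2 bytes per int16 sample

INT16_MIN, INT16_MAX = -32768, 32767


def mix_frames(sources):
    """Sum (pcm_bytes, gain) pairs into one clipped int16 frame.

    Hypothetical helper: each pcm_bytes holds native-endian signed 16-bit
    samples; sources shorter than a full frame are padded with silence.
    """
    acc = [0.0] * SAMPLES_PER_FRAME
    for pcm, gain in sources:
        samples = array("h")
        samples.frombytes(pcm[:FRAME_BYTES])
        for i, sample in enumerate(samples):
            acc[i] += sample * gain
    # Clip instead of letting the sum wrap around the int16 range.
    clipped = array("h", (max(INT16_MIN, min(INT16_MAX, int(v))) for v in acc))
    return clipped.tobytes()


def next_ambient_gain(current, speech_active, base=0.3, ducked=0.05, step=0.02):
    """Move the ambient gain one step toward its target for the next frame.

    Hypothetical defaults: the gain ramps down to `ducked` while speech
    plays and ramps back up to `base` afterwards, so neither change is abrupt.
    """
    target = ducked if speech_active else base
    if current < target:
        return min(target, current + step)
    return max(target, current - step)
```

Calling next_ambient_gain once per frame and passing the result as the ambient source's gain to mix_frames gives the duck-and-release behaviour in small steps rather than a hard cut. A numpy version would replace the inner loops with an int32 accumulate followed by a clip and a cast back to int16.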
browser	fix(managed-gateway): keep tool availability scans off the Nous token-refresh path	2026-05-30 07:58:08 -07:00
context_engine	feat(context-engine): host contract for external context engines	2026-05-28 01:45:30 -07:00
dashboard_auth	fix(desktop): gate OAuth remote connect on AT-or-RT, not access token alone	2026-06-04 22:18:46 -07:00
disk-cleanup	fix(cron): re-validate stale cron-output entries before deletion (#37721)	2026-06-04 07:52:04 -07:00
google_meet	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
hermes-achievements	fix(dashboard): sanction plugin WS/upload auth via SDK helpers (gated mode)	2026-06-03 16:59:36 -07:00
image_gen	feat(image_gen): add Krea provider plugin (Krea 2 Medium + Large) (#33236)	2026-05-27 11:01:47 -07:00
kanban	fix(kanban-dashboard): use context-local board pin in specify/decompose endpoints	2026-06-04 07:39:53 -07:00
memory	fix(openviking): add missing /agent/{agent}/ segment to memory URI — fixes #36969	2026-06-04 17:40:33 -07:00
model-providers	fix(vision): detect vision-capable custom providers via ProviderProfile flag	2026-06-04 17:53:49 -07:00
observability	feat(observability): observer-grade telemetry hooks + NeMo-Relay plugin	2026-06-03 06:36:46 -07:00
platforms	feat(discord): voice-channel mixer — ambient idle bed + verbal acks that overlap TTS (#39659)	2026-06-05 03:10:40 -07:00
security-guidance	plugins: add security-guidance — pattern-matched warnings on dangerous code writes (#33131)	2026-05-27 02:07:21 -07:00
spotify	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
teams_pipeline	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
video_gen	fix(xai): route video models by modality	2026-06-01 19:00:30 -07:00
web	fix(managed-gateway): keep tool availability scans off the Nous token-refresh path	2026-05-30 07:58:08 -07:00
__init__.py	feat(memory): pluggable memory provider interface with profile isolation, review fixes, and honcho CLI restoration (#4623)	2026-04-02 15:33:51 -07:00