hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-21 10:22:18 +00:00

History

jasnoorgill da34fca2bb fix(signal): detect ADTS AAC voice notes and remux to MP4 Android Signal delivers voice notes as raw ADTS AAC frames, which share the `0xFF 0xFx` sync word with MPEG-1/2 Layer 3 (MP3). The `_guess_extension` byte-signature test in gateway/platforms/signal.py was matching both, so ADTS AAC was being misclassified as MP3 — saved to disk with the wrong extension and rejected by every major STT API (Groq, OpenAI) because their server-side format sniffers inspect the actual codec, not the file extension. Two changes: 1. Tighten the MP3 vs ADTS disambiguator. ADTS packs `ID`, `layer`, and `protection_absent` into bits 3-0 of byte 1, where `ID=0` and `layer=00` for AAC. Real MP3 has `ID=1` and `layer` in {01, 10, 11}. The mask `0xF6` against target `0xF0` cleanly separates them. 2. Remux raw ADTS AAC to MP4 container at the cache step via `ffmpeg -c:a copy`. Single demux/remux, no re-encode, no quality loss, sub-100ms on a Pi 5. The cached file is a normal `.m4a` that all major STT providers accept. ffmpeg is a transitive dependency of many other Hermes features (TTS, video skills) so this isn't a new install requirement; the remux degrades gracefully to a no-op if ffmpeg is missing. The new helper `_remux_aac_to_m4a` is unit-tested with a real Android voice note from the audio cache that originally triggered the bug, plus synthetic ADTS frames for the byte-level disambiguator and garbage-input graceful failure. Closes the gap that broke transcription for any Android Signal user sending voice messages to Hermes.		2026-06-20 13:48:05 +05:30
..
assets	fix: improve telegram topic mode setup	2026-05-04 12:07:17 -07:00
builtin_hooks	remove: BOOT.md built-in hook (#17093 )	2026-04-28 09:50:27 -07:00
platforms	fix(signal): detect ADTS AAC voice notes and remux to MP4	2026-06-20 13:48:05 +05:30
relay	fix(relay): make hosted gateways actually connect AND complete the inbound/outbound round-trip (#48828 )	2026-06-19 16:30:24 +10:00
__init__.py	docs(gateway): mention Weixin in gateway help and docstrings	2026-05-12 17:08:51 -07:00
authz_mixin.py	fix(gateway): preserve WeCom per-group sender allowlists	2026-06-13 07:18:54 -07:00
channel_directory.py	fix: harden WhatsApp target alias salvage	2026-06-15 05:51:47 -07:00
config.py	fix(managed-scope): honor managed scope in all standalone config loaders	2026-06-19 07:46:33 -07:00
delivery.py	fix(gateway): drop outbound silence-narration messages pre-send	2026-05-29 19:06:05 -07:00
display_config.py	feat(gateway): rename to tool_progress_grouping, add config/docs/tests	2026-06-16 05:49:24 -07:00
hooks.py	feat(hooks): expose thread_id and chat_type in agent:start/end context (#41672 )	2026-06-07 19:16:36 -07:00
kanban_watchers.py	fix: open dispatcher lock file with explicit utf-8 encoding	2026-06-19 07:35:33 -07:00
memory_monitor.py	Port from cline/cline#10343: periodic gateway memory logging (#27102 )	2026-05-16 12:55:23 -07:00
message_timestamps.py	feat(gateway): inject stable human-readable message timestamps	2026-06-16 15:49:59 -07:00
mirror.py	refactor(gateway): drop _append_to_jsonl from mirror	2026-05-20 13:00:57 -07:00
pairing.py	fix(gateway): preserve WhatsApp pairing approvals across JID/LID alias flips	2026-05-23 01:46:34 -07:00
platform_registry.py	refactor(plugins): add apply_yaml_config_fn registry hook	2026-05-13 22:20:30 -07:00
response_filters.py	fix(gateway): suppress exact silence tokens without mutating history	2026-06-14 03:25:08 -07:00
restart.py	fix(gateway): address restart review feedback	2026-04-10 21:18:34 -07:00
rich_sent_store.py	fix(telegram): resolve replies to rich (sendRichMessage) messages	2026-06-16 13:04:20 -07:00
run.py	fix(gateway): break the restart loop at the source on session resume	2026-06-19 16:59:58 -07:00
runtime_footer.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
session.py	feat(gateway): multiplex phase 1 — HTTP-inbound /p/<profile>/ routing (webhook)	2026-06-19 07:34:15 -07:00
session_context.py	fix(dashboard): hide sidecar sessions from history (#49269 )	2026-06-19 18:06:38 -04:00
shutdown_forensics.py	chore: ruff auto-fixes — collapsible-else-if, if-stmt-min-max, dict.fromkeys (#23926 )	2026-05-11 11:03:29 -07:00
slash_access.py	feat(gateway): per-platform admin/user split for slash commands (salvage of #4443 ) (#23373 )	2026-05-10 12:33:54 -07:00
slash_commands.py	fix(model): clear stale endpoint credentials across switches	2026-06-19 19:58:26 -07:00
status.py	feat(gateway): multiplex phase 4 — lifecycle guard + per-profile observability	2026-06-19 07:34:15 -07:00
sticker_cache.py	fix: guard yaml.safe_load, flock unlock, TOCTOU races, and atomic writes	2026-05-19 00:12:41 -07:00
stream_consumer.py	fix(mattermost): harden delivery hygiene	2026-06-16 06:34:54 -07:00
stream_dispatch.py	feat(gateway): structured stream-event protocol + Telegram draft formatting parity (#37250 )	2026-06-02 00:33:50 -07:00
stream_events.py	feat(gateway): structured stream-event protocol + Telegram draft formatting parity (#37250 )	2026-06-02 00:33:50 -07:00
whatsapp_identity.py	fix(whatsapp_identity): pin identifier regex to ASCII, clarify it's defense-in-depth	2026-04-26 20:48:31 -07:00