hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-26 11:12:03 +00:00

Author	SHA1	Message	Date
teknium1	8ac5e90ec2	fix(gateway): dedup image_generate media across the compression boundary After context compression, the agent re-sent an already-delivered generated image on every subsequent turn (#46627). The auto-append fallback rescans full history when the message list shrinks (compression- safe path), deduping against _history_media_paths — but that set was built by scanning ONLY MEDIA: text tags in tool results. image_generate returns its path in a JSON payload field (host_image/image/agent_visible_image), never a MEDIA: tag, so generated-image paths never entered the dedup set and were re-emitted after the boundary. Extract the history-path collection into _collect_history_media_paths(), which now covers BOTH delivery shapes: MEDIA: text tags AND image_generate JSON-payload paths (mirroring what _collect_auto_append_media_tags extracts). The inline block in _handle_message is replaced with a call to the helper. Co-authored-by: liuhao1024 <sunsky.lau@gmail.com>	2026-06-20 23:20:16 -07:00
Teknium	9351cbafab	fix(gateway): auto-deliver image_generate output as native media (#42616 ) image_generate returns its artifact as JSON ({"image": "/abs/path.png"}) with no MEDIA: tag, so the gateway auto-append path (which only recognized text_to_speech MEDIA: tags) never delivered it — image delivery silently depended on the model restating the path in its reply. Add image_generate to the producer allowlist and extract the local path from its JSON result (host_image > image > agent_visible_image), reusing the existing extension-anchored matcher and history-dedupe so remote URLs, unknown extensions, failures, and already-sent paths are rejected. Closes the remaining unfixed path from #19105.	2026-06-08 22:51:03 -07:00
VinciZhu	521d06975e	fix(gateway): restrict auto-appended media to producer tools	2026-06-01 00:00:26 -07:00
Bartok9	08c0b22417	fix(gateway): scope tool-result MEDIA scan to current turn The post-run scan that appends tool-emitted MEDIA: tags to the final response iterated every tool/function message in the full conversation and relied solely on path-based dedup against paths reconstructed from the replayable transcript. When that reconstruction does not byte-match the in-memory tool content (timestamp stripping, observed-context withholding, compression rewrites), a stale path emitted several turns earlier is absent from the dedup set and leaks onto a later text-only reply (Telegram 'Sending media group of 1 photo(s)' with no MEDIA directive present). Scope the scan to this turn's new messages by slicing result['messages'] at len(agent_history) (agent_history is passed as conversation_history into run_conversation, so the returned list is history + this turn). Retain path-based dedup as a secondary guard and as the sole guard on the compression-shrink fallback, preserving the #160 behaviour. Closes #34608	2026-05-29 13:13:34 -07:00
Bartok9	35655298e6	fix(gateway): prevent TTS voice messages from accumulating across turns Fixes #160 The issue was that MEDIA tags were being extracted from ALL messages in the conversation history, not just messages from the current turn. This caused TTS voice messages generated in earlier turns to be re-attached to every subsequent reply. The fix: - Track history_len before calling run_conversation - Only scan messages AFTER history_len for MEDIA tags - Add comprehensive tests to prevent regression This ensures each voice message is sent exactly once, when it's generated, not on every subsequent message in the session.	2026-02-28 03:38:27 -05:00

5 commits