hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-02 12:13:05 +00:00

History

wilsen0 ac95b8cdbe perf(gateway): tune Telegram cadence + adaptive fast-path for short replies Re-authored against current main from PR #10388 by @wilsen0. The original branch is 3800+ commits stale and could not be cherry-picked without reverting unrelated work; this change carries only the perf intent forward. Tuning summary ============== Text-batch ingress (gateway/platforms/telegram.py): - HERMES_TELEGRAM_TEXT_BATCH_DELAY_SECONDS default 0.6 -> 0.3 - HERMES_TELEGRAM_TEXT_BATCH_SPLIT_DELAY_SECONDS default 2.0 -> 1.0 - Adaptive fast-path tiers in _flush_text_batch: total <= 320 cp -> min(cap, 0.18) total <= 1024 cp -> min(cap, 0.24) else -> cap A single short reply now reaches the agent in ~180ms instead of 600ms. Tier constants compose with the configured cap via min() so an operator who tightens HERMES_TELEGRAM_TEXT_BATCH_DELAY_SECONDS below 0.18 still wins on every tier. - _env_float_clamped helper replaces bare float(os.getenv()). Rejects NaN / Inf, applies optional min/max bounds. Used for text-batch + media-batch knobs. Prevents asyncio.sleep(NaN) crashes when an operator typos an env var. Stream cadence (gateway/config.py + stream_consumer.py): - StreamingConfig.edit_interval default 1.0s -> 0.8s - StreamingConfig.buffer_threshold default 40 -> 24 chars - DEFAULT_STREAMING_EDIT_INTERVAL / BUFFER_THRESHOLD / CURSOR are now a single source of truth. StreamConsumerConfig imports them instead of duplicating the literals; the prior dual-source drift is fixed. Tool progress (gateway/display_config.py): - Telegram default tool_progress 'all' -> 'new'. Inside Telegram's ~1 edit/s flood envelope the 'all' default would accumulate edit pressure on busy chats; 'new' shows only the leading bubble per tool batch and feels less spammy. - Slack tier_low override (tool_progress='off') is preserved. Composition with native draft streaming (#23512) ================================================ The mid-stream cadence (edit_interval, buffer_threshold) gates BOTH the draft path (send_draft) and the edit path (edit_message), so the tighter cadence helps native draft as much as edit-based. The text-batch fast-path applies before the consumer starts, so it speeds up the first-token latency on every transport. No conflict. Stale-base avoidance ==================== Re-authored from scratch rather than cherry-picked. Dropped from the original branch: - Unrelated `d2f043f9c` 'fix(anthropic): preserve third-party thinking continuity' commit - boot_md.py builtin gateway hook (unrelated) - Reverted Slack tool_progress='off' (#14663) restoration - Reverted Platform plugin discovery, MSGRAPH_WEBHOOK, YUANBAO members deletion - 2300+ lines of run.py base-skew noise Tests ===== New tests/gateway/test_telegram_text_batch_perf.py: - 7 tests for _env_float_clamped (NaN, Inf, garbage, bounds). - 4 tests for the adaptive-tier composition rules. Updated tests/gateway/test_display_config.py: - test_platform_default_when_no_user_config: 'all' -> 'new' for Telegram, with comment. - test_high_tier_platforms: split into Telegram-overrides-to-new and Discord-stays-all assertions. Closes #10388. Co-authored-by: wilsen0 <132184373+wilsen0@users.noreply.github.com>		2026-05-10 22:22:25 -07:00
..
assets	fix: improve telegram topic mode setup	2026-05-04 12:07:17 -07:00
builtin_hooks	remove: BOOT.md built-in hook (#17093 )	2026-04-28 09:50:27 -07:00
platforms	perf(gateway): tune Telegram cadence + adaptive fast-path for short replies	2026-05-10 22:22:25 -07:00
__init__.py	Enhance CLI with multi-platform messaging integration and configuration management	2026-02-02 19:01:51 -08:00
channel_directory.py	feat: complete plugin platform parity — all 12 integration points	2026-04-29 21:56:51 -07:00
config.py	perf(gateway): tune Telegram cadence + adaptive fast-path for short replies	2026-05-10 22:22:25 -07:00
delivery.py	fix(gateway): preserve case-sensitive chat IDs in DeliveryTarget.parse	2026-05-01 14:01:26 -07:00
display_config.py	perf(gateway): tune Telegram cadence + adaptive fast-path for short replies	2026-05-10 22:22:25 -07:00
hooks.py	fix(plugins): register dynamically-loaded modules in sys.modules before exec	2026-04-29 23:34:35 -07:00
mirror.py	fix(gateway): avoid cross-user mirror writes in per-user group sessions	2026-04-26 18:31:24 -07:00
pairing.py	fix(pairing): enforce lockout on approve_code, not just generate_code (#10195 ) (#21325 )	2026-05-07 07:18:21 -07:00
platform_registry.py	feat(plugins): add standalone_sender_fn for out-of-process cron delivery	2026-05-09 02:56:29 -07:00
restart.py	fix(gateway): address restart review feedback	2026-04-10 21:18:34 -07:00
run.py	fix(security): sanitize env and redact output in quick commands + remove write-only _pending_messages	2026-05-10 22:12:23 -07:00
runtime_footer.py	feat(gateway): opt-in runtime-metadata footer on final replies (#17026 )	2026-04-28 06:50:04 -07:00
session.py	refactor(gateway): simplify auto-resume + extend to crash recovery	2026-05-07 05:05:34 -07:00
session_context.py	fix(cron): run due jobs in parallel to prevent serial tick starvation (#13021 )	2026-04-20 11:53:07 -07:00
shutdown_forensics.py	feat(gateway): shutdown forensics — non-blocking diag, per-phase timing, stale-unit warning (#23285 )	2026-05-10 09:01:51 -07:00
slash_access.py	feat(gateway): per-platform admin/user split for slash commands (salvage of #4443 ) (#23373 )	2026-05-10 12:33:54 -07:00
status.py	fix(gateway): refresh runtime argv metadata	2026-05-09 11:08:23 -07:00
sticker_cache.py	chore: remove ~100 unused imports across 55 files (#3016 )	2026-03-25 15:02:03 -07:00
stream_consumer.py	perf(gateway): tune Telegram cadence + adaptive fast-path for short replies	2026-05-10 22:22:25 -07:00
whatsapp_identity.py	fix(whatsapp_identity): pin identifier regex to ASCII, clarify it's defense-in-depth	2026-04-26 20:48:31 -07:00