hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-04 12:33:08 +00:00

History

wilsen0 ac95b8cdbe perf(gateway): tune Telegram cadence + adaptive fast-path for short replies Re-authored against current main from PR #10388 by @wilsen0. The original branch is 3800+ commits stale and could not be cherry-picked without reverting unrelated work; this change carries only the perf intent forward. Tuning summary ============== Text-batch ingress (gateway/platforms/telegram.py): - HERMES_TELEGRAM_TEXT_BATCH_DELAY_SECONDS default 0.6 -> 0.3 - HERMES_TELEGRAM_TEXT_BATCH_SPLIT_DELAY_SECONDS default 2.0 -> 1.0 - Adaptive fast-path tiers in _flush_text_batch: total <= 320 cp -> min(cap, 0.18) total <= 1024 cp -> min(cap, 0.24) else -> cap A single short reply now reaches the agent in ~180ms instead of 600ms. Tier constants compose with the configured cap via min() so an operator who tightens HERMES_TELEGRAM_TEXT_BATCH_DELAY_SECONDS below 0.18 still wins on every tier. - _env_float_clamped helper replaces bare float(os.getenv()). Rejects NaN / Inf, applies optional min/max bounds. Used for text-batch + media-batch knobs. Prevents asyncio.sleep(NaN) crashes when an operator typos an env var. Stream cadence (gateway/config.py + stream_consumer.py): - StreamingConfig.edit_interval default 1.0s -> 0.8s - StreamingConfig.buffer_threshold default 40 -> 24 chars - DEFAULT_STREAMING_EDIT_INTERVAL / BUFFER_THRESHOLD / CURSOR are now a single source of truth. StreamConsumerConfig imports them instead of duplicating the literals; the prior dual-source drift is fixed. Tool progress (gateway/display_config.py): - Telegram default tool_progress 'all' -> 'new'. Inside Telegram's ~1 edit/s flood envelope the 'all' default would accumulate edit pressure on busy chats; 'new' shows only the leading bubble per tool batch and feels less spammy. - Slack tier_low override (tool_progress='off') is preserved. Composition with native draft streaming (#23512) ================================================ The mid-stream cadence (edit_interval, buffer_threshold) gates BOTH the draft path (send_draft) and the edit path (edit_message), so the tighter cadence helps native draft as much as edit-based. The text-batch fast-path applies before the consumer starts, so it speeds up the first-token latency on every transport. No conflict. Stale-base avoidance ==================== Re-authored from scratch rather than cherry-picked. Dropped from the original branch: - Unrelated `d2f043f9c` 'fix(anthropic): preserve third-party thinking continuity' commit - boot_md.py builtin gateway hook (unrelated) - Reverted Slack tool_progress='off' (#14663) restoration - Reverted Platform plugin discovery, MSGRAPH_WEBHOOK, YUANBAO members deletion - 2300+ lines of run.py base-skew noise Tests ===== New tests/gateway/test_telegram_text_batch_perf.py: - 7 tests for _env_float_clamped (NaN, Inf, garbage, bounds). - 4 tests for the adaptive-tier composition rules. Updated tests/gateway/test_display_config.py: - test_platform_default_when_no_user_config: 'all' -> 'new' for Telegram, with comment. - test_high_tier_platforms: split into Telegram-overrides-to-new and Discord-stays-all assertions. Closes #10388. Co-authored-by: wilsen0 <132184373+wilsen0@users.noreply.github.com>		2026-05-10 22:22:25 -07:00
..
qqbot	feat(qqbot): wire native tool-approval UX via inline keyboards	2026-05-07 07:48:15 -07:00
__init__.py	perf(gateway): defer QQAdapter and YuanbaoAdapter imports via PEP 562 (#22790 )	2026-05-09 13:17:48 -07:00
_http_client_limits.py	fix(gateway): tighten httpx keepalive and close whatsapp typing-response leak (#18451 )	2026-05-02 02:23:37 -07:00
ADDING_A_PLATFORM.md	feat(gateway): add LINE Messaging API platform plugin (#23197 )	2026-05-10 06:40:46 -07:00
api_server.py	fix(api-server): emit length/error finish_reason for truncation/failure (#22775 )	2026-05-09 12:48:08 -07:00
base.py	fix(telegram): split-and-deliver oversized edits instead of silent truncation	2026-05-10 22:02:56 -07:00
bluebubbles.py	fix(gateway): tighten httpx keepalive and close whatsapp typing-response leak (#18451 )	2026-05-02 02:23:37 -07:00
dingtalk.py	fix(dingtalk): align override signatures with base + guard Optional[error] in tests	2026-05-09 11:11:10 -07:00
discord.py	feat(session): make /handoff actually transfer the session live	2026-05-10 13:06:25 -07:00
email.py	fix(email): use real hermes version in IMAP ID command	2026-05-09 13:35:50 -07:00
feishu.py	fix(gateway): stream consumer first message drops thread context	2026-05-10 15:20:40 -07:00
feishu_comment.py	chore: remove unused imports and dead locals (ruff F401, F841) (#17010 )	2026-04-28 06:46:45 -07:00
feishu_comment_rules.py	fix(feishu-comment): use get_hermes_home(); drop dead asyncio wrapper; AUTHOR_MAP	2026-04-17 19:04:11 -07:00
helpers.py	fix(gateway): ensure deterministic thread eviction in helpers	2026-05-05 10:13:55 -07:00
homeassistant.py	fix(gateway): correct ws scheme conversion for https urls	2026-05-03 03:54:03 -07:00
matrix.py	feat(gateway): add allowed_{chats,channels,rooms} whitelist to Telegram, Mattermost, Matrix, DingTalk	2026-05-07 06:54:29 -07:00
mattermost.py	feat(gateway): add allowed_{chats,channels,rooms} whitelist to Telegram, Mattermost, Matrix, DingTalk	2026-05-07 06:54:29 -07:00
msgraph_webhook.py	fix(msgraph_webhook): harden auth surface + IP allowlisting + response hygiene	2026-05-08 10:29:58 -07:00
signal.py	fix(signal): skip reactions for unauthorized senders	2026-05-04 01:38:21 -07:00
signal_rate_limit.py	feat(gateway/signal): add support for multiple images sending	2026-04-30 04:28:08 -07:00
slack.py	feat(session): make /handoff actually transfer the session live	2026-05-10 13:06:25 -07:00
sms.py	test(sms): use clear=True in test_missing_phone_number_is_non_retryable	2026-05-04 05:25:09 -07:00
telegram.py	perf(gateway): tune Telegram cadence + adaptive fast-path for short replies	2026-05-10 22:22:25 -07:00
telegram_network.py	fix(gateway): keep DoH-confirmed Telegram IPs that match system DNS (#14520 )	2026-05-05 04:42:59 -07:00
webhook.py	fix(webhook): widen INSECURE_NO_AUTH loopback check + tests + docs	2026-05-07 07:38:43 -07:00
wecom.py	fix(gateway): use monotonic deadlines in QR onboarding flows	2026-05-07 05:09:39 -07:00
wecom_callback.py	fix(gateway): tighten httpx keepalive and close whatsapp typing-response leak (#18451 )	2026-05-02 02:23:37 -07:00
wecom_crypto.py	feat(gateway): add WeCom callback-mode adapter for self-built apps	2026-04-11 15:22:49 -07:00
weixin.py	fix(weixin): wrap long copy-unfriendly lines	2026-05-07 06:08:06 -07:00
whatsapp.py	feat(cross-platform): psutil for PID/process management + Windows footgun checker	2026-05-08 14:27:40 -07:00
yuanbao.py	fix(yuanbao): enforce owner identity check on group slash commands	2026-04-30 23:57:55 -07:00
yuanbao_media.py	chore: remove unused imports and dead locals (ruff F401, F841) (#17010 )	2026-04-28 06:46:45 -07:00
yuanbao_proto.py	chore: remove unused imports and dead locals (ruff F401, F841) (#17010 )	2026-04-28 06:46:45 -07:00
yuanbao_sticker.py	yuanbao platform (#16298 )	2026-04-26 18:50:49 -07:00