hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-25 17:18:11 +00:00

Siddharth Balyan fcb1944b4f

feat(credits): usage-aware credits: in-session notices, /usage view, dev readout (#40011)

* feat(tui): HERMES_DEV_CREDITS live-spend dev readout (L0 tracer for usage-aware credits)

L0 of the usage-aware-credits feature: a dev-only, env-gated tracer that exercises the real header -> CreditsState -> TUI pipe end-to-end behind HERMES_DEV_CREDITS, de-risking the L1/L5 build before the notice policy exists.

- agent/credits_tracker.py: CreditsState + parse_credits_headers (headers are strings -> paid_access via == "true", never bool(); retain-last-known; only subscription_micros may be negative; _usd kept verbatim).
- run_agent.py: _capture_credits / get_credits_state / get_credits_spent_micros, session-start baseline latch, + dev-gated "credits" capture log.
- agent/chat_completion_helpers.py: capture on the streaming response.
- agent/agent_init.py: init _credits_state + _credits_session_start_micros.
- tui_gateway/server.py: _get_usage emits dev_credits_spent_micros only when flagged.
- ui-tui appChrome.tsx / types.ts: cents delta status segment + "(dev credits)" banner.

Off by default; silent for normal users. Validated live against staging (capture log delta matches the TUI segment). Throwaway consumer (readout/log/banner); credits_tracker + the capture plumbing are the real feature foundation.

test(credits): lock parser under 9-state matrix + harden validation (L2)

Add tests/agent/test_credits_tracker.py with 92 tests covering the 9-state matrix (healthy, sub_90pct, grant_exhausted, purchased_only, tool_pool_free, depleted, debt, missing, no_org) plus validation edge cases: version strict==1 with warn-once latch for v>1, bool-string trap (paid_access/tool_pool_gated_off == "true"/"false", never bool()), half-pair subscription limit treated as both-absent while parse succeeds, USD regex ^-?\d+\.\d{2}$, non-int micros → None, negative non-subscription micros → None, as_of_ms junk → None, zero limit ZeroDivision guard.

Harden agent/credits_tracker.py to match the spec:

- Add tool_pool_micros/tool_pool_gated_off/from_header fields to CreditsState
- Add depleted property (== not paid_access, never remaining==0)
- Change used_fraction guard to key off subscription_limit_micros (the actual denominator) not denominator_kind (metadata)
- Replace fail-soft _safe_int with a sentinel-returning variant; full validation now returns None on any malformed field rather than silently defaulting
- Add module-level warn-once latch for version > 1
- Add USD regex validation; add denominator_kind allow-list check
- Parse x-nous-tool-pool-* prefix headers (not x-nous-credits-tool-pool-)

feat(credits): notice spine: AgentNotice + notice_callback/notice_clear_callback + TUI binding (L1)

L1 of usage-aware credits: the driver-agnostic notice delivery spine that L4's policy will fire through and L5's TUI render will consume.

- agent/credits_tracker.py: AgentNotice dataclass (text/level/kind/ttl_ms/key/id; kind defaults "sticky", kept TTL-expressive for a future config seam).
- run_agent.py: AIAgent gains notice_callback + notice_clear_callback slots and _emit_notice / _emit_notice_clear emitters (swallow all callback errors, since a notice must never break the agent loop; no-op when unbound).
- agent/agent_init.py: thread both callbacks through init_agent.
- tui_gateway/server.py: bind both in _agent_cbs → notification.show / notification.clear WS events (snake_case payload, matching the existing gateway-event convention).
- ui-tui/src/gatewayTypes.ts: notification.show / notification.clear arms on GatewayEvent.
- tests/run_agent/test_notice_spine.py: 15 tests (emitter fire + fail-open + no-op, signature threading, TUI binding payload shape).

Messaging push is out of v1 (binds neither callback). CLI binding + the TUI render/decode land with L4 (firing) and L5 (render) so turn-end flush is wired correctly.
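
The validation rules listed for the header parser above (the bool-string trap, the USD regex, and the micros handling) can be illustrated with a minimal sketch. The helper names `parse_bool_header`, `parse_usd`, and `parse_micros` are hypothetical and are not taken from agent/credits_tracker.py; only the regex and the stated rules come from the commit message.

```python
import re
from typing import Optional

# Pattern quoted in the commit message: optional sign, digits, exactly two decimals.
USD_RE = re.compile(r"^-?\d+\.\d{2}$")


def parse_bool_header(raw: Optional[str]) -> Optional[bool]:
    """Map the literal strings "true"/"false" to bools; anything else is invalid.

    bool("false") is True in Python, which is the trap the commit warns about.
    """
    if raw == "true":
        return True
    if raw == "false":
        return False
    return None


def parse_usd(raw: Optional[str]) -> Optional[str]:
    """Return the USD string verbatim when it matches the regex, else None."""
    if raw is not None and USD_RE.fullmatch(raw):
        return raw
    return None


def parse_micros(raw: Optional[str], allow_negative: bool = False) -> Optional[int]:
    """Parse an integer micros value; non-int or disallowed negatives give None."""
    try:
        value = int(raw)  # TypeError on None, ValueError on junk such as "1.5"
    except (TypeError, ValueError):
        return None
    if value < 0 and not allow_negative:
        return None
    return value
```

In this sketch a caller would pass `allow_negative=True` only for the subscription micros field, matching the rule that only that field may go negative.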

* feat(credits): threshold reconciliation policy + tests (L4.1)

* feat(credits): wire threshold policy into capture + latch (L4.2)

After a fresh header parse, _capture_credits runs evaluate_credits_notices against the agent's _credits_latch and emits the result: clears first, then shows (so a recovered depletion clears before the "restored" success lands, and depleted wins the latest-wins slot). Gated on a bound notice_callback: messaging (no callbacks) still caches state for /usage but runs no policy. Parse stays fail-open (miss → keep last-known); the eval/emit path warns on failure rather than swallowing, so a depletion-notice bug can't vanish silently.

- run_agent.py: _capture_credits split into parse (swallow→miss) + policy (warn); latch lazy-guarded (object.__new__ safety).
- agent/agent_init.py: init agent._credits_latch = {"active": set(), "seen_below_90": False}.

* feat(tui): render credits notices in the status bar (L5, Strategy B)

The TUI now renders the notification.show / notification.clear gateway events the agent emits: a level-colored notice overrides the status/verb slot when not busy.

- Notice state machine on turnController (pendingNotice + dedicated noticeTimer + show/clear/applyNotice/flushPendingNotice/clearNoticeState). createGatewayEventHandler decodes the events and delegates.
- Render priority busy > notice > status (appChrome StatusRule); notice text rendered verbatim (its glyph comes from the policy), shrinkable so it never clips model│ctx; dev-credits banner + Δ segment preserved. UiState.notice is snake_case (matches wire).
- Busy-wins: a notice arriving mid-turn is held and flushed at the THREE turn-end sites (recordMessageComplete / interruptTurn / recordError), never idle(), which reset() also calls (would leak across sessions); reset() clears instead.
- Dedicated noticeTimer (never statusTimer); TTL starts on visibility with an id-guard; latest-wins cancels the prior timer; clear is key-matched (no-op on mismatch); a sticky survives a turn (flush no-ops with no pending); session reset clears (no cross-session leak).
- 20 tests (handler/turnController logic incl. R3-C2 timer isolation + render priority).

* feat(credits): cold-start seed for new Nous sessions (L3)

A genuinely-new Nous session has no inference header yet, so seed credits state from the authoritative GET /api/oauth/account snapshot at session start (in the new-session branch of _restore_or_build_system_prompt; inline, since the on_session_start plugin hook gets no agent reference). The seed runs the shared notice policy, so a session that opens already depleted warns IMMEDIATELY rather than only after the first turn.

- Maps the nested account fields (paid_service_access → paid_access; total_usable / subscription / purchased on paid_service_access_info; rollover on subscription), each None-guarded; float dollars → micros via round(d * 1e6), _usd left "" (render formats from micros; never synthesize a verbatim usd from a float).
- Magnitudes-only: no monthlyCredits on the endpoint → subscription_limit_* unset → used_fraction None → no warn90 from the seed (% only once a header lands, per D-E).
- Provider-guarded to Nous; fail-open (any error leaves _credits_state None, never blocks startup); paid_access unknown ⇒ True (never falsely depleted).
- run_agent.py: extracted the warm-path policy/emit block into a shared _emit_credits_notices() so capture and the seed fire notices identically.

* feat(credits): /usage Nous credits magnitudes view + recovery trigger (L6)

Add Nous credit dollar magnitudes to /usage (subscription / top-up / total + rollover + renewal + portal CTA), magnitudes-only per v1 (no % until the account endpoint exposes a denominator). Reuses the existing account-usage render machinery via a new pure build_nous_credits_snapshot() that maps a NousPortalAccountInfo to an AccountUsageSnapshot; no nous branch is added to fetch_account_usage (keeps the per-provider boundary intact).

CLI /usage also doubles as a depletion-recovery trigger: a force_fresh account fetch, kept in a SEPARATE local so it never clobbers the header-sourced agent._credits_state (which alone carries used_fraction). If paid access recovered while credits.depleted is latched and a notice consumer is bound, it reuses agent._emit_credits_notices() to clear it. Gateway /usage displays magnitudes only: messaging binds no notice consumer, so it performs no recovery emit. Fail-open throughout: any portal hiccup leaves /usage unaffected.

* refactor(credits): dedupe HERMES_DEV_CREDITS flag parse via shared helpers

The dev-flag truthy check was inlined in three places. Replace with the shared utils.is_truthy_value (run_agent.py, tui_gateway/server.py; also drops a redundant inline `import os`) and a hoisted DEV_CREDITS_MODE export in ui-tui/src/config/env.ts (consumed by appChrome, which also stops recomputing the env check on every render). Behaviour-preserving; identical truthy set.

* fix(credits): cut dead /usage recovery trigger + bound portal fetches (L6 review)

Adversarial review found the /usage depletion-recovery trigger dead AND broken: the CLI binds no notice_clear_callback, the TUI runs /usage in a separate slash-worker subprocess (its own agent/latch), and the no-clobber rule made it evaluate stale paid_access anyway. Recovery already happens on the next inference (warm path), so the trigger was redundant; remove it and stop the depleted notice over-promising.

- cli.py: remove the dead recovery block; bound the /usage portal fetch with a 10s wall-clock timeout (ThreadPoolExecutor) like the per-provider fetch; urllib's per-socket timeout is not a wall-clock guarantee.
- agent/credits_tracker.py: reword the depleted CTA to "run /usage for balance" (no false recovery promise; /usage shows fresh magnitudes, sticky clears next turn).
- agent/conversation_loop.py: same wall-clock timeout on the cold-start seed fetch so a stalled portal can't hang session startup; tidy its time import.

* chore(credits): dev notice-state fixtures (HERMES_DEV_CREDITS_FIXTURE)

Throwaway dev scaffolding to exercise the notice pipeline without real spend or Redis seeding. Set HERMES_DEV_CREDITS_FIXTURE to a state name (healthy / sub_90pct / grant_exhausted / depleted / clear) or a file path whose contents name a state (re-read each turn → flip states live for recovery testing). _capture_credits injects the chosen CreditsState instead of parsing real headers and runs the shared notice policy. Deletable with the rest of the HERMES_DEV_CREDITS scaffolding.

* feat(credits): /usage monthly-grant % gauge

The portal /api/oauth/account subscription block now carries monthly_credits (the per-period grant allowance, the % denominator). The consumer parsed monthly_charge but dropped monthly_credits, so /usage stayed magnitudes-only.

Capture monthly_credits into NousPortalSubscriptionInfo + _subscription_from_payload. build_nous_credits_snapshot emits a Subscription usage window (real % used, routed through the existing render machinery) when monthly_credits is a finite positive denominator and credits_remaining is finite and <= cap; otherwise it degrades to magnitudes-only (older portals, rollover-over-cap, or non-finite payloads).

Guards (adversarial-review-driven): reject non-finite operands (json.loads parses bare NaN/Infinity by default → would render $nan + a false 100% used), reject bools, guard div-by-zero (cap>0), and suppress the gauge when remaining > cap (rollover spanning the period makes the cap a nonsensical denominator → the $X-of-$Y detail would read as a contradiction). Debt (remaining<0) clamps to 100%.

Money rule preserved: the ratio + magnitudes are computed from numeric float account fields via display formatting, never by parsing a server _usd string (there are none on these dataclasses). 13 gauge tests added (tests/agent/test_nous_credits_gauge.py).

fix(credits): show /usage Nous block whenever a Nous account is present

/usage runs in a slash-worker subprocess whose resolved inference provider is often not "nous" even when the user has a Nous account, so gating the Nous credits block on (provider == "nous") hid it entirely: the account data was fully available but never rendered.

Gate instead on "a Nous account is logged in": a cheap local auth-state lookup (get_provider_auth_state('nous') has an access_token) decides whether to attempt the portal fetch, regardless of which provider inference runs on. In the gateway the block is also lifted out of the 'if provider:' scope so a Nous-credentialled user with another (or no) resident inference provider still sees their balance. Fail-open and the per-fetch wall-clock timeout are preserved.

* fix(credits): show /usage Nous block when there's no live agent (TUI slash-worker)

In the TUI, /usage runs in a slash-worker subprocess that resumes the session WITHOUT building an agent (self.agent is None), so _show_usage early-returned "(._.) No active agent" before ever reaching the Nous credits block, which is agent-independent (a portal fetch gated on Nous auth-state). Extract the block into _print_nous_credits_block() and run it at the no-agent / no-calls early-returns too (returns True if it printed, so the fallback message only shows when there's genuinely nothing).

Verified live against staging: the block + monthly-grant gauge now render in the slash-worker /usage path (previously hidden). The plain CLI REPL + messaging paths are unchanged (they have a live agent).
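
The gauge guards described for the monthly-grant % gauge above (reject non-finite operands and bools, require cap > 0, suppress when remaining > cap, clamp debt to 100%) can be sketched as follows. This is an illustration under stated assumptions, not the code in agent/account_usage.py: the function names `is_finite_number` and `gauge_used_fraction` are hypothetical, and returning None stands in for the "degrade to magnitudes-only" path.

```python
import math
from typing import Any, Optional


def is_finite_number(value: Any) -> bool:
    """True for finite int/float values; bools are rejected explicitly."""
    if isinstance(value, bool):  # bool is a subclass of int, so check it first
        return False
    return isinstance(value, (int, float)) and math.isfinite(value)


def gauge_used_fraction(monthly_credits: Any, credits_remaining: Any) -> Optional[float]:
    """Return the used fraction in [0, 1], or None to fall back to magnitudes-only."""
    if not (is_finite_number(monthly_credits) and is_finite_number(credits_remaining)):
        return None  # missing, bool, NaN or Infinity operands
    cap = float(monthly_credits)
    remaining = float(credits_remaining)
    if cap <= 0:
        return None  # avoids division by zero and negative caps
    if remaining > cap:
        return None  # rollover over the cap: the cap is not a sensible denominator
    used = (cap - remaining) / cap
    return min(1.0, max(0.0, used))  # debt (remaining < 0) clamps to 100%
```

The finite check matters because Python's `json.loads` accepts bare `NaN` and `Infinity` tokens by default, so a payload can deliver a float that compares false against everything and would otherwise flow into the division.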

* feat(credits): escalating 50/75/90 usage bands (single status line)

Replace the lone 90%-used warning with three escalating bands (50 info, 75 warn, 90 warn) shown as ONE status-bar line: it displays the highest band the subscription grant has crossed, replaces the line as usage climbs, steps back down on recovery, and clears below 50%. No stacking, no per-turn churn.

Bands live in a tunable CREDITS_USAGE_BANDS list; the policy derives everything from it. Single notice key (credits.usage) with a usage_band latch field so the notice only re-emits when the band actually changes. The crossing gate (seen_below_90) is preserved so a fresh live session that opens mid-range stays quiet until it has been observed below the lowest band (cold-start primes it when it wants an open-high warning). Denominator math unchanged: % = subscription grant burn (cap - grant_remaining)/cap, clamped [0,1]; top-up never moves the %.

Migrated test_credits_policy.py to the new key + added TestUsageBands (climb, step-down, recovery-clear, idempotent, inclusive boundaries).

* feat(credits): hydrate notices at session OPEN via shared seed (TUI + first-turn)

Notices previously only fired inside a conversation turn (first message), so a session that opened already depleted / past a usage band showed nothing at 'ready'. Extract the cold-start seed into a shared seed_credits_at_session_start() and call it (a) in the TUI/desktop agent build right after the notice callback is wired (fires at 'ready', before any message) and (b) as the first-turn fallback in conversation_loop. Idempotent (skips once _credits_state exists) and fail-open.

The seed now maps monthly_credits -> subscription_limit_micros + denominator_kind='subscription_cap', so used_fraction is computable at seed time and usage-band warnings (not just depletion) hydrate on open. Primes the crossing latch so a session opening already in a band warns immediately. Degrades to depletion-only when monthly_credits is absent (older portals).

Adds test_credits_cold_start.py covering open-at-band, depletion, debt, no-cap degradation, and the shared seed (fires/idempotent/skips-non-nous).

* feat(credits): /usage monthly-grant % gauge + fixture support + TUI surfacing

agent/account_usage.py: build_nous_credits_snapshot emits a subscription % gauge when the portal supplies a positive, finite monthly_credits denominator with remaining <= cap (guards reject NaN/Infinity and rollover-over-cap, which would render $nan or a contradictory $X-of-$Y); degrades to magnitudes-only otherwise. Adds shared nous_credits_lines() (auth-gated, wall-clock-bounded portal fetch) so the CLI and TUI /usage render the same block, and _snapshot_from_credits_state() so HERMES_DEV_CREDITS_FIXTURE drives /usage offline too.

TUI: session.usage RPC carries credits_lines (agent-independent) and the /usage panel renders them regardless of API-call count or resume state; previously the TUI's separate /usage implementation only showed token counts.

Money rule preserved: % and magnitudes come from numeric float account fields via display formatting, never by parsing a server _usd string.

feat(credits): CLI REPL inline notices (parity with TUI)

The plain CLI agent bound no notice callbacks, so credit notices were TUI-only. Bind notice_callback/notice_clear_callback on the CLI AIAgent; _on_notice renders a single level-colored line above the prompt (error red / warn yellow / success green / info dim) via _cprint, and seed credits at session open so a depletion or usage-band warning shows before the first message, the same hydration the TUI got. _on_notice_clear is a no-op (the REPL prints lines, no persistent slot).
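
The single-line 50/75/90 band behaviour described above (highest crossed band wins, inclusive boundaries, re-emit only when the band changes, clear below the lowest band) can be sketched as below. The tuple shape given to `CREDITS_USAGE_BANDS`, the function names `highest_band` and `next_notice`, and the `"show:<pct>"` / `"clear"` return strings are all assumptions made for illustration; the crossing gate (seen_below_90) is deliberately left out to keep the sketch short.

```python
from typing import Optional, Tuple

Band = Tuple[float, int, str]  # (threshold fraction, percent label, level)

# Hypothetical shape for the tunable band list named in the commit message.
CREDITS_USAGE_BANDS = [(0.50, 50, "info"), (0.75, 75, "warn"), (0.90, 90, "warn")]


def highest_band(used_fraction: Optional[float]) -> Optional[Band]:
    """Return the highest band whose threshold is reached (inclusive), else None."""
    if used_fraction is None:
        return None
    crossed = [band for band in CREDITS_USAGE_BANDS if used_fraction >= band[0]]
    return max(crossed, key=lambda band: band[0]) if crossed else None


def next_notice(latch: dict, used_fraction: Optional[float]) -> Optional[str]:
    """Return "show:<pct>" or "clear" only when the band changes; None means no event."""
    band = highest_band(used_fraction)
    new_pct = band[1] if band else None
    if new_pct == latch.get("usage_band"):
        return None  # same band as last time: stay quiet
    latch["usage_band"] = new_pct
    return f"show:{new_pct}" if new_pct is not None else "clear"
```

Keeping the last emitted band in the latch is what prevents per-turn churn in this sketch: repeated readings inside one band produce no event, while a climb, a step down, or a drop below 50% each produce exactly one.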

* test(credits): add sub_50pct + sub_75pct dev fixtures for the new usage bands

The fixture set jumped 10% -> 90%; add sub_50pct (uf 0.5 -> band 50 info) and sub_75pct (uf 0.75 -> band 75 warn) so the new escalating bands are exercisable via HERMES_DEV_CREDITS_FIXTURE across all three surfaces (notice, session-open seed, /usage gauge).

* fix(credits): usage-band notice clears on next prompt (not sticky-forever)

A 50/75/90 usage heads-up was sticky and camped the status bar indefinitely. Clear the visible credits.usage notice when a new turn starts (startMessage), so it shows until your next prompt then yields. The server latch is unchanged, so it won't re-nag at the same band; it only re-shows when the band actually changes (climb) or clears when usage drops below the lowest band. Depletion stays sticky.

* refactor(credits): consolidate the /usage credits block behind nous_credits_lines()

The CLI (_print_nous_credits_block) and the messaging gateway (_handle_usage_command) each re-implemented the auth-gate + portal fetch + render, and both bypassed the dev-fixture short-circuit that only the TUI honored, so /usage ignored HERMES_DEV_CREDITS_FIXTURE on the CLI and in chat. Route both through the shared agent.account_usage.nous_credits_lines() helper: one fetch/render path, one auth gate, and the fixture works on every surface (~60 fewer duplicated lines).

The gateway usage test recorded only the last asyncio.to_thread call; /usage now dispatches both the account fetch and the credits fetch, so it records every call and matches the account fetch by its provider arg.

* fix(credits): keep the /usage gauge type-safe and log its fail-open path

_is_finite_num is now a TypeGuard[float], so the type checker narrows the gauge operands (monthly_credits / credits_remaining) and the magnitudes passed to _fmt_usd through it; no more None-operand warnings on the arithmetic. Add a debug breadcrumb on the nous_credits_lines portal-fetch fail-open so a dead /usage block is diagnosable in agent.log without a dev flag.

* fix(credits): harden the header tracker: prod-leak gate, hot-path probe, fire-and-forget seed

- Prod-leak guard: dev fixtures (HERMES_DEV_CREDITS_FIXTURE) now also require HERMES_DEV_CREDITS, so a stray fixture var can't surface fabricated balances on a real account. Matches the documented run workflow (both vars set together).
- Hot-path probe: parse_credits_headers checks for the version sentinel header before allocating a lowercased copy of the response headers, which skips that work on every non-Nous API call. Behaviour-identical and still case-insensitive.
- Fire-and-forget seed: the real portal fetch in seed_credits_at_session_start now runs in a daemon thread, so a slow/unreachable portal never delays session "ready" (previously blocked up to 10s). The dev-fixture path stays synchronous; the thread re-checks idempotency before hydrating (a live header may land first).
- Diagnostics: debug breadcrumbs on the parse and seed fail-open paths so a crashed parser / dead seed is distinguishable from a legitimate no-headers miss.

Cold-start tests set HERMES_DEV_CREDITS alongside the fixture to match the gate.

* test(tui): fix env-timing in the StatusRule dev-credits assertion

DEV_CREDITS_MODE is read once at module load (config/env), so mutating process.env.HERMES_DEV_CREDITS inside the test couldn't flip it: the dev-banner assertion only passed if the env was exported before vitest started, and failed in a normal run. Move that assertion to a sibling file that mocks config/env with DEV_CREDITS_MODE: true (scoped, no module-reset / React-identity hazard).

* test(credits): cover the dev-fixture /usage render and usage-band clear-on-prompt

- _snapshot_from_credits_state (the offline /usage renderer) had no direct test: lock the gauge math, the verbatim _usd magnitudes, the depletion line and the fixture marker, plus the no-cap (no gauge) and None-state cases.
- turnController.startMessage had no test for clearing the credits.usage notice on the next prompt while leaving credits.depleted sticky.

feat(credits): deliver credit notices over messaging gateways

Bind notice_callback/notice_clear_callback on the per-turn gateway agent so usage-band / depletion / restored notices reach Telegram/Discord/Slack/etc. Previously the messaging gateway bound neither callback, so the agent's _emit_credits_notices early-returned and a chat user crossing a band got nothing unless they ran /usage manually.

- render_notice_line(): AgentNotice -> single plaintext line (level glyph + text), plaintext-only so it renders uniformly without per-platform escaping. Fail-soft on malformed/empty notices.
- Standalone push for every notice (messaging has no persistent status bar): route through the shared _deliver_platform_notice rail (honors private/public delivery + thread metadata), scheduled onto the gateway loop via safe_schedule_threadsafe from the agent's sync worker thread, the same pattern as _status_callback_sync.
- The fired-once latch lives on the cached (reused-in-place) agent and persists across turns, so a band crosses once -> one push, no per-turn re-nag. Re-fires only after idle-eviction rebuilds the agent (a reminder).
- Recovery ('Credit access restored') rides the show path (emitted as a success notice, not a clear). notice_clear_callback is a no-op: a sent platform message can't be cleanly retracted.

Tests: render glyph/levels/fail-soft + public/private delivery seam through _deliver_platform_notice + no-adapter no-op.

* fix(credits): don't double the glyph on messaging notices

render_notice_line prepended a per-level glyph, but the notice policy already bakes the glyph into the text (and the TUI + CLI render it verbatim), so every credit notice over messaging came out doubled ("⚠ ⚠ Credits 90% used", "⛔ ✕ Credit access paused"). Emit the text verbatim instead; drop the now-dead level→glyph map.

The render tests fed glyph-less text (and the success case only checked startswith), so the doubling slipped through. Rework them around the verbatim contract and add an end-to-end regression that runs real evaluate_credits_notices output through render_notice_line and asserts the line is returned unchanged.

2026-06-06 13:18:18 +05:30
..
platforms	feat(state.db): persist platform_message_id; restore yuanbao exact-id recall	2026-05-20 13:00:57 -07:00
__init__.py
_plugin_adapter_loader.py	test(gateway): isolate plugin adapter imports and guard the anti-pattern	2026-04-30 01:19:34 -07:00
conftest.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
feishu_helpers.py	feat(feishu): operator-configurable bot admission and mention policy	2026-04-30 20:30:31 -07:00
restart_test_helpers.py	fix(gateway): clean service restart notifications	2026-05-31 21:05:53 -07:00
test_7100_transient_failure_transcript.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_active_session_text_merge.py	gateway: debounce queued text follow-ups	2026-05-24 01:31:45 -07:00
test_agent_cache.py	fix(honcho): harden self-hosted setup paths	2026-05-29 22:29:48 -07:00
test_allowed_channels_widening.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_allowlist_startup_check.py	chore: ruff auto-fix PLR6201 resweep — tuple → set in membership tests (#27355 )	2026-05-17 02:29:41 -07:00
test_api_server.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_api_server_bind_guard.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_api_server_jobs.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_api_server_multimodal.py	feat(api-server): inline image inputs on /v1/chat/completions and /v1/responses (#12969)	2026-04-20 04:16:13 -07:00
test_api_server_normalize.py	fix(api_server): normalize array-based content parts in chat completions	2026-04-12 18:03:16 -07:00
test_api_server_runs.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_api_server_toolset.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_approve_deny_commands.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_auth_fallback.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_auto_continue.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_background_command.py	fix(gateway): route /background result media by type	2026-06-02 16:55:25 -07:00
test_background_process_notifications.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_base_topic_sessions.py	gateway: debounce queued text follow-ups	2026-05-24 01:31:45 -07:00
test_bluebubbles.py	feat(bluebubbles): support group mention gating	2026-06-01 18:52:05 -07:00
test_bundles_command.py	feat(skills): add skill bundles — alias /<name> loads multiple skills (#28373 )	2026-05-18 21:38:05 -07:00
test_busy_session_ack.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_busy_session_auth_bypass.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_cancel_background_drain.py	fix(gateway): cancel_background_tasks must drain late-arrivals (#12471)	2026-04-19 01:48:42 -07:00
test_channel_directory.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_clean_shutdown_marker.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_command_bypass_active_session.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_complete_path_at_filter.py	fix(tui): restore voice/panic handlers + scope fuzzy paths to cwd	2026-04-23 19:38:33 -05:00
test_compress_command.py	fix(compress): abort instead of dropping messages when summary LLM fails (#28102)	2026-05-18 10:19:40 -07:00
test_compress_focus.py	fix(compress): don't reach into ContextCompressor privates from /compress (#15039)	2026-04-24 02:55:43 -07:00
test_compress_plugin_engine.py	fix(compress): don't reach into ContextCompressor privates from /compress (#15039)	2026-04-24 02:55:43 -07:00
test_compression_session_id_persistence.py	test+polish(compression): pin anti-thrash gate and gateway session_id persistence	2026-05-25 01:44:46 -07:00
test_config.py	fix(gateway): bridge shared-key loop to nested platform config blocks	2026-06-04 05:31:47 -07:00
test_config_cwd_bridge.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_config_driven_access_policy.py	fix(gateway): don't treat dm_policy: pairing as open access on own-policy adapters	2026-06-04 06:31:28 -07:00
test_config_env_bridge_authority.py	gateway: debounce queued text follow-ups	2026-05-24 01:31:45 -07:00
test_debug_command.py	fix(debug): sweep expired pending pastes on slash debug paths	2026-04-22 11:59:39 -07:00
test_delivery.py	fix: refresh stale Telegram DM topic threads	2026-05-25 14:54:02 -07:00
test_delivery_silence_filter.py	fix(gateway): drop outbound silence-narration messages pre-send	2026-05-29 19:06:05 -07:00
test_destructive_slash_confirm.py	feat: confirm prompt for destructive slash commands (#4069) (#22687)	2026-05-09 11:04:46 -07:00
test_dingtalk.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_discord_allowed_channels.py	fix(discord): honor wildcard '*' in ignored_channels and free_response_channels	2026-04-24 03:04:42 -07:00
test_discord_allowed_mentions.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_attachment_download.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_bot_auth_bypass.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_discord_bot_filter.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_discord_channel_controls.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_channel_prompts.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_channel_skills.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_discord_clarify_buttons.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_discord_component_auth.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_connect.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_document_handling.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_free_response.py	fix(discord): skip backfill for auto-created threads and update test fakes	2026-05-28 04:52:02 -07:00
test_discord_imports.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_lazy_install_views.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_discord_media_metadata.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_model_picker.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_opus.py	fix(discord): recover Windows voice opus decoding	2026-05-27 03:35:33 -07:00
test_discord_race_polish.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_reactions.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_reply_mode.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_roles_dm_scope.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_discord_send.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_slash_auth.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_discord_slash_commands.py	fix(discord): skip backfill for auto-created threads and update test fakes	2026-05-28 04:52:02 -07:00
test_discord_system_messages.py	chore: ruff auto-fix PLR6201 resweep — tuple → set in membership tests (#27355)	2026-05-17 02:29:41 -07:00
test_discord_thread_persistence.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_discord_voice_mixer.py	feat(discord): voice-channel mixer — ambient idle bed + verbal acks that overlap TTS (#39659)	2026-06-05 03:10:40 -07:00
test_display_config.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_dm_topics.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_document_cache.py	refactor(telegram): generalize observed-media caching into a reusable primitive	2026-06-01 20:18:41 -07:00
test_duplicate_reply_suppression.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_email.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_empty_model_recovery.py	fix(gateway): recover model on post-interrupt turn; gate fallback status (#35381)	2026-05-30 07:28:06 -07:00
test_ephemeral_reply.py	fix(gateway): stop system tips from auto-uploading local files	2026-05-30 18:58:46 -07:00
test_extract_local_files.py	test: update extract_local_files Windows-path test for new matching behavior	2026-05-30 07:38:03 -07:00
test_fallback_eviction.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_fast_command.py	test(fast-command): stub _load_gateway_runtime_config too	2026-05-23 02:40:33 -07:00
test_feishu.py	feat(gateway): handle Feishu meeting invitations	2026-06-04 06:15:23 -07:00
test_feishu_approval_buttons.py	fix(tests): align CI tests with recent security hardening (#31470)	2026-05-24 06:54:16 -07:00
test_feishu_bot_admission.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_feishu_bot_auth_bypass.py	feat(feishu): operator-configurable bot admission and mention policy	2026-04-30 20:30:31 -07:00
test_feishu_comment.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_feishu_comment_rules.py	feat: add Feishu document comment intelligent reply with 3-tier access control	2026-04-17 19:04:11 -07:00
test_feishu_meeting_invite.py	refactor(feishu): slim meeting-invite parser; add AUTHOR_MAP entry	2026-06-04 06:15:23 -07:00
test_feishu_onboard.py	fix(gateway): use monotonic deadlines in QR onboarding flows	2026-05-07 05:09:39 -07:00
test_fresh_reset_skill_injection.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_gateway_command_help.py	fix: ignore Telegram start pings	2026-05-27 02:41:24 -07:00
test_gateway_inactivity_timeout.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_gateway_shutdown.py	fix(gateway): clean service restart notifications	2026-05-31 21:05:53 -07:00
test_goal_max_turns_config.py	fix(gateway): honor configured goal turn budget	2026-05-07 06:31:08 -07:00
test_goal_status_notice.py	fix(gateway): defer goal status notices until after response delivery	2026-05-07 17:33:09 -07:00
test_goal_verdict_send.py	revert: roll back /goal checklist + /subgoal feature stack (#23813)	2026-05-11 07:06:27 -07:00
test_google_chat.py	revert: keep Google Chat OAuth secret + active_provider profile-scoped (#39398)	2026-06-04 16:54:40 -07:00
test_home_target_env_var.py	fix(gateway): preserve home-channel thread targets across restart notifications	2026-05-03 08:47:49 -07:00
test_homeassistant.py	test: remove 169 change-detector tests across 21 files (#11472)	2026-04-17 01:05:09 -07:00
test_hooks.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_insights_unicode_flags.py	fix(model-switch): normalize Unicode dashes from Telegram/iOS input	2026-04-15 17:54:16 -07:00
test_internal_event_bypass_pairing.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_interrupt_key_match.py	gateway: debounce queued text follow-ups	2026-05-24 01:31:45 -07:00
test_irc_adapter.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_kanban_notifier.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_kanban_notifier_watcher_dispatch_gate.py	feat(kanban): gate notifier watcher on dispatch_in_gateway	2026-06-01 20:30:24 -07:00
test_keep_typing_timeout.py	fix(gateway): keep typing indicator alive across slow send_typing calls (#16763)	2026-04-27 19:09:32 -07:00
test_line_plugin.py	fix(line): map inbound message types to the correct MessageType	2026-06-04 21:55:20 -07:00
test_load_transcript_db_only.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_loop_exception_handler.py	fix(gateway): swallow transient Telegram TimedOut at loop level	2026-05-24 15:03:27 -07:00
test_matrix.py	fix(matrix): make bang-command resolution robust + fix dead skill-command branch	2026-06-03 17:19:27 +05:30
test_matrix_approval_reaction_fail_closed.py	fix(matrix): fail-closed approval reaction auth when MATRIX_ALLOWED_USERS is empty	2026-05-29 03:58:45 -07:00
test_matrix_exec_approval.py	test(matrix): set user_id in approval-reaction test to bypass defensive self-drop	2026-04-27 21:22:44 -07:00
test_matrix_mention.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_matrix_voice.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_mattermost.py	refactor(gateway): migrate Mattermost adapter to bundled plugin	2026-05-24 18:05:33 -07:00
test_max_tokens_propagation.py	test(gateway): regression tests for max_tokens propagation chain (#20741)	2026-06-05 09:10:26 -07:00
test_mcp_reload_refreshes_cached_agents.py	fix(gateway): refresh cached agent tools on /reload-mcp	2026-05-26 14:28:51 -07:00
test_media_download_retry.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_media_extraction.py	fix(gateway): restrict auto-appended media to producer tools	2026-06-01 00:00:26 -07:00
test_memory_monitor.py	Port from cline/cline#10343: periodic gateway memory logging (#27102)	2026-05-16 12:55:23 -07:00
test_message_deduplicator.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_mirror.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_model_command_custom_providers.py	fix: extract custom_provider_slug() helper, harden gateway test	2026-04-10 03:07:00 -07:00
test_model_command_flat_string_config.py	fix(gateway): coerce scalar `model:` to dict before /model --global persist (#32272)	2026-05-25 15:22:23 -07:00
test_model_switch_persistence.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_msgraph_webhook.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_native_image_buffer_isolation.py	fix(gateway): isolate pending native image paths by session	2026-04-30 20:26:35 -07:00
test_notice_delivery.py	feat(gateway): private notice delivery and Slack format_message fixes	2026-05-01 13:33:06 -07:00
test_notice_rendering.py	feat(credits): usage-aware credits — in-session notices, /usage view, dev readout (#40011)	2026-06-06 13:18:18 +05:30
test_ntfy_plugin.py	test(ntfy): cover echo-tag filter; tag standalone send path	2026-05-29 13:17:46 -07:00
test_pairing.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_pending_drain_no_recursion.py	test(gateway): pin cleanup invariants for #17758 in-band drain hand-off	2026-04-30 05:00:25 -07:00
test_pending_drain_race.py	fix(gateway): close pending-drain and late-arrival races in base adapter (#12371)	2026-04-18 19:32:26 -07:00
test_pending_event_none.py	fix(gateway): stop typing loops on session interrupt	2026-04-19 03:03:57 -07:00
test_per_platform_streaming_defaults.py	feat(streaming): per-platform streaming defaults (Telegram on, Discord off) + dashboard toggles (#37303)	2026-06-02 05:52:54 -07:00
test_pii_redaction.py	fix: remove 115 verified dead code symbols across 46 production files	2026-04-10 03:44:43 -07:00
test_planned_stop_watcher.py	fix(gateway): only fire planned-stop watcher for self-targeting markers + fix Windows consume (#34749)	2026-05-29 17:36:58 +00:00
test_platform_base.py	fix(gateway): deliver $HOME deliverables on root-run gateways	2026-06-04 07:50:22 -07:00
test_platform_connected_checkers.py	Harden msgraph webhook auth requirements (#30169)	2026-05-24 04:25:20 -07:00
test_platform_http_client_limits.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_platform_reconnect.py	fix(gateway): retry startup auto-resume when a failed platform reconnects	2026-06-04 05:56:45 -07:00
test_platform_reconnect_fd_leak.py	polish(gateway): address Copilot review comments on fd-leak fix	2026-06-02 17:27:44 -07:00
test_platform_registry.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_plugin_platform_interface.py	feat(irc): add interactive setup	2026-04-29 21:56:51 -07:00
test_post_delivery_callback_chaining.py	feat(gateway): opt-in cleanup of temporary progress bubbles (#21186)	2026-05-07 05:04:37 -07:00
test_pre_gateway_dispatch.py	feat(plugins): add pre_gateway_dispatch hook	2026-04-24 03:02:03 -07:00
test_proxy_mode.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_qqbot.py	Add Hermes desktop app (#20059)	2026-05-31 17:46:56 -05:00
test_queue_consumption.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_reasoning_command.py	fix: coerce show_reasoning and guard_agent_created config bools	2026-04-30 20:40:46 -07:00
test_reload_skills_command.py	refactor(reload-skills): queue note for next turn, drop cache invalidation + agent tool	2026-04-29 21:07:47 -07:00
test_reload_skills_discord_resync.py	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
test_replay_entry_fields.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_reply_to_injection.py	fix(gateway): always inject reply-to pointer, not just when quoted text is absent (#13676)	2026-04-21 13:33:02 -07:00
test_restart_drain.py	gateway: debounce queued text follow-ups	2026-05-24 01:31:45 -07:00
test_restart_notification.py	fix(gateway): clean service restart notifications	2026-05-31 21:05:53 -07:00
test_restart_redelivery_dedup.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_restart_resume_pending.py	fix(gateway): retry startup auto-resume when a failed platform reconnects	2026-06-04 05:56:45 -07:00
test_resume_command.py	test(cli,gateway): cover bracket-stripping and gateway session-ID lookup	2026-05-25 01:33:32 -07:00
test_retry_replacement.py	refactor(yuanbao): drop dead branch A1 message_id loop + pin missing fixture	2026-05-20 13:00:57 -07:00
test_retry_response.py
test_run_cleanup_progress.py	fix(gateway): keep Telegram heartbeat + interim commentary on; edit heartbeat in place (#33187)	2026-05-27 05:21:53 -07:00
test_run_progress_interrupt.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_run_progress_topics.py	fix(gateway): keep Telegram heartbeat + interim commentary on; edit heartbeat in place (#33187)	2026-05-27 05:21:53 -07:00
test_run_tool_media_re.py	test: use raw docstring in test_run_tool_media_re to silence escape warning	2026-05-30 07:38:03 -07:00
test_runner_fatal_adapter.py	fix(gateway): keep running when platforms fail; add per-platform circuit breaker + /platform (#26600)	2026-05-15 14:32:14 -07:00
test_runner_startup_failures.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_running_agent_session_toggles.py	refactor: drop persist_session plumbing + fix broken btw mid-turn bypass (#16075)	2026-04-26 07:15:23 -07:00
test_runtime_config_env_expansion.py	fix(config): align prefill messages key handling	2026-06-03 23:51:44 -06:00
test_runtime_env_reload_config_authority.py	fix(gateway): preserve max turns after env reload	2026-05-07 05:49:16 -07:00
test_runtime_footer.py	feat(gateway): opt-in runtime-metadata footer on final replies (#17026)	2026-04-28 06:50:04 -07:00
test_safe_adapter_disconnect.py	fix(gateway): cap adapter disconnect during stop	2026-05-08 18:50:25 -07:00
test_send_image_file.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_send_multiple_images.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_send_retry.py
test_send_voice_reply_notify.py	fix(gateway): mark final voice reply as notify-worthy so Telegram delivers it audibly	2026-05-18 22:25:15 -07:00
test_session.py	test(gateway): pin DEFAULT_DB_PATH in fixtures to prevent real state.db writes	2026-05-20 13:00:57 -07:00
test_session_api.py	fix(api_server): emit per-turn transcript on run.completed (#34703) (#34804)	2026-05-29 12:27:49 -07:00
test_session_boundary_hooks.py	feat(observability): observer-grade telemetry hooks + NeMo-Relay plugin	2026-06-03 06:36:46 -07:00
test_session_boundary_security_state.py	fix(gateway): clear slash-confirm state during session boundary cleanup	2026-05-09 14:18:20 +03:00
test_session_dm_thread_seeding.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_session_env.py	fix(tests): resolve remaining CI failures — commit_memory_session, already_sent, timezone leak, session env (#10785)	2026-04-16 02:26:14 -07:00
test_session_hygiene.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_session_info.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_session_list_allowed_sources.py	fix(sessions): /save lands under $HERMES_HOME, widen browse+TUI picker, force-refresh ollama-cloud on setup (#16296)	2026-04-26 18:49:48 -07:00
test_session_model_override_routing.py	fix(gateway): honor key_env in auth-failure fallback resolution	2026-05-23 02:25:53 -07:00
test_session_model_reset.py	fix(gateway): clear stale pending model note on session reset	2026-04-26 19:01:50 -07:00
test_session_race_guard.py	fix: ignore Telegram start pings	2026-05-27 02:41:24 -07:00
test_session_reset_notify.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_session_split_brain_11016.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_session_state_cleanup.py	fix(gateway): clear zombie agent slot when session_reset races in-flight run	2026-06-04 07:50:45 -07:00
test_session_store_prune.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_setup_feishu.py	fix: salvage follow-ups for Feishu QR onboarding (#7706)	2026-04-12 13:05:56 -07:00
test_shared_group_sender_prefix.py	fix(gateway): preserve sender attribution in shared group sessions	2026-04-21 00:54:46 -07:00
test_shutdown_cache_cleanup.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_shutdown_forensics.py	feat(gateway): shutdown forensics — non-blocking diag, per-phase timing, stale-unit warning (#23285)	2026-05-10 09:01:51 -07:00
test_shutdown_memory_provider_messages.py	fix(gateway): pass session messages to shutdown_memory_provider (#15165)	2026-04-27 06:41:16 -07:00
test_signal.py	Add Hermes desktop app (#20059)	2026-05-31 17:46:56 -05:00
test_signal_format.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_signal_rate_limit.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_simplex_plugin.py	fix(simplex): avoid reconnecting healthy idle websocket	2026-06-01 16:36:43 -07:00
test_slack.py	Add Hermes desktop app (#20059)	2026-05-31 17:46:56 -05:00
test_slack_approval_buttons.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_slack_channel_skills.py	feat(gateway/slack): support channel_skill_bindings	2026-04-26 18:25:41 -07:00
test_slack_mention.py	feat(slack): add allowed_channels whitelist config	2026-05-07 06:54:29 -07:00
test_slash_access.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_slash_access_dispatch.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_sms.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_sse_agent_cancel.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_ssl_certs.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_status.py	fix(gateway): tolerate non-UTF-8 status/pid files in gateway status reads	2026-06-04 22:05:23 -07:00
test_status_command.py	fix(gateway): name what the /status token number actually is	2026-05-29 19:14:37 -07:00
test_steer_command.py	feat(steer): /steer <prompt> injects a mid-run note after the next tool call (#12116)	2026-04-18 04:17:18 -07:00
test_step_callback_compat.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_sticker_cache.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_stop_thread_sibling.py	fix(gateway): /stop can interrupt a sibling participant's run in a per-user thread (#35959)	2026-05-31 09:29:03 -07:00
test_stream_consumer.py	fix(stream-consumer): only set _final_content_delivered when final response confirmed delivered	2026-05-28 03:15:19 -07:00
test_stream_consumer_draft.py	fix(telegram): default streaming transport to edit	2026-05-18 21:51:39 -07:00
test_stream_consumer_fresh_final.py	fix(gateway): scope final-delivery flags to turn-final segment (#29346)	2026-06-01 17:31:32 -07:00
test_stream_consumer_thread_routing.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_stream_events.py	feat(gateway): structured stream-event protocol + Telegram draft formatting parity (#37250)	2026-06-02 00:33:50 -07:00
test_stt_config.py	feat(telegram): skip-STT audio path + 2GB cap via local Bot API server	2026-05-18 22:59:40 -07:00
test_stuck_loop.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_subagent_protection_30170.py	test(gateway): regression tests for #30170 subagent interrupt protection	2026-05-25 16:23:24 +00:00
test_teams.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_teams_pipeline_runtime_wiring.py	fix(teams-pipeline): drop-scheduler fallback + test wiring for enablement gate	2026-05-08 11:18:14 -07:00
test_telegram_approval_buttons.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_telegram_audio_vs_voice.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_telegram_callback_auth_fail_closed.py	fix(telegram): fail-closed auth fallback when TELEGRAM_ALLOWED_USERS is empty	2026-05-18 22:08:08 -07:00
test_telegram_caption_merge.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_telegram_channel_posts.py	test+release: stub auth in channel_posts fixture; map @brndnsvr	2026-05-18 22:51:35 -07:00
test_telegram_clarify_buttons.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_telegram_conflict.py	fix(test+release): update conflict retry count for MAX=5; map @CryptoByz	2026-05-18 22:01:31 -07:00
test_telegram_documents.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_telegram_format.py	fix(telegram): tighten table row-group spacing and drop redundant first bullet	2026-05-25 23:16:00 -07:00
test_telegram_forum_commands.py	test+release: fix test fixture for forum_commands; map @chromalinx	2026-05-18 22:34:48 -07:00
test_telegram_group_gating.py	refactor(telegram): generalize observed-media caching into a reusable primitive	2026-06-01 20:18:41 -07:00
test_telegram_max_doc_bytes.py	feat(telegram): skip-STT audio path + 2GB cap via local Bot API server	2026-05-18 22:59:40 -07:00
test_telegram_mention_boundaries.py	refactor(telegram): use entity-only mention detection	2026-04-20 00:10:22 -07:00
test_telegram_model_picker.py	feat(model-picker): group multi-endpoint providers under one row (#35227)	2026-05-30 01:41:33 -07:00
test_telegram_network.py	test+release: align stale sticky-IP test for #24511; map @falconexe	2026-05-18 22:14:45 -07:00
test_telegram_network_reconnect.py	fix(telegram): probe polling liveness after reconnect to detect wedged Updater	2026-05-02 01:55:04 -07:00
test_telegram_noise_filter.py	fix(gateway): hide telegram compaction status noise	2026-05-27 02:41:24 -07:00
test_telegram_photo_interrupts.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_telegram_progress_edit_transient.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_telegram_reactions.py	fix(telegram): clear in-progress reaction on cancelled processing (#24628)	2026-05-12 17:02:29 -07:00
test_telegram_reply_mode.py	fix(telegram): respect reply_to_mode for DM topic reply fallback	2026-05-18 21:52:39 -07:00
test_telegram_reply_quote.py	fix(telegram): honor message.quote for partial-quote reply context	2026-05-09 11:10:36 -07:00
test_telegram_send_draft_format.py	feat(gateway): structured stream-event protocol + Telegram draft formatting parity (#37250)	2026-06-02 00:33:50 -07:00
test_telegram_send_path_health.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_telegram_slash_confirm.py	fix(telegram): escape send_slash_confirm preview with format_message	2026-05-18 22:28:21 -07:00
test_telegram_status_update.py	feat(telegram): edit status messages in place instead of appending (#30864)	2026-05-23 02:42:10 -07:00
test_telegram_text_batch_perf.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_telegram_text_batching.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_telegram_thread_fallback.py	fix(gateway): clean service restart notifications	2026-05-31 21:05:53 -07:00
test_telegram_topic_mode.py	fix(gateway): keep Telegram topic bindings aligned with compression children (#34409)	2026-05-28 23:25:52 -07:00
test_telegram_webhook_secret.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_text_batching.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_title_command.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_tool_response_drop_recovery.py	fix(gateway): recover extract-stripped tool responses on all platforms (#29346)	2026-06-01 17:31:32 -07:00
test_transcript_offset.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_tts_media_routing.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_unauthorized_dm_behavior.py	fix(gateway): allow chat-scoped telegram auth without sender user_id	2026-05-18 22:43:14 -07:00
test_unavailable_skill_hint.py	fix(gateway): match disabled/optional skills by frontmatter slug, not dir name (#18753)	2026-05-02 02:00:09 -07:00
test_undo_rewind_session.py	feat(gateway): bring /undo [N] to messaging platforms (parity with CLI/TUI) (#36699)	2026-06-01 02:04:14 -07:00
test_unknown_command.py	feat(gateway): expose plugin slash commands natively on all platforms + decision-capable command hook	2026-04-22 16:23:21 -07:00
test_update_command.py	fix(gateway): keep pending /update completion notifications until the target platform reconnects	2026-06-04 06:56:28 -07:00
test_update_streaming.py	fix(gateway): keep pending /update completion notifications until the target platform reconnects	2026-06-04 06:56:28 -07:00
test_usage_command.py	feat(credits): usage-aware credits — in-session notices, /usage view, dev readout (#40011)	2026-06-06 13:18:18 +05:30
test_verbose_command.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_version_command.py	Include git SHA in /version output via banner label helper.	2026-06-05 18:05:05 -07:00
test_vision_memory_leak.py	fix(memory): narrow scrub surface to known wrapper boundaries	2026-04-27 12:37:33 -07:00
test_voice_command.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_voice_mode_platform_isolation.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_weak_credential_guard.py	fix(gateway): reject known-weak placeholder credentials at startup	2026-04-12 18:05:41 -07:00
test_webhook_adapter.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_webhook_deliver_only.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_webhook_dynamic_routes.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_webhook_integration.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_webhook_signature_rate_limit.py	fix(webhook): validate HMAC signature before rate limiting (#12544)	2026-04-19 22:45:08 -07:00
test_wecom.py	fix(gateway): honor WECOM_ALLOWED_USERS in env-only WeCom DM allowlist	2026-06-01 19:20:36 -07:00
test_wecom_callback.py	fix(wecom-callback): retry send with fresh token on errcode 40001/42001	2026-05-24 01:30:47 -07:00
test_weixin.py	test(weixin): regression suite for _api_post/_api_get timeout migration	2026-06-01 17:31:40 -07:00
test_whatsapp_connect.py	fix(gateway): keep running when platforms fail; add per-platform circuit breaker + /platform (#26600)	2026-05-15 14:32:14 -07:00
test_whatsapp_formatting.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_whatsapp_group_gating.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_whatsapp_reply_prefix.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_whatsapp_text_batching.py	fix(gateway): config.yaml path for WhatsApp/Weixin text-batch delays	2026-05-30 07:33:15 -07:00
test_ws_auth_retry.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
test_yolo_command.py	refactor: remove dead code — 1,784 lines across 77 files (#9180)	2026-04-13 16:32:04 -07:00