hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-08 03:01:47 +00:00

Teknium f67063ba81 feat(kanban): generic diagnostics engine for task distress signals (#20332)

* feat(kanban): generic diagnostics engine for task distress signals

Replaces the hallucination-specific ``warnings`` / ``RecoverySection`` surface (shipped in PR #20232) with a reusable diagnostic-rule engine that covers five distress kinds in v1 and can be extended without touching UI code. The "something's wrong with this task" signal is no longer limited to phantom card ids.

Closes the follow-up from #20232 discussion.

New module
----------

``hermes_cli/kanban_diagnostics.py``: stateless, no-side-effect rule engine. Each rule is a pure function of ``(task, events, runs, now, config) -> list[Diagnostic]``. Registry is a simple list; adding a new distress kind is one function + one import, no UI or API changes required.

v1 rule set
-----------

* ``hallucinated_cards`` (error): folds the existing ``completion_blocked_hallucination`` event into the new surface.
* ``prose_phantom_refs`` (warning): folds ``suspected_hallucinated_references``.
* ``repeated_spawn_failures`` (error → critical at 2x threshold): fires when ``tasks.spawn_failures >= 3``; suggests ``hermes -p <profile> doctor`` / ``auth``.
* ``repeated_crashes`` (error → critical): fires after N consecutive ``crashed`` run outcomes with no successful completion between; suggests ``hermes kanban log <id>``.
* ``stuck_in_blocked`` (warning): fires after 24h in ``blocked`` state with no comments / unblock attempts; suggests commenting.

Every diagnostic carries structured ``actions`` (reclaim, reassign, unblock, cli_hint, comment, open_docs) that render consistently in both CLI and dashboard. Suggested actions are highlighted; generic recovery actions (reclaim / reassign) are available on every kind as fallbacks.

Diagnostics auto-clear when the underlying failure resolves: a clean ``completed``/``edited`` event drops hallucination diagnostics, a successful run drops crash diagnostics, a comment drops stuck-blocked diagnostics. Audit events persist; the badge goes away.

API
---

``plugin_api.py``:

* ``/board`` now attaches ``diagnostics`` (full list) and ``warnings`` (compact summary with ``highest_severity``) per task.
* ``/tasks/{id}`` attaches diagnostics so the drawer's Diagnostics section auto-opens on flagged tasks.
* NEW ``/diagnostics`` endpoint: fleet-wide listing, filterable by severity, sorted critical-first.

CLI
---

* NEW ``hermes kanban diagnostics [--severity X] [--task id] [--json]``: fleet view or single-task view, matches dashboard rule output so CLI users see the same picture.
* ``hermes kanban show <id>`` now renders a Diagnostics section near the top with severity markers + suggested actions.

Dashboard
---------

* Card badge is severity-coloured (⚠ amber warning, !! orange error, !!! red critical) using ``warnings.highest_severity``.
* Attention strip above the toolbar counts EVERY task with active diagnostics (not just hallucinations), severity-coloured, lists affected tasks with Open buttons when expanded.
* Drawer's old ``RecoverySection`` replaced with generic ``DiagnosticsSection`` rendering a card per active diagnostic: title + detail + structured data (task-id chips when payload keys look like id lists) + action buttons. Reassign profile picker is inline per-diagnostic. Clipboard fallback uses ``.catch()`` for environments where writeText rejects.
* Three-rung severity palette; amber for warning, orange for error, red for critical. Uses CSS variables so theming is straightforward.

Tests
-----

* NEW ``tests/hermes_cli/test_kanban_diagnostics.py``: 14 unit tests covering each rule's positive/negative/threshold paths, severity sorting, broken-rule isolation, and sqlite3.Row integration.
* Dashboard plugin tests extended: ``/diagnostics`` endpoint (empty, populated, severity-filtered), ``/board`` exposes both diagnostic list and compact summary with ``highest_severity``.
* Existing hallucination-specific test (``test_board_surfaces_warnings_field_for_hallucinated_completions``) updated to reflect the new contract: warning summary keys by diagnostic kind (``hallucinated_cards``) not event kind.

379 kanban-suite tests pass (+16 net from this PR).

Live verification
-----------------

Seeded all 5 diagnostic kinds + one clean + one plain-running task (7 total) into an isolated HERMES_HOME, spun up the dashboard, and verified:

* Attention strip: shows ``!! 5 tasks need attention`` in the error-severity orange; Show expands to a list of 5 rows ordered critical > error > warning.
* Card badges: error tasks render ``!!`` orange, warning tasks render ``⚠`` amber, clean and plain-running tasks render no badge.
* Each of the 5 rules opens a correctly-coloured, correctly-styled diagnostic card in the drawer with its specific suggested action.
* Live reassign from a diagnostic card flipped ``broken-ml-worker → alice`` and the drawer refreshed with the new assignee + the same diagnostic still firing (correct: spawn_failures counter hasn't reset yet).
* CLI ``hermes kanban diagnostics`` prints all 5 in severity order; ``--severity error`` narrows to 3; ``kanban show <id>`` includes the Diagnostics block at the top with suggested action hint.

Migration note
--------------

The old ``warnings`` shape (``{count, kinds, latest_at}``) is preserved on the API but ``kinds`` now keys by diagnostic kind (``hallucinated_cards``) instead of event kind (``completion_blocked_hallucination``). ``highest_severity`` is a new required field. The dashboard was the only consumer and has been updated in the same commit; external API consumers of the ``warnings`` field will need to update their kind-match logic.
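
The rule-engine shape described above (pure-function rules, a plain list as registry, broken-rule isolation, critical-first ordering) could look roughly like the sketch below. It is not the code in ``hermes_cli/kanban_diagnostics.py``: the ``Diagnostic`` fields, the ``run_rules`` helper, the ``blocked_at`` task key, the event dict layout, and the config key names are all assumptions made for illustration. Only the rule signature, the ``spawn_failures >= 3`` threshold, the 2x escalation, and the 24h blocked window come from the commit message.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional

# Lower number sorts first, so critical diagnostics lead the list.
SEVERITY_ORDER = {"critical": 0, "error": 1, "warning": 2}


@dataclass
class Diagnostic:
    # Hypothetical shape; the real class may carry more fields.
    kind: str
    severity: str
    title: str
    actions: list = field(default_factory=list)


# Each rule is a pure function of (task, events, runs, now, config).
Rule = Callable[[dict, list, list, float, dict], list]


def repeated_spawn_failures(task, events, runs, now, config):
    threshold = config.get("spawn_failure_threshold", 3)  # assumed config key
    failures = task.get("spawn_failures", 0)
    if failures < threshold:
        return []
    # Escalate from error to critical at twice the threshold.
    severity = "critical" if failures >= 2 * threshold else "error"
    title = f"Agent spawn failed {failures}x"
    return [Diagnostic("repeated_spawn_failures", severity, title,
                       ["cli_hint", "reclaim", "reassign"])]


def stuck_in_blocked(task, events, runs, now, config):
    if task.get("status") != "blocked":
        return []
    limit = config.get("blocked_seconds", 24 * 3600)  # assumed config key
    blocked_at = task.get("blocked_at", now)          # assumed task field
    # Simplification: only comments clear the diagnostic in this sketch.
    commented = any(e.get("kind") == "comment" and e.get("at", 0) >= blocked_at
                    for e in events)
    if commented or now - blocked_at < limit:
        return []
    return [Diagnostic("stuck_in_blocked", "warning",
                       "Blocked for over 24h with no comments",
                       ["comment", "unblock"])]


# The registry is a simple list; a new distress kind is one more entry.
RULES = [repeated_spawn_failures, stuck_in_blocked]


def run_rules(task, events, runs, now, config, rules: Optional[list] = None):
    found = []
    for rule in (RULES if rules is None else rules):
        try:
            found.extend(rule(task, events, runs, now, config))
        except Exception:
            # Broken-rule isolation: one failing rule must not hide the rest.
            continue
    return sorted(found, key=lambda d: SEVERITY_ORDER.get(d.severity, 99))
```

Because the rules take ``now`` as an argument instead of reading the clock, time-dependent cases such as the 24h blocked window can be tested by passing a fixed timestamp.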
* feat(kanban/diagnostics): lead titles with the actual error text

The generic 'Worker crashed N runs in a row' / 'Worker failed to spawn N times' titles buried the actual cause in the data section. Operators had to open logs or expand the diagnostic to see WHY the worker is stuck: rate-limit vs insufficient quota vs bad auth vs context overflow vs network blip all looked identical at a glance.

New titles:

    Agent crashed 3x: openai: 429 Too Many Requests - rate limit reached
    Agent crashed 3x: anthropic: 402 insufficient_quota - credit balance
    Agent crashed 3x: provider auth error: 401 Unauthorized
    Agent spawn failed 4x: insufficient_quota: You exceeded your current

Detail keeps the full error snippet (capped at 500 chars + ellipsis for tracebacks). Title takes the first line capped at 160 chars. Fallback title if no error recorded stays honest ('no error recorded').

Tests: 4 new cases covering 429/billing/spawn/truncation. 383 total pass (+4).

Live-verified on dashboard with 6 seeded scenarios (rate-limit, billing, auth, context, network, spawn-billing): each card title leads with the actionable error text.

2026-05-05 13:32:42 -07:00
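
The title and detail rules in this second commit can be sketched as two small helpers. This is an illustration only: the function names ``crash_title`` and ``crash_detail`` are hypothetical, and two readings are assumed, namely that the 160-character cap applies to the error's first line (not to the whole title) and that the ellipsis is three ASCII dots.

```python
from typing import Optional

TITLE_MAX = 160   # cap on the first error line shown in the title
DETAIL_MAX = 500  # cap on the error snippet kept in the detail


def crash_title(count: int, error: Optional[str],
                prefix: str = "Agent crashed") -> str:
    """Lead the title with the first line of the recorded error."""
    text = (error or "").strip()
    # Fall back to an honest placeholder when nothing was recorded.
    first_line = text.splitlines()[0].strip() if text else "no error recorded"
    return f"{prefix} {count}x: {first_line[:TITLE_MAX]}"


def crash_detail(error: Optional[str]) -> str:
    """Keep the error snippet, truncating long tracebacks with an ellipsis."""
    text = (error or "").strip()
    if len(text) <= DETAIL_MAX:
        return text
    return text[:DETAIL_MAX] + "..."
```

For example, ``crash_title(3, "provider auth error: 401 Unauthorized")`` produces a title in the same form as the third example listed in the commit message.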
__init__.py	test: reorganize test structure and add missing unit tests	2026-02-26 03:20:08 +03:00
conftest.py	fix(kanban): suppress dispatcher stuck-warn when ready queue holds only non-spawnable assignees	2026-05-05 04:13:12 -07:00
test_ai_gateway_models.py	refactor(ai-gateway): single source of truth for model catalog (#13304)	2026-04-20 22:21:21 -07:00
test_anthropic_model_flow_stale_oauth.py	fix: re-auth on stale OAuth token; read Claude Code credentials from macOS Keychain	2026-04-24 07:14:00 -07:00
test_anthropic_oauth_flow.py	refactor(tests): re-architect tests + fix CI failures (#5946)	2026-04-07 17:19:07 -07:00
test_anthropic_provider_persistence.py	refactor(tests): re-architect tests + fix CI failures (#5946)	2026-04-07 17:19:07 -07:00
test_api_key_providers.py	chore(salvage): strip duplicated/merge-corrupted blocks from PR #17664	2026-04-29 21:56:51 -07:00
test_apply_model_switch_result_context.py	fix(cli): /model picker honors provider-specific context caps (#16030)	2026-04-26 05:43:31 -07:00
test_arcee_provider.py	feat(providers): add tencent-tokenhub provider support	2026-04-28 03:45:52 -07:00
test_argparse_flag_propagation.py	feat: shell hooks — wire shell scripts as Hermes hook callbacks	2026-04-20 20:53:51 -07:00
test_at_context_completion_filter.py	fix(tui): @folder: only yields directories, @file: only yields files	2026-04-21 14:31:48 -05:00
test_atomic_json_write.py	refactor(tests): re-architect tests + fix CI failures (#5946)	2026-04-07 17:19:07 -07:00
test_atomic_yaml_write.py	refactor(tests): re-architect tests + fix CI failures (#5946)	2026-04-07 17:19:07 -07:00
test_auth_codex_provider.py	fix(model): let Codex setup reuse or reauthenticate	2026-04-24 04:53:32 -07:00
test_auth_commands.py	fix(auth): make provider config writes atomic	2026-04-30 20:39:41 -07:00
test_auth_nous_provider.py	feat(nous): persist Nous OAuth across profiles via shared token store (#19712)	2026-05-04 04:54:55 -07:00
test_auth_provider_gate.py	fix: resolve CI test failures — add missing functions, fix stale tests (#9483)	2026-04-14 01:43:45 -07:00
test_auth_qwen_provider.py	feat(qwen): add Qwen OAuth provider with portal request support	2026-04-08 13:46:30 -07:00
test_auth_ssl_macos.py	fix(auth): honor SSL CA env vars across httpx + requests callsites	2026-04-24 03:00:33 -07:00
test_aux_config.py	fix(aux): add session_search extra_body and concurrency controls	2026-04-20 00:47:39 -07:00
test_azure_detect.py	feat(azure-foundry): auto-detect transport, models, context length	2026-04-25 18:48:43 -07:00
test_backup.py	fix(backup): floor pre-update backup_keep to 1 so the new backup survives	2026-05-04 05:07:13 -07:00
test_banner.py	feat(banner): hyperlink startup banner title to latest GitHub release (#14945)	2026-04-23 23:28:34 -07:00
test_banner_git_state.py	fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974)	2026-04-07 17:59:42 -07:00
test_banner_skills.py	fix: disabled skills respected across banner, system prompt, slash commands, and skill_view (#1897)	2026-03-18 03:17:37 -07:00
test_bedrock_model_picker.py	fix(model): avoid bedrock credential probe in provider picker	2026-05-03 00:32:55 -07:00
test_chat_skills_flag.py	fix(termux): add local image chat route	2026-04-09 16:24:53 -07:00
test_claw.py	fix(ci): stabilize main test suite regressions (#17660)	2026-04-29 23:18:55 -07:00
test_clear_stale_base_url.py	fix: warn and clear stale OPENAI_BASE_URL on provider switch (#5161)	2026-04-11 01:52:58 -07:00
test_cmd_update.py	fix(update): sync bundled skills to all profiles, including active (#16176)	2026-05-04 12:34:53 -07:00
test_coalesce_session_args.py	fix(cli): handle unquoted multi-word session names in -c/--continue and -r/--resume	2026-03-09 21:36:29 -07:00
test_codex_cli_model_picker.py	fix(tests): unstick CI — sweep stale tests from recent merges (#12670)	2026-04-19 12:39:58 -07:00
test_codex_models.py	fix(model): let Codex setup reuse or reauthenticate	2026-04-24 04:53:32 -07:00
test_commands.py	feat: add Telegram DM topic-mode sessions	2026-05-04 12:07:17 -07:00
test_completion.py	fix: preserve profile name completion in dynamic shell completion	2026-04-14 10:45:42 -07:00
test_config.py	fix(cli): prevent .env sanitizer from splitting GLM_API_KEY by LM_API_KEY suffix	2026-04-28 22:22:45 -07:00
test_config_drift.py	feat(delegate): orchestrator role and configurable spawn depth (default flat)	2026-04-21 14:23:45 -07:00
test_config_env_expansion.py	fix(ci): stabilize main test suite regressions (#17660)	2026-04-29 23:18:55 -07:00
test_config_env_refs.py	fix(config): preserve env refs when save_config rewrites config (#11892)	2026-04-17 19:03:26 -07:00
test_config_validation.py	fix(config): accept fallback_model list (chain) in validator + save	2026-04-28 01:40:25 -07:00
test_container_aware_cli.py	fix(ci): stabilize main test suite regressions (#17660)	2026-04-29 23:18:55 -07:00
test_copilot_auth.py	fix: remove 115 verified dead code symbols across 46 production files	2026-04-10 03:44:43 -07:00
test_copilot_catalog_oauth_fallback.py	fix(copilot): require successful exchange when walking credential_pool catalog tokens	2026-04-28 01:18:09 -07:00
test_copilot_context.py	fix(copilot): wire live /models max_prompt_tokens into context-window resolver	2026-04-24 05:09:08 -07:00
test_copilot_in_model_list.py	fix(model): repair Discord Copilot /model flow	2026-04-24 03:33:29 -07:00
test_copilot_token_exchange.py	fix(copilot): exchange raw GitHub token for Copilot API JWT	2026-04-24 05:09:08 -07:00
test_cron.py	feat(skills): consolidate find-nearby into maps as a single location skill	2026-04-19 05:19:22 -07:00
test_curator_archive_prune.py	feat(curator): add archive and prune subcommands (#20200)	2026-05-05 05:15:54 -07:00
test_curator_status.py	fix(curator): only mark agent-created for background-review sediment (#19621)	2026-05-04 02:42:16 -07:00
test_custom_provider_context_length.py	fix(context): honor custom_providers context_length on /model switch + bump probe tier to 256K (#15844)	2026-04-25 18:47:53 -07:00
test_custom_provider_model_switch.py	fix(cli): omit empty api_mode when probing custom models	2026-05-04 02:46:41 -07:00
test_dashboard_browser_safe_imports.py	Merge upstream/main and address Copilot review feedback	2026-04-30 06:43:22 -04:00
test_dashboard_lifecycle_flags.py	feat(dashboard): add --stop and --status flags (#17840)	2026-04-30 02:30:20 -07:00
test_dashboard_profiles_nav_label.py	fix(dashboard): keep profiles list resilient	2026-04-29 01:39:52 -04:00
test_debug.py	fix(debug): redact log content at upload time in hermes debug share	2026-05-03 11:42:20 -07:00
test_deprecated_cwd_warning.py	fix: enforce config.yaml as sole CWD source + deprecate .env CWD vars + add hermes memory reset (#11029)	2026-04-16 06:48:33 -07:00
test_detect_api_mode_for_url.py	fix: restrict provider URL detection to exact hostname matches	2026-04-20 22:14:29 -07:00
test_determine_api_mode_hostname.py	fix: extend hostname-match provider detection across remaining call sites	2026-04-20 22:14:29 -07:00
test_dingtalk_auth.py	test(dingtalk): cover QR device-flow auth + OpenClaw branding disclosure	2026-04-17 05:08:07 -07:00
test_discord_skill_clamp_warning.py	test: add tests for cmd_key preservation through name clamping	2026-05-03 03:25:45 -07:00
test_doctor.py	fix(doctor): check gh auth status when GITHUB_TOKEN absent	2026-05-04 12:34:31 -07:00
test_doctor_command_install.py	feat(doctor): add Command Installation check for hermes bin symlink	2026-04-14 23:13:11 -07:00
test_env_loader.py	clarify placeholder telegram credential in tests	2026-05-04 15:31:15 -04:00
test_env_sanitize_on_load.py	clarify placeholder telegram credential in tests	2026-05-04 15:31:15 -04:00
test_fallback_cmd.py	feat(cli): add 'hermes fallback' command to manage fallback providers (#16052)	2026-04-26 06:19:04 -07:00
test_gateway.py	fix(tests): tolerate ps ancestor-walk in find_gateway_pids fallback test (#19590)	2026-05-04 01:40:39 -07:00
test_gateway_linger.py	fix(termux): disable gateway service flows on android	2026-04-09 16:24:53 -07:00
test_gateway_runtime_health.py	fix(gateway): harden Telegram polling conflict handling	2026-03-14 12:11:23 -07:00
test_gateway_service.py	fix(gateway): handle planned service stops	2026-05-04 16:00:49 -07:00
test_gateway_wsl.py	feat(gateway): WSL-aware gateway with smart systemd detection (#7510)	2026-04-10 21:15:47 -07:00
test_gemini_free_tier_setup_block.py	feat(gemini): block free-tier keys at setup + surface guidance on 429 (#15100)	2026-04-24 04:46:17 -07:00
test_gemini_provider.py	test: stop testing mutable data — convert change-detectors to invariants (#13363)	2026-04-20 23:20:33 -07:00
test_gmi_provider.py	fix(providers/gmi): post-salvage review fixes	2026-04-27 11:17:59 -07:00
test_goals.py	feat: /goal — persistent cross-turn goals (Ralph loop) (#18262)	2026-04-30 23:10:20 -07:00
test_hooks_cli.py	feat: shell hooks — wire shell scripts as Hermes hook callbacks	2026-04-20 20:53:51 -07:00
test_ignore_user_config_flags.py	refactor(cli): derive relaunch flag table from argparse introspection	2026-04-29 20:33:29 -07:00
test_image_gen_picker.py	fix(image-gen): persist plugin provider on reconfigure	2026-04-23 01:56:09 -07:00
test_kanban_boards.py	fix(kanban): ignore stale current board pointers	2026-05-05 04:34:45 -07:00
test_kanban_cli.py	feat(kanban): hallucination gate + recovery UX for worker-created-card claims (#20232)	2026-05-05 08:06:55 -07:00
test_kanban_core_functionality.py	feat(kanban): hallucination gate + recovery UX for worker-created-card claims (#20232)	2026-05-05 08:06:55 -07:00
test_kanban_db.py	fix(kanban): suppress dispatcher stuck-warn when ready queue holds only non-spawnable assignees	2026-05-05 04:13:12 -07:00
test_kanban_diagnostics.py	feat(kanban): generic diagnostics engine for task distress signals (#20332)	2026-05-05 13:32:42 -07:00
test_launcher.py	fix: use argparse entrypoint in top-level launcher (#3874)	2026-03-29 21:54:36 -07:00
test_list_picker_providers.py	feat(cli): add list_picker_providers for credential-filtered picker	2026-05-05 10:18:58 -07:00
test_logs.py	feat: component-separated logging with session context and filtering (#7991)	2026-04-11 17:23:36 -07:00
test_managed_installs.py	chore: prepare Hermes for Homebrew packaging (#4099)	2026-03-30 17:34:43 -07:00
test_mcp_config.py	fix(mcp): consolidate OAuth handling, pick up external token refreshes (#11383)	2026-04-16 21:57:10 -07:00
test_mcp_reload_confirm_gate.py	feat(gateway,cli): confirm /reload-mcp to warn about prompt cache invalidation	2026-04-29 21:56:47 -07:00
test_mcp_tools_config.py	feat: interactive MCP tool configuration in hermes tools (#1694)	2026-03-17 03:48:44 -07:00
test_memory_reset.py	fix: enforce config.yaml as sole CWD source + deprecate .env CWD vars + add hermes memory reset (#11029)	2026-04-16 06:48:33 -07:00
test_model_catalog.py	feat(models): remote model catalog manifest for OpenRouter + Nous Portal (#16033)	2026-04-26 05:46:43 -07:00
test_model_normalize.py	fix(model-normalize): pass DeepSeek V-series IDs through instead of folding to deepseek-chat	2026-04-24 05:24:54 -07:00
test_model_picker_viewport.py	refactor(cli): align model picker viewport with PR #11260 vocabulary	2026-04-17 06:33:21 -07:00
test_model_provider_persistence.py	fix(auth): make provider config writes atomic	2026-04-30 20:39:41 -07:00
test_model_switch_context_display.py	fix(context): honor custom_providers context_length on /model switch + bump probe tier to 256K (#15844)	2026-04-25 18:47:53 -07:00
test_model_switch_copilot_api_mode.py	fix: recompute Copilot api_mode after model switch	2026-04-16 01:16:14 -07:00
test_model_switch_custom_providers.py	test(model_switch): update regression to reflect bare-custom guard	2026-04-30 04:32:11 -07:00
test_model_switch_opencode_anthropic.py	fix(opencode): derive api_mode from target model, not stale config default (#15106)	2026-04-24 04:58:46 -07:00
test_model_switch_variant_tags.py	fix(models): preserve OpenRouter variant tags (:free, :extended, :fast) during model switch (#6383)	2026-04-08 19:58:16 -07:00
test_model_validation.py	chore(salvage): strip duplicated/merge-corrupted blocks from PR #17664	2026-04-29 21:56:51 -07:00
test_models.py	fix(tui): resolve startup model aliases statically	2026-04-25 14:13:02 -05:00
test_models_dev_preferred_merge.py	feat(/model): merge models.dev entries for lesser-loved providers (#14221)	2026-04-22 17:33:42 -07:00
test_non_ascii_credential.py	fix(env_loader): warn when non-ASCII stripped from credential env vars (#13300)	2026-04-20 22:14:03 -07:00
test_nous_hermes_non_agentic.py	fix(cli): narrow Nous Hermes non-agentic warning to actual hermes-3/-4 models	2026-04-13 04:33:52 -07:00
test_nous_subscription.py	fix(cli): coerce use_gateway config flags in tool routing	2026-04-26 19:02:55 -07:00
test_ollama_cloud_auth.py	fix(opencode): derive api_mode from target model, not stale config default (#15106)	2026-04-24 04:58:46 -07:00
test_ollama_cloud_provider.py	fix(models): strip :cloud/-cloud suffix from models.dev Ollama Cloud IDs	2026-05-04 12:38:15 -07:00
test_openai_codex_model_validation_fallback.py	fix(model-switch): soft-accept unlisted openai-codex models	2026-05-04 05:06:53 -07:00
test_opencode_go_in_model_list.py	feat(/model): merge models.dev entries for lesser-loved providers (#14221)	2026-04-22 17:33:42 -07:00
test_opencode_go_validation_fallback.py	fix(/model): accept provider switches when /models is unreachable	2026-04-21 05:19:43 -07:00
test_overlay_slug_resolution.py	fix(model_picker): detect mapped-provider auth-store credentials	2026-04-24 05:20:05 -07:00
test_path_completion.py	feat(cli): add file path autocomplete in the input prompt (#1545)	2026-03-16 06:07:45 -07:00
test_pin_kanban_board_env.py	test(kanban): isolate HERMES_KANBAN_BOARD writes in pin-env tests	2026-05-05 04:37:47 -07:00
test_placeholder_usage.py	fix: cover remaining config placeholder help text	2026-03-14 10:35:14 -07:00
test_plugin_cli_registration.py	test: remove 169 change-detector tests across 21 files (#11472)	2026-04-17 01:05:09 -07:00
test_plugin_scanner_recursion.py	feat(plugins): pluggable image_gen backends + OpenAI provider (#13799)	2026-04-21 21:30:10 -07:00
test_plugins.py	fix(plugins): bound async plugin command await with 30s timeout	2026-04-30 19:56:18 -07:00
test_plugins_cmd.py	fix: treat ctrl-c as curses cancel	2026-05-04 01:36:44 -07:00
test_profile_export_credentials.py	fix: also exclude .env from default profile exports	2026-04-01 11:20:33 -07:00
test_profiles.py	fix(profiles): keep validate_profile_name strict; callers normalize first	2026-05-04 04:44:37 -07:00
test_prompt_api_key.py	fix(setup): offer Keep/Replace/Clear when API key already exists	2026-05-05 04:08:11 -07:00
test_provider_config_validation.py	fix(config): add request_timeout_seconds and stale_timeout_seconds to provider _KNOWN_KEYS	2026-04-28 01:28:25 -07:00
test_pty_bridge.py	fix(ci): stabilize main test suite regressions (#17660)	2026-04-29 23:18:55 -07:00
test_reasoning_effort_menu.py	fix: normalize reasoning effort ordering in UI	2026-04-09 14:20:16 -07:00
test_redact_config_bridge.py	feat(security): make secret redaction off by default (#16794)	2026-04-27 21:24:08 -07:00
test_regression_16767.py	test(cli): regression coverage for user-provider routing fix (#16767)	2026-04-28 01:47:20 -07:00
test_relaunch.py	remove relaunch_chat	2026-04-29 20:33:29 -07:00
test_resolve_last_session.py	fix(cli): tighten MRU lookup and session DB cleanup	2026-04-27 08:52:12 -07:00
test_runtime_provider_resolution.py	fix(fallback): let custom_providers shadow built-in aliases	2026-04-30 20:18:44 -07:00
test_session_browse.py	fix(sessions): /save lands under $HERMES_HOME, widen browse+TUI picker, force-refresh ollama-cloud on setup (#16296)	2026-04-26 18:49:48 -07:00
test_sessions_delete.py	test(sessions): wire sessions_dir through auto-prune + file-cleanup regression tests	2026-04-26 18:31:07 -07:00
test_set_config_value.py	fix(config): preserve YAML lists in hermes config set (#17876)	2026-04-30 04:32:17 -07:00
test_setup.py	fix(setup): add missing SLACK_HOME_CHANNEL prompt to _setup_slack()	2026-05-04 01:37:18 -07:00
test_setup_agent_settings.py	fix(gateway): shutdown + restart hygiene (drain timeout, false-fatal, success log) (#18761)	2026-05-02 02:08:06 -07:00
test_setup_hermes_script.py	fix(termux): make setup-hermes use android path	2026-04-09 16:24:53 -07:00
test_setup_irc.py	feat(plugins): bundled platform plugins auto-load by default	2026-04-29 21:56:51 -07:00
test_setup_matrix_e2ee.py	docs(matrix): update all references from matrix-nio to mautrix	2026-04-10 21:15:59 -07:00
test_setup_model_provider.py	fix: remove 115 verified dead code symbols across 46 production files	2026-04-10 03:44:43 -07:00
test_setup_noninteractive.py	feat(setup): auto-reconfigure on existing installs (#15879)	2026-04-25 22:02:02 -07:00
test_setup_ollama_cloud_force_refresh.py	fix(sessions): /save lands under $HERMES_HOME, widen browse+TUI picker, force-refresh ollama-cloud on setup (#16296)	2026-04-26 18:49:48 -07:00
test_setup_openclaw_migration.py	feat(plugins): bundled platform plugins auto-load by default	2026-04-29 21:56:51 -07:00
test_setup_prompt_menus.py	fix(cli): sanitize bracketed paste markers during setup	2026-05-05 06:12:42 -07:00
test_setup_reconfigure.py	feat(setup): auto-reconfigure on existing installs (#15879)	2026-04-25 22:02:02 -07:00
test_skills_config.py	fix(tests): resolve 17 persistent CI test failures (#15084)	2026-04-24 03:46:46 -07:00
test_skills_hub.py	feat(skills): install skills from a direct HTTP(S) URL (#16323)	2026-04-26 20:57:10 -07:00
test_skills_install_flags.py	fix: add --yes flag to bypass confirmation in /skills install and uninstall (#1647)	2026-03-17 01:59:07 -07:00
test_skills_skip_confirm.py	fix(skills): cache-aware /skills install and uninstall in TUI (#3586)	2026-03-28 14:32:23 -07:00
test_skills_subparser.py	fix(cli): resolve duplicate 'skills' subparser crash on Python 3.11+	2026-03-11 00:50:39 -07:00
test_skin_engine.py	fix(tui): restore macOS copy behavior and theme polish (#17131)	2026-04-28 18:47:14 -05:00
test_spotify_auth.py	feat(spotify): interactive setup wizard + docs page (#15130)	2026-04-24 05:30:05 -07:00
test_status.py	feat: add Vercel Sandbox backend	2026-04-29 07:22:33 -07:00
test_status_model_provider.py	feat(agent): add lmstudio integration	2026-04-28 12:27:36 -07:00
test_subparser_routing_fallback.py	test: remove 169 change-detector tests across 21 files (#11472)	2026-04-17 01:05:09 -07:00
test_subprocess_timeouts.py	fix(cli): add missing subprocess.run() timeouts in doctor and status (#4009)	2026-03-30 11:17:15 -07:00
test_suppress_eio_on_interrupt.py	fix(cli): suppress OSError EIO on interrupt shutdown	2026-04-25 18:25:13 -07:00
test_tencent_tokenhub_provider.py	feat(providers): add tencent-tokenhub provider support	2026-04-28 03:45:52 -07:00
test_terminal_menu_fallbacks.py	Harden setup provider flows	2026-04-10 02:57:39 -07:00
test_timeouts.py	fix(config): add stale timeout settings	2026-04-20 00:52:50 -07:00
test_tips.py	refactor: remove dead code — 1,784 lines across 77 files (#9180)	2026-04-13 16:32:04 -07:00
test_tool_token_estimation.py	fix(tests): resolve 10 CI failures across hooks, tiktoken, plugins (#3848)	2026-03-29 20:05:59 -07:00
test_tools_config.py	fix(cli): sync use_gateway in _reconfigure_provider for tts, browser, and web	2026-05-04 02:33:55 -07:00
test_tools_disable_enable.py	fix: MCP toolset resolution for runtime and config (#3252)	2026-03-26 13:39:41 -07:00
test_tui_npm_install.py	fix(tui): tolerate npm's peer-flag drop in lockfile comparison	2026-05-04 14:13:38 +10:00
test_tui_resume_flow.py	fix(tui): honor launch toolsets (#17623)	2026-04-29 16:55:27 -07:00
test_update_autostash.py	fix(ci): recover 38 failing tests on main (#17642)	2026-04-29 20:05:32 -07:00
test_update_check.py	test: remove 8 flaky tests that fail under parallel xdist scheduling (#12784)	2026-04-19 19:38:02 -07:00
test_update_config_clears_custom_fields.py	fix(anthropic): complete third-party Anthropic-compatible provider support (#12846)	2026-04-19 22:43:09 -07:00
test_update_gateway_restart.py	fix(gateway): drain manual profile gateways via SIGUSR1 before respawn	2026-04-30 20:00:31 -07:00
test_update_hangup_protection.py	fix(update): survive mid-update terminal disconnect (#11960)	2026-04-17 21:29:24 -07:00
test_update_stale_dashboard.py	fix(tests): make test_update_stale_dashboard immune to hermes_cli.main reload (#17881)	2026-04-30 02:46:56 -07:00
test_update_yes_flag.py	feat(update): add --yes/-y flag to skip interactive prompts (#18261)	2026-04-30 23:06:32 -07:00
test_user_providers_model_switch.py	test(model_switch): cover private user_providers override	2026-04-30 19:44:26 -07:00
test_voice_wrapper.py	fix(tui): respect voice.record_key config (supersedes #19028, #19339) (#19835)	2026-05-04 15:49:28 -07:00
test_web_server.py	Merge upstream/main and address Copilot review feedback	2026-04-30 06:43:22 -04:00
test_web_server_host_header.py	fix(web_server,whatsapp-bridge): validate Host header against bound interface (#13530)	2026-04-21 06:26:35 -07:00
test_web_ui_build.py	fix(cli): check hermes_cli/web_dist/ not web/dist/ for build staleness	2026-04-26 18:43:57 -07:00
test_webhook_cli.py	feat(webhook): hermes webhook CLI + skill for event-driven subscriptions (#3578)	2026-03-28 14:33:35 -07:00
test_xiaomi_provider.py	feat(providers): add tencent-tokenhub provider support	2026-04-28 03:45:52 -07:00