hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-13 14:02:16 +00:00

kshitij 0554ef1aa3 fix(agent): fallback immediately on provider content-policy blocks (#33883)	2026-05-28 07:28:24 -07:00

* fix(agent): fallback immediately on provider content-policy blocks

Provider safety-filter refusals (e.g. OpenAI Codex 'flagged for possible cybersecurity risk', OpenAI moderation 'violates our usage policies', Anthropic safety-system rejections, Azure content_filter) are deterministic decisions about a specific prompt. Retrying the same prompt up to api_max_retries times just reproduces the same refusal and burns paid attempts before surfacing the generic 'API failed after 3 retries — <provider message>' to Telegram / cron with no indication that the failure came from the model provider rather than Hermes itself.

Classify these as a new FailoverReason.content_policy_blocked (non-retryable, should_fallback=True) and route them through the existing is_client_error path so the loop:

- skips the 3x retry backoff
- activates a configured fallback model immediately
- emits a clear provider-safety message to the user (not the generic 'Non-retryable error (HTTP None)') and surfaces actionable guidance when no fallback is configured (rephrase, narrow context, or set fallback_model in hermes config)
- returns a final_response that explicitly tells the user this came from the model provider, so gateway delivery is unambiguous and cron last_status reflects the safety block rather than a vague 'agent reported failure'

Patterns are intentionally narrow: verbatim refusal phrasings keyed to specific provider safety pipelines, not generic words like 'policy' or 'violation' that would collide with billing / format / auth errors. Regression guards in test_18028_content_policy_blocked.py verify billing 402s, generic 400s, and OpenRouter account-level provider_policy_blocked remain distinct classifications.

Salvaged from #18164 onto current main (file restructure: loop logic moved from run_agent.py to agent/conversation_loop.py, _emit_status → _buffer_status), broadened patterns beyond the original OpenAI Codex cybersecurity case to cover OpenAI moderation, Anthropic safety system, and Azure content_filter; added user-actionable guidance and a clear final_response so cron/gateway surfaces the policy block instead of a generic non-retryable error, and added a regression-guard test module mirroring the is_client_error predicate. Addresses #18028.

Co-authored-by: Kuan-Chieh Huang <kchuang1015@users.noreply.github.com>

* chore: add kchuang1015 to AUTHOR_MAP

---------

Co-authored-by: Kuan-Chieh Huang <kchuang1015@users.noreply.github.com>
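The classification described in the commit message above can be illustrated with a minimal Python sketch. This is not the code from the commit. Only FailoverReason.content_policy_blocked, should_fallback, api_max_retries, and the three quoted refusal phrasings come from the message; the Classification dataclass, classify_error, next_action, the `other` reason, and its retry defaults are hypothetical names and values chosen for illustration.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class FailoverReason(Enum):
    content_policy_blocked = "content_policy_blocked"  # named in the commit message
    other = "other"  # hypothetical catch-all for this sketch


@dataclass(frozen=True)
class Classification:  # hypothetical container, not the project's type
    reason: FailoverReason
    retryable: bool
    should_fallback: bool


# Narrow phrasings quoted in the commit message. The real pattern list is not
# shown there, so this tuple is an assumption.
CONTENT_POLICY_PATTERNS = (
    "flagged for possible cybersecurity risk",
    "violates our usage policies",
    "content_filter",
)


def classify_error(message: str) -> Classification:
    """Map a provider error message to a classification (illustrative only)."""
    text = message.lower()
    if any(pattern in text for pattern in CONTENT_POLICY_PATTERNS):
        # Deterministic refusal: retrying the same prompt cannot help.
        return Classification(
            FailoverReason.content_policy_blocked,
            retryable=False,
            should_fallback=True,
        )
    # Placeholder defaults for everything else; an assumption of this sketch.
    return Classification(FailoverReason.other, retryable=True, should_fallback=False)


def next_action(c: Classification, fallback_model: Optional[str]) -> str:
    """Decide what the loop does next: retry, switch models, or give up."""
    if c.retryable:
        return "retry"
    if c.should_fallback and fallback_model:
        return f"fallback:{fallback_model}"
    return "fail"


blocked = classify_error("This request was flagged for possible cybersecurity risk")
billing = classify_error("Billing policy violation: payment required")
```

In this sketch a message containing only generic words such as 'policy' or 'violation' falls through to the catch-all, which mirrors the stated intent of keeping the patterns narrow.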
acp	test(acp): drop flaky runtime_calls[-1] tail-position assertion	2026-05-24 23:23:12 -07:00
acp_adapter	feat(azure-foundry): add Microsoft Entra ID auth	2026-05-18 10:14:38 -07:00
agent	fix(agent): fallback immediately on provider content-policy blocks (#33883)	2026-05-28 07:28:24 -07:00
cli	test(auth): update entitlement CI expectations	2026-05-28 00:19:31 -07:00
cron	test(ci): harden two flaky tests against CI noise (#33675)	2026-05-27 23:15:41 -07:00
docker	fix(docker): bake build-time git SHA into the image	2026-05-28 15:14:05 +10:00
e2e	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
fakes
gateway	fix(discord): skip backfill for auto-created threads and update test fakes	2026-05-28 04:52:02 -07:00
hermes_cli	fix(xai-oauth): accept bare-code manual paste (state=None) (#26923) (#33880)	2026-05-28 05:47:30 -07:00
hermes_state	feat(session_search): single-shape tool with discovery, scroll, browse — no LLM (#27590)	2026-05-17 23:28:45 -07:00
honcho_plugin	fix(honcho): align peer-card read and write paths	2026-05-27 10:49:33 -07:00
integration	chore(web): remove web_crawl tool + provider crawl plumbing (#33824)	2026-05-28 04:52:42 -07:00
openviking_plugin
plugins	chore(web): remove web_crawl tool + provider crawl plumbing (#33824)	2026-05-28 04:52:42 -07:00
providers	remove Vercel AI Gateway and Vercel Sandbox (#33067)	2026-05-27 00:43:32 -07:00
run_agent	fix(agent): fallback immediately on provider content-policy blocks (#33883)	2026-05-28 07:28:24 -07:00
scripts	feat(acp-registry): switch to uvx distribution, drop npm launcher	2026-05-14 22:27:09 -07:00
skills	fix(skills): add timeout to Google OAuth urlopen calls	2026-05-19 00:11:44 -07:00
stress	docs: align kanban readiness docs and smoke tests	2026-05-18 21:07:03 -07:00
tools	chore(web): remove web_crawl tool + provider crawl plumbing (#33824)	2026-05-28 04:52:42 -07:00
tui_gateway	chore: ruff auto-fix PLR6201 resweep — tuple → set in membership tests (#27355)	2026-05-17 02:29:41 -07:00
website
__init__.py
conftest.py	test(dashboard-auth): strip HERMES_DASHBOARD_OAUTH_* env vars in hermetic fixture	2026-05-27 02:12:27 -07:00
run_interrupt_test.py
test_account_usage.py
test_atomic_replace_symlinks.py
test_base_url_hostname.py
test_batch_runner_checkpoint.py
test_bitwarden_secrets.py	perf(cli): cut hermes startup 63% — flip head-to-head vs codex (#31968)	2026-05-25 03:06:39 -07:00
test_cli_file_drop.py
test_cli_manual_compress.py	fix(tests): catch up six stale tests after compression/aux/kanban changes (#28465)	2026-05-18 21:43:59 -07:00
test_cli_skin_integration.py
test_ctx_halving_fix.py
test_docker_home_override_scripts.py	fix(docker): align HOME for dashboard and s6 gateway services (#33481)	2026-05-28 13:42:27 +10:00
test_empty_model_fallback.py
test_env_loader_secret_sources.py	fix(secrets): only apply external secrets once per HERMES_HOME per process (#32271)	2026-05-25 15:18:55 -07:00
test_evidence_store.py
test_gateway_streaming_nested_config.py
test_get_tool_definitions_cache_isolation.py
test_hermes_bootstrap.py
test_hermes_constants.py	fix(security): guard os.chmod(parent) against / and top-level dirs	2026-05-20 22:56:55 -07:00
test_hermes_home_profile_warning.py
test_hermes_logging.py	fix(tests): catch up 25 stale tests after recent merges (#28626)	2026-05-19 01:28:32 -07:00
test_hermes_state.py	fix(kanban): skip redundant WAL pragma on already-WAL connections	2026-05-27 14:31:55 -07:00
test_hermes_state_wal_fallback.py	fix(kanban): skip redundant WAL pragma on already-WAL connections	2026-05-27 14:31:55 -07:00
test_honcho_client_config.py
test_honcho_session_context.py	fix(honcho): align user context peer perspective	2026-05-27 10:49:33 -07:00
test_install_sh_browser_install.py
test_install_sh_pythonpath_sanitization.py
test_install_sh_root_fhs_uv_python_path.py	test(install): harden uv-python-path regression test against future drift	2026-05-27 13:55:51 -07:00
test_install_sh_setup_wizard_tty_probe.py
test_install_sh_symlink_stomp.py
test_install_sh_termux_network_prereqs.py
test_ipv4_preference.py
test_lazy_session_regressions.py
test_lint_config.py
test_live_system_guard_self_test.py	chore: ruff auto-fix PLR6201 resweep — tuple → set in membership tests (#27355)	2026-05-17 02:29:41 -07:00
test_mcp_serve.py
test_mini_swe_runner.py
test_minimax_model_validation.py
test_minimax_oauth.py	fix(minimax-oauth): refresh short-lived access tokens per request (#30619)	2026-05-22 15:16:15 -07:00
test_minisweagent_path.py
test_model_picker_scroll.py
test_model_tools.py	chore: remove Atropos RL environments and tinker-atropos integration (#26106)	2026-05-15 10:36:38 +05:30
test_model_tools_async_bridge.py
test_ollama_num_ctx.py
test_package_json_lazy_deps.py	fix(update): make Camofox lazy-installed instead of eager (#27055)	2026-05-16 12:15:45 -07:00
test_packaging_metadata.py
test_plugin_skills.py
test_process_loop_event_loop_warning.py
test_project_metadata.py	remove Vercel AI Gateway and Vercel Sandbox (#33067)	2026-05-27 00:43:32 -07:00
test_retry_utils.py
test_run_tests_parallel.py	test: use subprocesses for each test file (#29016)	2026-05-21 16:40:04 +05:30
test_sanitize_tool_error.py	security: sanitize tool error strings before injecting into model context (#26823)	2026-05-16 00:57:39 -07:00
test_sql_injection.py
test_subprocess_home_isolation.py	fix: avoid process-wide cron profile home mutation	2026-05-18 17:39:50 +00:00
test_termux_all_extra_compat.py
test_timezone.py	chore: ruff auto-fix PLR6201 resweep — tuple → set in membership tests (#27355)	2026-05-17 02:29:41 -07:00
test_toolset_distributions.py
test_toolsets.py
test_trajectory_compressor.py
test_trajectory_compressor_async.py
test_transform_llm_output_hook.py
test_transform_tool_result_hook.py
test_tui_gateway_server.py	feat: add TUI session orchestrator	2026-05-26 20:51:59 -07:00
test_utils_truthy_values.py
test_yuanbao_integration.py
test_yuanbao_markdown.py
test_yuanbao_pipeline.py
test_yuanbao_proto.py