hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-20 15:33:54 +00:00

History

Andre Kurait b290297d66 fix(bedrock): resolve context length via static table before custom-endpoint probe ## Problem `get_model_context_length()` in `agent/model_metadata.py` had a resolution order bug that caused every Bedrock model to fall back to the 128K default context length instead of reaching the static Bedrock table (200K for Claude, etc.). The root cause: `bedrock-runtime.<region>.amazonaws.com` is not listed in `_URL_TO_PROVIDER`, so `_is_known_provider_base_url()` returned False. The resolution order then ran the custom-endpoint probe (step 2) before the Bedrock branch (step 4b), which: 1. Treated Bedrock as a custom endpoint (via `_is_custom_endpoint`). 2. Called `fetch_endpoint_model_metadata()` → `GET /models` on the bedrock-runtime URL (Bedrock doesn't serve this shape). 3. Fell through to `return DEFAULT_FALLBACK_CONTEXT` (128K) at the "probe-down" branch — never reaching the Bedrock static table. Result: users on Bedrock saw 128K context for Claude models that actually support 200K on Bedrock, causing premature auto-compression. ## Fix Promote the Bedrock branch from step 4b to step 1b, so it runs before the custom-endpoint probe at step 2. The static table in `bedrock_adapter.py::get_bedrock_context_length()` is the authoritative source for Bedrock (the ListFoundationModels API doesn't expose context window sizes), so there's no reason to probe `/models` first. The original step 4b is replaced with a one-line breadcrumb comment pointing to the new location, to make the resolution-order docstring accurate. ## Changes - `agent/model_metadata.py` - Add step 1b: Bedrock static-table branch (unchanged predicate, moved). - Remove dead step 4b block, replace with breadcrumb comment. - Update resolution-order docstring to include step 1b. - `tests/agent/test_model_metadata.py` - New `TestBedrockContextResolution` class (3 tests): - `test_bedrock_provider_returns_static_table_before_probe`: confirms `provider="bedrock"` hits the static table and does NOT call `fetch_endpoint_model_metadata` (regression guard). - `test_bedrock_url_without_provider_hint`: confirms the `bedrock-runtime.*.amazonaws.com` host match works without an explicit `provider=` hint. - `test_non_bedrock_url_still_probes`: confirms the probe still fires for genuinely-custom endpoints (no over-reach). ## Testing pytest tests/agent/test_model_metadata.py -q # 83 passed in 1.95s (3 new + 80 existing) ## Risk Very low. - Predicate is identical to the original step 4b — no behaviour change for non-Bedrock paths. - Original step 4b was dead code for the user-facing case (always hit the 128K fallback first), so removing it cannot regress behaviour. - Bedrock path now short-circuits before any network I/O — faster too. - `ImportError` fall-through preserved so users without `boto3` installed are unaffected. ## Related - This is a prerequisite for accurate context-window accounting on Bedrock — the fix for #14710 (stale-connection client eviction) depends on correct context sizing to know when to compress. Signed-off-by: Andre Kurait <andrekurait@gmail.com>		2026-04-24 07:26:07 -07:00
..
transports	fix(kimi,mcp): Moonshot schema sanitizer + MCP schema robustness (#14805 )	2026-04-23 16:11:57 -07:00
__init__.py	test: add unit tests for 8 modules (batch 2)	2026-02-26 13:54:20 +03:00
test_anthropic_adapter.py	refactor: remove _nr_to_assistant_message shim + fix flush_memories guard	2026-04-23 02:30:05 -07:00
test_anthropic_keychain.py	fix: re-auth on stale OAuth token; read Claude Code credentials from macOS Keychain	2026-04-24 07:14:00 -07:00
test_auxiliary_client.py	fix(aux): force anthropic oauth refresh after 401	2026-04-24 07:14:00 -07:00
test_auxiliary_client_anthropic_custom.py	fix(anthropic): complete third-party Anthropic-compatible provider support (#12846 )	2026-04-19 22:43:09 -07:00
test_auxiliary_config_bridge.py	fix: remove legacy compression.summary_* config and env var fallbacks (#8992 )	2026-04-13 04:59:26 -07:00
test_auxiliary_main_first.py	feat: add Xiaomi MiMo v2.5-pro and v2.5 model support (#14635 )	2026-04-23 10:06:25 -07:00
test_auxiliary_named_custom_providers.py	fix(aux): normalize GitHub Copilot provider slugs	2026-04-24 03:33:29 -07:00
test_bedrock_adapter.py	feat: native AWS Bedrock provider via Converse API	2026-04-15 16:17:17 -07:00
test_bedrock_integration.py	fix(anthropic): auto-detect Bedrock model IDs in normalize_model_name (#12295 )	2026-04-24 07:26:07 -07:00
test_codex_cloudflare_headers.py	fix(codex): pin correct Cloudflare headers and extend to auxiliary client	2026-04-19 11:59:25 -07:00
test_compress_focus.py	fix: resolve CI test failures — add missing functions, fix stale tests (#9483 )	2026-04-14 01:43:45 -07:00
test_context_compressor.py	fix(agent): guard context compressor against structured message content	2026-04-22 14:46:51 -07:00
test_context_engine.py	feat: wire context engine plugin slot into agent and plugin system	2026-04-10 19:15:50 -07:00
test_context_references.py	fix(agent): fall back when rg is blocked for @folder references	2026-04-20 01:56:41 -07:00
test_copilot_acp_client.py	fix: set HOME for Copilot ACP subprocesses	2026-04-24 05:09:08 -07:00
test_credential_pool.py	fix(credential_pool): add Nous OAuth cross-process auth-store sync	2026-04-24 05:20:05 -07:00
test_credential_pool_routing.py	refactor: remove smart_model_routing feature (#12732 )	2026-04-19 18:12:55 -07:00
test_crossloop_client_cache.py	refactor(tests): re-architect tests + fix CI failures (#5946 )	2026-04-07 17:19:07 -07:00
test_direct_provider_url_detection.py	fix: restrict provider URL detection to exact hostname matches	2026-04-20 22:14:29 -07:00
test_display.py	fix(display): render <missing old_text> in memory previews instead of empty quotes (#12852 )	2026-04-19 22:45:47 -07:00
test_display_emoji.py	feat(tools): centralize tool emoji metadata in registry + skin integration	2026-03-15 20:21:21 -07:00
test_error_classifier.py	fix(agent): only set rate-limit cooldown when leaving primary; add tests	2026-04-24 05:35:43 -07:00
test_external_skills.py	feat(skills): support external skill directories via config (#3678 )	2026-03-29 00:33:30 -07:00
test_gemini_cloudcode.py	fix(gemini): assign unique stream indices to parallel tool calls	2026-04-20 02:10:53 -07:00
test_gemini_free_tier_gate.py	feat(gemini): block free-tier keys at setup + surface guidance on 429 (#15100 )	2026-04-24 04:46:17 -07:00
test_gemini_native_adapter.py	fix(gemini): fail fast on missing API key + surface it in hermes dump (#15133 )	2026-04-24 05:35:17 -07:00
test_gemini_schema.py	fix(gemini): drop integer/number/boolean enums from tool schemas (#15082 )	2026-04-24 03:40:00 -07:00
test_image_gen_registry.py	feat(plugins): pluggable image_gen backends + OpenAI provider (#13799 )	2026-04-21 21:30:10 -07:00
test_insights.py	test: stop testing mutable data — convert change-detectors to invariants (#13363 )	2026-04-20 23:20:33 -07:00
test_kimi_coding_anthropic_thinking.py	fix(kimi): don't send Anthropic thinking to api.kimi.com/coding (#13826 )	2026-04-21 21:19:14 -07:00
test_local_stream_timeout.py	fix(agent): recognize Tailscale CGNAT (100.64.0.0/10) as local for Ollama timeouts	2026-04-22 14:46:10 -07:00
test_memory_provider.py	fix(honcho): dialectic lifecycle — defaults, retry, prewarm consumption	2026-04-18 22:50:55 -07:00
test_memory_user_id.py	feat(hindsight): richer session-scoped retain metadata	2026-04-22 05:27:10 -07:00
test_minimax_auxiliary_url.py	fix: provider/model resolution — salvage 4 PRs + MiniMax aux URL fix (#5983 )	2026-04-07 22:23:28 -07:00
test_minimax_provider.py	fix(tests): resolve 17 persistent CI test failures (#15084 )	2026-04-24 03:46:46 -07:00
test_model_metadata.py	fix(bedrock): resolve context length via static table before custom-endpoint probe	2026-04-24 07:26:07 -07:00
test_model_metadata_local_ctx.py	fix: forward auth when probing local model metadata	2026-04-20 20:51:56 -07:00
test_model_metadata_ssl.py	fix(auth): honor SSL CA env vars across httpx + requests callsites	2026-04-24 03:00:33 -07:00
test_models_dev.py	feat: add Step Plan provider support (salvage #6005 )	2026-04-22 02:59:58 -07:00
test_moonshot_schema.py	fix(kimi,mcp): Moonshot schema sanitizer + MCP schema robustness (#14805 )	2026-04-23 16:11:57 -07:00
test_nous_rate_guard.py	fix: Nous Portal rate limit guard — prevent retry amplification (#10568 )	2026-04-15 16:31:48 -07:00
test_prompt_builder.py	feat(agent): add PLATFORM_HINTS for matrix, mattermost, and feishu (#14428 )	2026-04-23 12:50:22 +05:30
test_prompt_caching.py	fix(prompt-caching): skip top-level cache_control on role:tool for OpenRouter	2026-03-21 16:54:43 -07:00
test_proxy_and_url_validation.py	fix(agent): normalize socks:// env proxies for httpx/anthropic	2026-04-21 05:52:46 -07:00
test_rate_limit_tracker.py	feat: capture provider rate limit headers and show in /usage (#6541 )	2026-04-09 03:43:14 -07:00
test_redact.py	feat: replace kimi-k2.5 with kimi-k2.6 on OpenRouter and Nous Portal (#13148 )	2026-04-20 11:49:54 -07:00
test_shell_hooks.py	feat: shell hooks — wire shell scripts as Hermes hook callbacks	2026-04-20 20:53:51 -07:00
test_shell_hooks_consent.py	feat: shell hooks — wire shell scripts as Hermes hook callbacks	2026-04-20 20:53:51 -07:00
test_skill_commands.py	refactor(commands): drop /provider, /plan handler, and clean up slash registry (#15047 )	2026-04-24 03:10:52 -07:00
test_subagent_progress.py	feat(delegate): orchestrator role and configurable spawn depth (default flat)	2026-04-21 14:23:45 -07:00
test_subagent_stop_hook.py	feat: shell hooks — wire shell scripts as Hermes hook callbacks	2026-04-20 20:53:51 -07:00
test_subdirectory_hints.py	fix(agent): catch PermissionError in subdirectory hint discovery	2026-04-09 03:10:30 -07:00
test_title_generator.py	feat: auto-generate session titles after first exchange	2026-03-17 04:14:40 -07:00
test_usage_pricing.py	fix(usage): read top-level Anthropic cache fields from OAI-compatible proxies	2026-04-22 17:40:49 -07:00
test_vision_resolved_args.py	fix: pass resolved args to resolve_vision_provider_client()	2026-04-16 07:45:13 -07:00