hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-29 18:46:59 +00:00


Teknium 125de02056 fix(context): honor custom_providers context_length on /model switch + bump probe tier to 256K (#15844)	2026-04-25 18:47:53 -07:00

Fixes #15779.

Custom-provider per-model context_length (`custom_providers[].models.<id>.context_length`) is now honored across every resolution path, not just agent startup. Also adds 256K as the top probe tier and default fallback.

## What changed

New helper `hermes_cli.config.get_custom_provider_context_length()` — single source of truth for the per-model override lookup, with trailing-slash-insensitive base-url matching.

`agent.model_metadata.get_model_context_length()` gains an optional `custom_providers=` kwarg (step 0b — runs after explicit `config_context_length` but before every other probe).

Wired through five call sites that previously either duplicated the lookup or ignored it entirely:

- `run_agent.py` startup — refactored to use the new helper (dedups legacy inline loop, keeps invalid-value warning)
- `AIAgent.switch_model()` — re-reads custom_providers from live config on every /model switch
- `hermes_cli.model_switch.resolve_display_context_length()` — new `custom_providers=` kwarg
- `gateway/run.py` /model confirmation (picker callback + text path)
- `gateway/run.py` `_format_session_info` (/info)

## Context probe tiers

`CONTEXT_PROBE_TIERS = [256_000, 128_000, 64_000, 32_000, 16_000, 8_000]` — was `[128_000, ...]`. `DEFAULT_FALLBACK_CONTEXT` follows tier[0], so unknown models now default to 256K. The stale `128000` literal in the OpenRouter metadata-miss path is replaced with `DEFAULT_FALLBACK_CONTEXT` for consistency.

## Repro (from #15779)

```yaml
custom_providers:
  - name: my-custom-endpoint
    base_url: https://example.invalid/v1
    model: gpt-5.5
    models:
      gpt-5.5:
        context_length: 1050000
```

`/model gpt-5.5 --provider custom:my-custom-endpoint` → previously "Context: 128,000", now "Context: 1,050,000".

## Tests

- `tests/hermes_cli/test_custom_provider_context_length.py` — new file, 19 tests covering the helper, step-0b integration, and the 256K tier invariants
- `tests/hermes_cli/test_model_switch_context_display.py` — added regression tests for #15779 through the display resolver
- `tests/gateway/test_session_info.py` — updated default-fallback assertion (128K → 256K)
- `tests/agent/test_model_metadata.py` — updated tier assertions for the new top tier
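
The lookup order described in the commit message above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the function names `lookup_custom_context_length` and `resolve_context_length` are hypothetical stand-ins, the validation rule (positive integer) is an assumption, and the real resolver runs further probes between the custom-provider step and the final fallback that are omitted here. Only the tier list, the fallback rule, and the repro config are taken from the commit message.

```python
from typing import Any, Dict, List, Optional

# Tier list as quoted in the commit message; the fallback follows tier[0].
CONTEXT_PROBE_TIERS = [256_000, 128_000, 64_000, 32_000, 16_000, 8_000]
DEFAULT_FALLBACK_CONTEXT = CONTEXT_PROBE_TIERS[0]


def _normalize_base_url(url: str) -> str:
    """Strip whitespace and trailing slashes so '.../v1' matches '.../v1/'."""
    return url.strip().rstrip("/")


def lookup_custom_context_length(
    custom_providers: Optional[List[Dict[str, Any]]],
    base_url: str,
    model: str,
) -> Optional[int]:
    """Hypothetical per-model override lookup (assumed shape, not the real helper)."""
    target = _normalize_base_url(base_url)
    for provider in custom_providers or []:
        if _normalize_base_url(str(provider.get("base_url", ""))) != target:
            continue
        entry = (provider.get("models") or {}).get(model)
        if not isinstance(entry, dict):
            continue
        value = entry.get("context_length")
        # Assumed validation: accept only positive integers (bool is excluded).
        if isinstance(value, int) and not isinstance(value, bool) and value > 0:
            return value
    return None


def resolve_context_length(
    custom_providers: Optional[List[Dict[str, Any]]],
    base_url: str,
    model: str,
    config_context_length: Optional[int] = None,
) -> int:
    """Explicit config first, then the custom-provider override, then the fallback."""
    if config_context_length:
        return config_context_length
    override = lookup_custom_context_length(custom_providers, base_url, model)
    if override is not None:
        return override
    # The other probes of the real resolver are left out of this sketch.
    return DEFAULT_FALLBACK_CONTEXT


# Python equivalent of the YAML repro config from the commit message.
EXAMPLE_PROVIDERS = [
    {
        "name": "my-custom-endpoint",
        "base_url": "https://example.invalid/v1",
        "model": "gpt-5.5",
        "models": {"gpt-5.5": {"context_length": 1050000}},
    }
]

if __name__ == "__main__":
    # Trailing slash on the queried URL still matches the configured provider.
    print(resolve_context_length(EXAMPLE_PROVIDERS, "https://example.invalid/v1/", "gpt-5.5"))
    # A model without an override falls back to the top probe tier.
    print(resolve_context_length(EXAMPLE_PROVIDERS, "https://example.invalid/v1", "other-model"))
```

Keeping the override lookup in one function mirrors the stated motivation for the change: every call site asks the same helper, so a `/model` switch and agent startup cannot disagree about the context length.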
acp	fix(acp): include MCP toolsets in ACP sessions	2026-04-24 03:04:42 -07:00
agent	fix(context): honor custom_providers context_length on /model switch + bump probe tier to 256K (#15844)	2026-04-25 18:47:53 -07:00
cli	refactor(memory): remove flush_memories entirely (#15696)	2026-04-25 08:21:14 -07:00
cron	fix(cron): wire context_from through the update action	2026-04-25 04:49:28 -07:00
e2e	test(discord): add guild to fake e2e messages	2026-04-25 18:25:56 -07:00
environments/benchmarks	fix(security): consolidated security hardening — SSRF, timing attack, tar traversal, credential leakage (#5944)	2026-04-07 17:28:37 -07:00
fakes	fix: streaming tool call parsing, error handling, and fake HA state mutation	2026-03-14 14:27:20 +03:00
gateway	fix(context): honor custom_providers context_length on /model switch + bump probe tier to 256K (#15844)	2026-04-25 18:47:53 -07:00
hermes_cli	fix(context): honor custom_providers context_length on /model switch + bump probe tier to 256K (#15844)	2026-04-25 18:47:53 -07:00
hermes_state	fix(resume): redirect --resume to the descendant that actually holds the messages	2026-04-24 03:04:42 -07:00
honcho_plugin	feat(honcho): wizard cadence default 2, surface reasoning level, backwards-compat fallback	2026-04-18 22:50:55 -07:00
integration	fix(discord): strip RTP padding before DAVE/Opus decode (#11267)	2026-04-16 16:50:15 -07:00
plugins	feat(hindsight): optional bank_id_template for per-agent / per-user banks	2026-04-24 03:38:17 -07:00
run_agent	fix(agent): preserve Codex message items for replay	2026-04-25 18:22:06 -07:00
skills	fix(skills): factor HERMES_HOME resolution into shared _hermes_home helper	2026-04-24 16:45:27 -07:00
tools	fix(terminal): three-layer defense against watch_patterns notification spam (#15642)	2026-04-25 06:41:58 -07:00
tui_gateway	fix(tui): keep default personality neutral	2026-04-24 16:19:23 -05:00
__init__.py	A bit of restructuring for simplicity and organization	2025-10-01 23:29:25 +00:00
conftest.py	test(conftest): reset module-level state + unset platform allowlists (#13400)	2026-04-21 01:33:10 -07:00
run_interrupt_test.py	fix: thread safety for concurrent subagent delegation (#1672)	2026-03-17 02:53:33 -07:00
test_account_usage.py	feat(account-usage): add per-provider account limits module	2026-04-21 01:56:35 -07:00
test_base_url_hostname.py	security(runtime_provider): close OLLAMA_API_KEY substring-leak sweep miss (#13522)	2026-04-21 06:06:16 -07:00
test_batch_runner_checkpoint.py	test: regression coverage for checkpoint dedup and inf/nan coercion	2026-04-24 14:32:21 -07:00
test_cli_file_drop.py	fix(tui): improve macOS paste and shortcut parity	2026-04-21 08:00:00 -07:00
test_cli_skin_integration.py	fix: align status bar skin tests with upstream main	2026-04-22 13:20:02 -07:00
test_ctx_halving_fix.py	fix(tests): fix 78 CI test failures and remove dead test (#9036)	2026-04-13 10:50:24 -07:00
test_empty_model_fallback.py	fix: fall back to provider's default model when model config is empty (#8303)	2026-04-12 03:53:30 -07:00
test_evidence_store.py	feat: add OSS Security Forensics skill (Skills Hub) (#1482)	2026-03-15 21:59:53 -07:00
test_hermes_constants.py	fix(gateway): harden Docker/container gateway pathway	2026-04-12 16:36:11 -07:00
test_hermes_logging.py	fix(tests): fix 78 CI test failures and remove dead test (#9036)	2026-04-13 10:50:24 -07:00
test_hermes_state.py	fix(agent): preserve Codex message items for replay	2026-04-25 18:22:06 -07:00
test_honcho_client_config.py	feat(memory): pluggable memory provider interface with profile isolation, review fixes, and honcho CLI restoration (#4623)	2026-04-02 15:33:51 -07:00
test_ipv4_preference.py	feat: add network.force_ipv4 config to fix IPv6 timeout issues (#8196)	2026-04-11 23:12:11 -07:00
test_mcp_serve.py	feat: add MCP server mode — hermes mcp serve (#3795)	2026-03-29 15:47:19 -07:00
test_mini_swe_runner.py	fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157)	2026-04-20 12:23:05 -07:00
test_minimax_model_validation.py	fix(models): validate MiniMax models against static catalog (#12611, #12460, #12399, #12547)	2026-04-19 22:44:47 -07:00
test_minisweagent_path.py	chore: remove all remaining mini-swe-agent references	2026-03-24 08:19:23 -07:00
test_model_picker_scroll.py	fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974)	2026-04-07 17:59:42 -07:00
test_model_tools.py	test: regression coverage for checkpoint dedup and inf/nan coercion	2026-04-24 14:32:21 -07:00
test_model_tools_async_bridge.py	fix(core): ensure non-blocking executor shutdown on async timeout	2026-04-22 14:42:32 -07:00
test_ollama_num_ctx.py	fix: provider/model resolution — salvage 4 PRs + MiniMax aux URL fix (#5983)	2026-04-07 22:23:28 -07:00
test_packaging_metadata.py	chore: prepare Hermes for Homebrew packaging (#4099)	2026-03-30 17:34:43 -07:00
test_plugin_skills.py	fix(tests): attach caplog to specific logger in 3 order-dependent tests (#11453)	2026-04-17 00:20:40 -07:00
test_project_metadata.py	build(deps): add qrcode to dingtalk + feishu extras (parity with messaging) (#11627)	2026-04-17 13:31:53 -07:00
test_retry_utils.py	feat(agent): add jittered retry backoff	2026-04-08 00:41:36 -07:00
test_sql_injection.py	fix(security): eliminate SQL string formatting in execute() calls	2026-03-19 15:16:35 +01:00
test_subprocess_home_isolation.py	fix: per-profile subprocess HOME isolation (#4426) (#7357)	2026-04-10 13:37:45 -07:00
test_timezone.py	test: speed up slow tests (backoff + subprocess + IMDS network) (#11797)	2026-04-17 14:21:22 -07:00
test_toolset_distributions.py	test: add unit tests for 8 modules (batch 2)	2026-02-26 13:54:20 +03:00
test_toolsets.py	feat(discord): split discord_server into discord + discord_admin tools	2026-04-25 04:50:14 -07:00
test_trajectory_compressor.py	fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157)	2026-04-20 12:23:05 -07:00
test_trajectory_compressor_async.py	fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157)	2026-04-20 12:23:05 -07:00
test_transform_tool_result_hook.py	test: stop testing mutable data — convert change-detectors to invariants (#13363)	2026-04-20 23:20:33 -07:00
test_tui_gateway_server.py	fix(tui): sync inference model after switches	2026-04-25 14:17:57 -05:00
test_utils_truthy_values.py	Gate tool-gateway behind an env var, so it's not in users' faces until we're ready. Even if users enable it, it'll be blocked server-side for now, until we unlock for non-admin users on tool-gateway.	2026-03-30 13:28:10 +09:00