hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-31 19:16:29 +00:00

History

Rob Moen 0dd373ec43 fix(context): honor model.context_length for Ollama num_ctx and all display paths When a user sets model.context_length in config.yaml, the value was only used for Hermes' internal compression decisions (context_compressor) but NOT for Ollama's num_ctx parameter. Ollama auto-detects context from GGUF metadata (often 256K+) and allocates that much VRAM regardless of the user's config — causing OOM on smaller GPUs like the P100 (16GB). Root cause: two separate context values existed independently: - context_compressor.context_length = config value (e.g. 65536) ✓ - _ollama_num_ctx = GGUF metadata value (e.g. 256000) ✗ ignored config Changes: 1. Cap Ollama num_ctx to config context_length (run_agent.py) When model.context_length is explicitly set and no explicit ollama_num_ctx override exists, cap the auto-detected GGUF value to the user's context_length. This is the core fix — it prevents Ollama from allocating more VRAM than the user budgeted. 2. Pass config_context_length through all secondary call sites Several paths called get_model_context_length() without the config override, falling through to the 256K default fallback: - cli.py: @-reference expansion and /model switch display - gateway/run.py: @-reference expansion and /model switch display - tui_gateway/server.py: @-reference expansion - hermes_cli/model_switch.py: resolve_display_context_length() 3. Normalize root-level context_length in config (hermes_cli/config.py) _normalize_root_model_keys() now migrates root-level context_length into the model section, matching existing behavior for provider and base_url. Users who wrote `context_length: 65536` at the YAML root instead of under `model:` had it silently ignored. 4. Fix misleading comments (agent/model_metadata.py) DEFAULT_FALLBACK_CONTEXT is 256K (CONTEXT_PROBE_TIERS[0]), not 128K as two comments stated. Tests: 3 new tests for root-level context_length normalization. All existing context_length tests pass (96 tests).		2026-04-30 04:31:23 -07:00
..
__init__.py	refactor(tests): re-architect tests + fix CI failures (#5946 )	2026-04-07 17:19:07 -07:00
test_branch_command.py	feat(memory): notify providers on mid-process session_id rotation (#17409 )	2026-04-29 04:57:22 -07:00
test_busy_input_mode_command.py	feat(busy): add 'steer' as a third display.busy_input_mode option (#16279 )	2026-04-26 18:21:29 -07:00
test_cli_approval_ui.py	fix(cli): wire approvals in background tasks	2026-04-26 12:29:48 -07:00
test_cli_background_tui_refresh.py	refactor(tests): re-architect tests + fix CI failures (#5946 )	2026-04-07 17:19:07 -07:00
test_cli_bracketed_paste_sanitizer.py	fix(cli): strip leaked bracketed-paste wrappers	2026-04-26 21:47:40 -07:00
test_cli_browser_connect.py	fix(browser): address Copilot review on /browser connect	2026-04-28 22:11:10 -07:00
test_cli_context_warning.py	refactor(tests): re-architect tests + fix CI failures (#5946 )	2026-04-07 17:19:07 -07:00
test_cli_copy_command.py	feat: add /copy and /agents	2026-04-09 17:19:36 -05:00
test_cli_extension_hooks.py	refactor(tests): re-architect tests + fix CI failures (#5946 )	2026-04-07 17:19:07 -07:00
test_cli_external_editor.py	feat(cli): add editor workflow for drafts	2026-04-20 02:53:40 -07:00
test_cli_file_drop.py	fix(tui): improve macOS paste and shortcut parity	2026-04-21 08:00:00 -07:00
test_cli_force_redraw.py	fix(cli): eliminate ghost status-bar + DSR input leaks from terminal drift	2026-04-27 05:31:47 -07:00
test_cli_image_command.py	fix(termux): harden execute_code and mobile browser/audio UX	2026-04-09 16:24:53 -07:00
test_cli_init.py	fix(context): honor model.context_length for Ollama num_ctx and all display paths	2026-04-30 04:31:23 -07:00
test_cli_interrupt_subagent.py	fix: resolve CI test failures — add missing functions, fix stale tests (#9483 )	2026-04-14 01:43:45 -07:00
test_cli_loading_indicator.py	fix: clean up defensive shims and finish CI stabilization from #17660 (#17801 )	2026-04-29 23:53:17 -07:00
test_cli_markdown_rendering.py	Rename test variables	2026-04-21 16:00:34 -03:00
test_cli_mcp_config_watch.py	refactor(tests): re-architect tests + fix CI failures (#5946 )	2026-04-07 17:19:07 -07:00
test_cli_new_session.py	refactor(memory): remove flush_memories entirely (#15696 )	2026-04-25 08:21:14 -07:00
test_cli_prefix_matching.py	refactor(tests): re-architect tests + fix CI failures (#5946 )	2026-04-07 17:19:07 -07:00
test_cli_preloaded_skills.py	refactor(tests): re-architect tests + fix CI failures (#5946 )	2026-04-07 17:19:07 -07:00
test_cli_provider_resolution.py	refactor: remove smart_model_routing feature (#12732 )	2026-04-19 18:12:55 -07:00
test_cli_reload_skills.py	refactor(reload-skills): queue note for next turn, drop cache invalidation + agent tool	2026-04-29 21:07:47 -07:00
test_cli_retry.py	refactor(tests): re-architect tests + fix CI failures (#5946 )	2026-04-07 17:19:07 -07:00
test_cli_save_config_value.py	fix(config): preserve env refs when save_config rewrites config (#11892 )	2026-04-17 19:03:26 -07:00
test_cli_secret_capture.py	refactor(tests): re-architect tests + fix CI failures (#5946 )	2026-04-07 17:19:07 -07:00
test_cli_shutdown_memory_messages.py	fix(cli): pass session messages to shutdown_memory_provider (#15165 sibling)	2026-04-27 06:41:16 -07:00
test_cli_skin_integration.py	fix(tui): restore macOS copy behavior and theme polish (#17131 )	2026-04-28 18:47:14 -05:00
test_cli_status_bar.py	fix(cli): use display width for wrapped spinner height	2026-04-18 14:34:05 -07:00
test_cli_status_command.py	fix(profile): use existing get_active_profile_name() for /profile command	2026-04-15 17:52:03 -07:00
test_cli_steer_busy_path.py	fix(cli): dispatch /steer inline while agent is running (#13354 )	2026-04-20 23:05:38 -07:00
test_cli_terminal_response_sanitizer.py	fix(cli): eliminate ghost status-bar + DSR input leaks from terminal drift	2026-04-27 05:31:47 -07:00
test_cli_tools_command.py	refactor(tests): re-architect tests + fix CI failures (#5946 )	2026-04-07 17:19:07 -07:00
test_cli_user_message_preview.py	feat(cli): improve multiline previews	2026-04-20 02:53:40 -07:00
test_compress_focus.py	feat: /compress <focus> — guided compression with focus topic (#8017 )	2026-04-11 19:23:29 -07:00
test_cwd_env_respect.py	fix: enforce config.yaml as sole CWD source + deprecate .env CWD vars + add hermes memory reset (#11029 )	2026-04-16 06:48:33 -07:00
test_fast_command.py	feat(fast): broaden /fast whitelist to all OpenAI + Anthropic models (#16883 )	2026-04-28 00:44:43 -07:00
test_gquota_command.py	fix(cli): sanitize interactive command output	2026-04-19 01:16:34 -07:00
test_manual_compress.py	fix(cli): sync session_id after compression and preserve original end_reason (#12920 )	2026-04-20 01:48:20 -07:00
test_personality_none.py	fix(gateway): use profile-aware Hermes paths in runtime hints	2026-04-15 17:52:03 -07:00
test_quick_commands.py	fix(tests): resolve 17 persistent CI test failures (#15084 )	2026-04-24 03:46:46 -07:00
test_reasoning_command.py	test: update stale tests to match current code (#11963 )	2026-04-17 21:35:30 -07:00
test_resume_display.py	fix(cli): strip all reasoning tag variants from /resume recap	2026-04-18 19:19:24 -07:00
test_save_conversation_location.py	fix(sessions): /save lands under $HERMES_HOME, widen browse+TUI picker, force-refresh ollama-cloud on setup (#16296 )	2026-04-26 18:49:48 -07:00
test_session_boundary_hooks.py	fix: add gateway coverage for session boundary hooks, move test to tests/cli/	2026-04-08 04:27:34 -07:00
test_stream_delta_think_tag.py	fix(streaming): prevent <think> in prose from suppressing response output	2026-04-09 22:16:36 -07:00
test_surrogate_sanitization.py	fix(surrogates): sanitize reasoning/reasoning_content/reasoning_details fields (#11628 )	2026-04-17 13:30:47 -07:00
test_tool_progress_scrollback.py	fix(cli): restore stacked tool progress scrollback in TUI (#8201 )	2026-04-11 23:22:34 -07:00
test_worktree.py	fix: aggressive worktree and branch cleanup to prevent accumulation (#6134 )	2026-04-08 04:44:49 -07:00
test_worktree_security.py	refactor(tests): re-architect tests + fix CI failures (#5946 )	2026-04-07 17:19:07 -07:00