hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-31 19:16:29 +00:00

History

Julien Talbot b577697189 fix(model_metadata): add xAI Grok context length fallbacks xAI /v1/models does not return context_length metadata, so Hermes probes down to the 128k default whenever a user configures a custom provider pointing at https://api.x.ai/v1. This forces every xAI user to manually override model.context_length in config.yaml (2M for Grok 4.20 / 4.1-fast / 4-fast) or lose most of the usable context window. Add DEFAULT_CONTEXT_LENGTHS entries for the Grok family so the fallback lookup returns the correct value via substring matching. Values sourced from models.dev (2026-04) and cross-checked against the xAI /v1/models listing: - grok-4.20-* 2,000,000 (reasoning, non-reasoning, multi-agent) - grok-4-1-fast-* 2,000,000 - grok-4-fast-* 2,000,000 - grok-4 / grok-4-0709 256,000 - grok-code-fast-1 256,000 - grok-3* 131,072 - grok-2 / latest 131,072 - grok-2-vision* 8,192 - grok (catch-all) 131,072 Keys are ordered longest-first so that specific variants match before the catch-all, consistent with the existing Claude/Gemma/MiniMax entries. Add TestDefaultContextLengths.test_grok_models_context_lengths and test_grok_substring_matching to pin the values and verify the full lookup path. All 77 tests in test_model_metadata.py pass.		2026-04-10 03:04:19 -07:00
..
__init__.py	Refactor Terminal and AIAgent cleanup	2026-02-21 22:31:43 -08:00
anthropic_adapter.py	feat: add Anthropic Fast Mode support to /fast command (#7037 )	2026-04-10 02:32:15 -07:00
auxiliary_client.py	fix: update Kimi Coding User-Agent to KimiCLI/1.30.0	2026-04-10 02:37:28 -07:00
builtin_memory_provider.py	refactor: add tool_error/tool_result helpers + read_raw_config, migrate 129 callsites	2026-04-07 13:36:38 -07:00
context_compressor.py	fix: insert static fallback when compression summary fails	2026-04-09 14:28:56 -07:00
context_references.py	refactor: replace inline HERMES_HOME re-implementations with get_hermes_home()	2026-04-07 10:40:34 -07:00
copilot_acp_client.py	fix: bridge tool-calls in copilot-acp adapter	2026-04-06 01:47:57 -07:00
credential_pool.py	fix: add auth.json write-back for Codex retry and valid-token early-return paths	2026-04-09 21:48:50 -07:00
display.py	refactor: remove 24 confirmed dead functions — 432 lines of unused code	2026-04-07 11:41:26 -07:00
error_classifier.py	fix: set retryable=False for message-based auth errors in _classify_by_message() (#7027 )	2026-04-10 02:48:45 -07:00
insights.py	fix(insights): show cache tokens in overview so total adds up (#4428 )	2026-04-01 03:06:47 -07:00
memory_manager.py	refactor: add tool_error/tool_result helpers + read_raw_config, migrate 129 callsites	2026-04-07 13:36:38 -07:00
memory_provider.py	refactor: codebase-wide lint cleanup — unused imports, dead code, and inefficient patterns (#5821 )	2026-04-07 10:25:31 -07:00
model_metadata.py	fix(model_metadata): add xAI Grok context length fallbacks	2026-04-10 03:04:19 -07:00
models_dev.py	feat(qwen): add Qwen OAuth provider with portal request support	2026-04-08 13:46:30 -07:00
prompt_builder.py	feat(gateway): add BlueBubbles iMessage platform adapter (#6437 )	2026-04-08 23:54:03 -07:00
prompt_caching.py	fix(prompt-caching): skip top-level cache_control on role:tool for OpenRouter	2026-03-21 16:54:43 -07:00
rate_limit_tracker.py	feat: capture provider rate limit headers and show in /usage (#6541 )	2026-04-09 03:43:14 -07:00
redact.py	fix: mem0 API v2 compat, prefetch context fencing, secret redaction (#5423 )	2026-04-05 22:43:33 -07:00
retry_utils.py	feat(agent): add jittered retry backoff	2026-04-08 00:41:36 -07:00
skill_commands.py	feat(skills): add skill config interface + llm-wiki skill (#5635 )	2026-04-06 13:49:13 -07:00
skill_utils.py	refactor: codebase-wide lint cleanup — unused imports, dead code, and inefficient patterns (#5821 )	2026-04-07 10:25:31 -07:00
smart_model_routing.py	Merge branch 'main' into rewbs/tool-use-charge-to-subscription	2026-04-02 11:00:35 +11:00
subdirectory_hints.py	fix(agent): catch PermissionError in subdirectory hint discovery	2026-04-09 03:10:30 -07:00
title_generator.py	feat(agent): configurable timeouts for auxiliary LLM calls via config.yaml (#3597 )	2026-03-28 14:35:28 -07:00
trajectory.py	Refactor Terminal and AIAgent cleanup	2026-02-21 22:31:43 -08:00
usage_pricing.py	fix: status bar shows 26K instead of 260K for token counts with trailing zeros (#3024 )	2026-03-25 12:45:58 -07:00