hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-29 18:46:59 +00:00

History

Teknium 88643a1ba9 feat: overhaul context length detection with models.dev and provider-aware resolution (#2158 ) Replace the fragile hardcoded context length system with a multi-source resolution chain that correctly identifies context windows per provider. Key changes: - New agent/models_dev.py: Fetches and caches the models.dev registry (3800+ models across 100+ providers with per-provider context windows). In-memory cache (1hr TTL) + disk cache for cold starts. - Rewritten get_model_context_length() resolution chain: 0. Config override (model.context_length) 1. Custom providers per-model context_length 2. Persistent disk cache 3. Endpoint /models (local servers) 4. Anthropic /v1/models API (max_input_tokens, API-key only) 5. OpenRouter live API (existing, unchanged) 6. Nous suffix-match via OpenRouter (dot/dash normalization) 7. models.dev registry lookup (provider-aware) 8. Thin hardcoded defaults (broad family patterns) 9. 128K fallback (was 2M) - Provider-aware context: same model now correctly resolves to different context windows per provider (e.g. claude-opus-4.6: 1M on Anthropic, 128K on GitHub Copilot). Provider name flows through ContextCompressor. - DEFAULT_CONTEXT_LENGTHS shrunk from 80+ entries to ~16 broad patterns. models.dev replaces the per-model hardcoding. - CONTEXT_PROBE_TIERS changed from [2M, 1M, 512K, 200K, 128K, 64K, 32K] to [128K, 64K, 32K, 16K, 8K]. Unknown models no longer start at 2M. - hermes model: prompts for context_length when configuring custom endpoints. Supports shorthand (32k, 128K). Saved to custom_providers per-model config. - custom_providers schema extended with optional models dict for per-model context_length (backward compatible). - Nous Portal: suffix-matches bare IDs (claude-opus-4-6) against OpenRouter's prefixed IDs (anthropic/claude-opus-4.6) with dot/dash normalization. Handles all 15 current Nous models. - Anthropic direct: queries /v1/models for max_input_tokens. Only works with regular API keys (sk-ant-api*), not OAuth tokens. Falls through to models.dev for OAuth users. Tests: 5574 passed (18 new tests for models_dev + updated probe tiers) Docs: Updated configuration.md context length section, AGENTS.md Co-authored-by: Test <test@test.com>		2026-03-20 06:04:33 -07:00
..
__init__.py	Refactor Terminal and AIAgent cleanup	2026-02-21 22:31:43 -08:00
anthropic_adapter.py	fix(anthropic): tool_choice 'none' still allowed tool calls	2026-03-17 04:02:49 -07:00
auxiliary_client.py	fix: respect config.yaml model.base_url for Anthropic provider (#1948 ) (#1998 )	2026-03-18 16:51:24 -07:00
context_compressor.py	feat: overhaul context length detection with models.dev and provider-aware resolution (#2158 )	2026-03-20 06:04:33 -07:00
copilot_acp_client.py	feat: integrate GitHub Copilot providers across Hermes	2026-03-17 23:40:22 -07:00
display.py	feat(tools): centralize tool emoji metadata in registry + skin integration	2026-03-15 20:21:21 -07:00
insights.py	fix(security): eliminate SQL string formatting in execute() calls	2026-03-19 15:16:35 +01:00
model_metadata.py	feat: overhaul context length detection with models.dev and provider-aware resolution (#2158 )	2026-03-20 06:04:33 -07:00
models_dev.py	feat: overhaul context length detection with models.dev and provider-aware resolution (#2158 )	2026-03-20 06:04:33 -07:00
prompt_builder.py	fix(cron): remove send_message/clarify from cron agents + autonomous prompt	2026-03-20 05:18:05 -07:00
prompt_caching.py	fix(cache_control) treat empty text like None to avoid anthropic api cache_control error	2026-03-13 18:08:46 -07:00
redact.py	feat: secure skill env setup on load (core #688 )	2026-03-13 03:14:04 -07:00
skill_commands.py	fix: disabled skills respected across banner, system prompt, slash commands, and skill_view (#1897 )	2026-03-18 03:17:37 -07:00
smart_model_routing.py	feat: integrate GitHub Copilot providers across Hermes	2026-03-17 23:40:22 -07:00
title_generator.py	feat: auto-generate session titles after first exchange	2026-03-17 04:14:40 -07:00
trajectory.py	Refactor Terminal and AIAgent cleanup	2026-02-21 22:31:43 -08:00
usage_pricing.py	feat: use endpoint metadata for custom model context and pricing (#1906 )	2026-03-18 03:04:07 -07:00