hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-30 06:41:51 +00:00

History

kshitij 1a74795735 feat: add claude-opus-4.8 and claude-opus-4.8-fast (#34003 ) Anthropic released Claude Opus 4.8 on 2026-05-27, available on OpenRouter, Anthropic, Amazon Bedrock, and Claude Platform on AWS: - https://openrouter.ai/anthropic/claude-opus-4.8 - https://openrouter.ai/anthropic/claude-opus-4.8-fast The fast-mode variant is a separate model ID (anthropic/claude-opus-4.8-fast) priced at 2x of the base model — a notable improvement over the 6x premium on older Opus generations (4.6/4.7). It is NOT a `speed: "fast"` request parameter like Opus 4.6; Anthropic's native fast-mode beta still only covers Opus 4.6. Changes: hermes_cli/models.py - Add anthropic/claude-opus-4.8 + anthropic/claude-opus-4.8-fast to the OpenRouter fallback snapshot and the Nous Portal curated list (live catalogs surface them automatically when reachable; the fallback list matters when the manifest fetch fails). - Add claude-opus-4-8 to the Anthropic-native picker list. agent/model_metadata.py - Register claude-opus-4-8 / claude-opus-4.8 in DEFAULT_CONTEXT_LENGTHS with 1M tokens (matches 4.6/4.7). agent/anthropic_adapter.py - Extend _XHIGH_EFFORT_SUBSTRINGS, _ADAPTIVE_THINKING_SUBSTRINGS, and _NO_SAMPLING_PARAMS_SUBSTRINGS with "4-8"/"4.8". 4.8 inherits the Opus 4.7 API contract: adaptive thinking only, xhigh effort level supported, sampling parameters (temperature/top_p/top_k) return 400. - Add claude-opus-4-8 to _ANTHROPIC_OUTPUT_LIMITS (128k max output, same as 4.7). Matches by substring so claude-opus-4-8-fast and date-stamped variants resolve correctly. agent/usage_pricing.py - Add anthropic/claude-opus-4-8: $5/$25 per MTok input/output, $0.50 cache read, $6.25 cache write (same as 4.6/4.7). - Add anthropic/claude-opus-4-8-fast: $10/$50 per MTok (2x), $1.00 cache read, $12.50 cache write. Per OpenRouter, the 2x premium is the only differentiator from regular Opus 4.8. - OpenRouter routes still pull pricing from the live /models API, so no static OpenRouter entry is needed. tests/agent/test_model_metadata.py - Extend the Claude 4.6+ context-length tag list with 4.8/4-8. website/static/api/model-catalog.json - Regenerated via `python scripts/build_model_catalog.py` to pick up the new entries in the OpenRouter and Nous Portal fallback lists. E2E verification (isolated sys.path import against the worktree): - _supports_adaptive_thinking, _supports_xhigh_effort, _forbids_sampling_params all return True for claude-opus-4.8 and claude-opus-4.8-fast. - _supports_fast_mode (the `speed: "fast"` request-parameter gate) stays False for 4.8 — fast mode is a separate model ID on OpenRouter, not a parameter Anthropic accepts on the base model. - DEFAULT_CONTEXT_LENGTHS resolves 1M for both notations. - resolve_billing_route + _lookup_official_docs_pricing resolve the correct $5/$25 (regular) and $10/$50 (fast) pricing for both dot-notation and dash-notation inputs. - 4.7 and 4.6 regression: behavior unchanged. Unit tests: 305 passed across tests/agent/test_usage_pricing.py, test_model_metadata.py, tests/hermes_cli/test_model_catalog.py, test_models.py, test_model_validation.py, test_models_dev_preferred_merge.py.		2026-05-28 10:31:59 -07:00
..
acp	test(acp): drop flaky runtime_calls[-1] tail-position assertion	2026-05-24 23:23:12 -07:00
acp_adapter	feat(azure-foundry): add Microsoft Entra ID auth	2026-05-18 10:14:38 -07:00
agent	feat: add claude-opus-4.8 and claude-opus-4.8-fast (#34003 )	2026-05-28 10:31:59 -07:00
cli	test(auth): update entitlement CI expectations	2026-05-28 00:19:31 -07:00
cron	test(ci): harden two flaky tests against CI noise (#33675 )	2026-05-27 23:15:41 -07:00
docker	fix(docker): bake build-time git SHA into the image	2026-05-28 15:14:05 +10:00
e2e	refactor(gateway): migrate Discord adapter to bundled plugin (full Teams parity)	2026-05-22 14:21:41 -07:00
fakes
gateway	fix(discord): skip backfill for auto-created threads and update test fakes	2026-05-28 04:52:02 -07:00
hermes_cli	fix(xai-oauth): accept bare-code manual paste (state=None) (#26923 ) (#33880 )	2026-05-28 05:47:30 -07:00
hermes_state	feat(session_search): single-shape tool with discovery, scroll, browse — no LLM (#27590 )	2026-05-17 23:28:45 -07:00
honcho_plugin	fix(honcho): align peer-card read and write paths	2026-05-27 10:49:33 -07:00
integration	chore(web): remove web_crawl tool + provider crawl plumbing (#33824 )	2026-05-28 04:52:42 -07:00
openviking_plugin
plugins	chore(web): remove web_crawl tool + provider crawl plumbing (#33824 )	2026-05-28 04:52:42 -07:00
providers	feat(openrouter): pass session_id in extra_body for sticky routing	2026-05-28 08:52:19 -07:00
run_agent	fix(agent): fallback immediately on provider content-policy blocks (#33883 )	2026-05-28 07:28:24 -07:00
scripts
skills	fix(skills): add timeout to Google OAuth urlopen calls	2026-05-19 00:11:44 -07:00
stress	docs: align kanban readiness docs and smoke tests	2026-05-18 21:07:03 -07:00
tools	chore(web): remove web_crawl tool + provider crawl plumbing (#33824 )	2026-05-28 04:52:42 -07:00
tui_gateway	chore: ruff auto-fix PLR6201 resweep — tuple → set in membership tests (#27355 )	2026-05-17 02:29:41 -07:00
website
__init__.py
conftest.py	test(dashboard-auth): strip HERMES_DASHBOARD_OAUTH_* env vars in hermetic fixture	2026-05-27 02:12:27 -07:00
run_interrupt_test.py
test_account_usage.py
test_atomic_replace_symlinks.py
test_base_url_hostname.py
test_batch_runner_checkpoint.py
test_bitwarden_secrets.py	perf(cli): cut hermes startup 63% — flip head-to-head vs codex (#31968 )	2026-05-25 03:06:39 -07:00
test_cli_file_drop.py
test_cli_manual_compress.py	fix(tests): catch up six stale tests after compression/aux/kanban changes (#28465 )	2026-05-18 21:43:59 -07:00
test_cli_skin_integration.py
test_ctx_halving_fix.py
test_docker_home_override_scripts.py	fix(docker): align HOME for dashboard and s6 gateway services (#33481 )	2026-05-28 13:42:27 +10:00
test_empty_model_fallback.py
test_env_loader_secret_sources.py	fix(secrets): only apply external secrets once per HERMES_HOME per process (#32271 )	2026-05-25 15:18:55 -07:00
test_evidence_store.py
test_gateway_streaming_nested_config.py
test_get_tool_definitions_cache_isolation.py
test_hermes_bootstrap.py
test_hermes_constants.py	fix(security): guard os.chmod(parent) against / and top-level dirs	2026-05-20 22:56:55 -07:00
test_hermes_home_profile_warning.py
test_hermes_logging.py	fix(tests): catch up 25 stale tests after recent merges (#28626 )	2026-05-19 01:28:32 -07:00
test_hermes_state.py	fix(kanban): skip redundant WAL pragma on already-WAL connections	2026-05-27 14:31:55 -07:00
test_hermes_state_wal_fallback.py	fix(kanban): skip redundant WAL pragma on already-WAL connections	2026-05-27 14:31:55 -07:00
test_honcho_client_config.py
test_honcho_session_context.py	fix(honcho): align user context peer perspective	2026-05-27 10:49:33 -07:00
test_install_sh_browser_install.py
test_install_sh_pythonpath_sanitization.py
test_install_sh_root_fhs_uv_python_path.py	test(install): harden uv-python-path regression test against future drift	2026-05-27 13:55:51 -07:00
test_install_sh_setup_wizard_tty_probe.py
test_install_sh_symlink_stomp.py
test_install_sh_termux_network_prereqs.py
test_ipv4_preference.py
test_lazy_session_regressions.py
test_lint_config.py
test_live_system_guard_self_test.py	chore: ruff auto-fix PLR6201 resweep — tuple → set in membership tests (#27355 )	2026-05-17 02:29:41 -07:00
test_mcp_serve.py
test_mini_swe_runner.py
test_minimax_model_validation.py
test_minimax_oauth.py	fix(minimax-oauth): refresh short-lived access tokens per request (#30619 )	2026-05-22 15:16:15 -07:00
test_minisweagent_path.py
test_model_picker_scroll.py
test_model_tools.py
test_model_tools_async_bridge.py
test_ollama_num_ctx.py
test_package_json_lazy_deps.py	fix(update): make Camofox lazy-installed instead of eager (#27055 )	2026-05-16 12:15:45 -07:00
test_packaging_metadata.py
test_plugin_skills.py
test_process_loop_event_loop_warning.py
test_project_metadata.py	remove Vercel AI Gateway and Vercel Sandbox (#33067 )	2026-05-27 00:43:32 -07:00
test_retry_utils.py
test_run_tests_parallel.py	test: use subprocesses for each test file (#29016 )	2026-05-21 16:40:04 +05:30
test_sanitize_tool_error.py	security: sanitize tool error strings before injecting into model context (#26823 )	2026-05-16 00:57:39 -07:00
test_sql_injection.py
test_subprocess_home_isolation.py	fix: avoid process-wide cron profile home mutation	2026-05-18 17:39:50 +00:00
test_termux_all_extra_compat.py
test_timezone.py	chore: ruff auto-fix PLR6201 resweep — tuple → set in membership tests (#27355 )	2026-05-17 02:29:41 -07:00
test_toolset_distributions.py
test_toolsets.py
test_trajectory_compressor.py
test_trajectory_compressor_async.py
test_transform_llm_output_hook.py
test_transform_tool_result_hook.py
test_tui_gateway_server.py	feat: add TUI session orchestrator	2026-05-26 20:51:59 -07:00
test_utils_truthy_values.py
test_yuanbao_integration.py
test_yuanbao_markdown.py
test_yuanbao_pipeline.py
test_yuanbao_proto.py