hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-31 19:16:29 +00:00

History

KUSH42 34d06a9802 fix(compaction): don't halve context_length on output-cap-too-large errors When the API returns "max_tokens too large given prompt" (input tokens are within the context window, but input + requested output > window), the old code incorrectly routed through the same handler as "prompt too long" errors, calling get_next_probe_tier() and permanently halving context_length. This made things worse: the window was fine, only the requested output size needed trimming for that one call. Two distinct error classes now handled separately: Prompt too long — input itself exceeds context window. Fix: compress history + halve context_length (existing behaviour, unchanged). Output cap too large — input OK, but input + max_tokens > window. Fix: parse available_tokens from the error message, set a one-shot _ephemeral_max_output_tokens override for the retry, and leave context_length completely untouched. Changes: - agent/model_metadata.py: add parse_available_output_tokens_from_error() that detects Anthropic's "available_tokens: N" error format and returns the available output budget, or None for all other error types. - run_agent.py: call the new parser first in the is_context_length_error block; if it fires, set _ephemeral_max_output_tokens (with a 64-token safety margin) and break to retry without touching context_length. _build_api_kwargs consumes the ephemeral value exactly once then clears it so subsequent calls use self.max_tokens normally. - agent/anthropic_adapter.py: expand build_anthropic_kwargs docstring to clearly document the max_tokens (output cap) vs context_length (total window) distinction, which is a persistent source of confusion due to the OpenAI-inherited "max_tokens" name. - cli-config.yaml.example: add inline comments explaining both keys side by side where users are most likely to look. - website/docs/integrations/providers.md: add a callout box at the top of "Context Length Detection" and clarify the troubleshooting entry. - tests/test_ctx_halving_fix.py: 24 tests across four classes covering the parser, build_anthropic_kwargs clamping, ephemeral one-shot consumption, and the invariant that context_length is never mutated on output-cap errors.		2026-04-09 11:27:41 -07:00
..
acp	feat(api): structured run events via /v1/runs SSE endpoint	2026-04-05 12:05:13 -07:00
agent	fix: prevent 400 format errors from triggering compression loop on Codex Responses API (#6751 )	2026-04-09 11:11:34 -07:00
cli	fix: handle empty sudo password and false prompts	2026-04-09 02:50:07 -07:00
cron	feat(cron): track delivery failures in job status (#6042 )	2026-04-07 22:49:01 -07:00
e2e	test(e2e): remove section separator comments	2026-04-01 15:23:52 -07:00
environments/benchmarks	fix(security): consolidated security hardening — SSRF, timing attack, tar traversal, credential leakage (#5944 )	2026-04-07 17:28:37 -07:00
fakes	fix: streaming tool call parsing, error handling, and fake HA state mutation	2026-03-14 14:27:20 +03:00
gateway	fix(discord): accept .log attachments and raise document size limit	2026-04-09 02:26:33 -07:00
hermes_cli	fix: remove /prompt slash command — footgun via prefix expansion (#6752 )	2026-04-09 11:27:27 -07:00
honcho_plugin	fix(honcho): migration guard for observation mode default change	2026-04-05 12:34:11 -07:00
integration	refactor: remove mini-swe-agent dependency — inline Docker/Modal backends (#2804 )	2026-03-24 07:30:25 -07:00
plugins	feat(hindsight): feature parity, setup wizard, and config improvements	2026-04-08 23:54:15 -07:00
run_agent	fix: retry 3 times when model returns truly empty response (#6488 )	2026-04-09 02:06:12 -07:00
skills	fix(skills): archive OpenClaw cron store without config	2026-04-09 03:06:11 -07:00
tools	fix: handle empty sudo password and false prompts	2026-04-09 02:50:07 -07:00
__init__.py	A bit of restructuring for simplicity and organization	2025-10-01 23:29:25 +00:00
conftest.py	fix(approval): show full command in dangerous command approval (#1553 )	2026-03-17 02:02:33 -07:00
run_interrupt_test.py	fix: thread safety for concurrent subagent delegation (#1672 )	2026-03-17 02:53:33 -07:00
test_batch_runner_checkpoint.py	fix: sanitize chat payloads and provider precedence	2026-03-13 23:59:12 -07:00
test_cli_skin_integration.py	fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974 )	2026-04-07 17:59:42 -07:00
test_ctx_halving_fix.py	fix(compaction): don't halve context_length on output-cap-too-large errors	2026-04-09 11:27:41 -07:00
test_evidence_store.py	feat: add OSS Security Forensics skill (Skills Hub) (#1482 )	2026-03-15 21:59:53 -07:00
test_hermes_logging.py	fix: repair 57 failing CI tests across 14 files (#5823 )	2026-04-07 09:58:45 -07:00
test_hermes_state.py	fix(state): orphan children instead of cascade-deleting in prune/delete (#6513 )	2026-04-09 02:41:56 -07:00
test_honcho_client_config.py	feat(memory): pluggable memory provider interface with profile isolation, review fixes, and honcho CLI restoration (#4623 )	2026-04-02 15:33:51 -07:00
test_mcp_serve.py	feat: add MCP server mode — hermes mcp serve (#3795 )	2026-03-29 15:47:19 -07:00
test_minisweagent_path.py	chore: remove all remaining mini-swe-agent references	2026-03-24 08:19:23 -07:00
test_model_picker_scroll.py	fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974 )	2026-04-07 17:59:42 -07:00
test_model_tools.py	Add request-scoped plugin lifecycle hooks	2026-04-05 23:31:29 -07:00
test_model_tools_async_bridge.py	fix: use per-thread persistent event loops in worker threads	2026-03-20 15:41:06 -04:00
test_ollama_num_ctx.py	fix: provider/model resolution — salvage 4 PRs + MiniMax aux URL fix (#5983 )	2026-04-07 22:23:28 -07:00
test_packaging_metadata.py	chore: prepare Hermes for Homebrew packaging (#4099 )	2026-03-30 17:34:43 -07:00
test_project_metadata.py	fix: exclude matrix from [all] extras — python-olm is upstream-broken (#4615 )	2026-04-02 09:21:37 -07:00
test_retry_utils.py	feat(agent): add jittered retry backoff	2026-04-08 00:41:36 -07:00
test_sql_injection.py	fix(security): eliminate SQL string formatting in execute() calls	2026-03-19 15:16:35 +01:00
test_timezone.py	fix: repair 57 failing CI tests across 14 files (#5823 )	2026-04-07 09:58:45 -07:00
test_toolset_distributions.py	test: add unit tests for 8 modules (batch 2)	2026-02-26 13:54:20 +03:00
test_toolsets.py	fix: add missing Platform.SIGNAL to toolset mappings, update test + config docs	2026-03-09 23:27:19 -07:00
test_trajectory_compressor.py	fix: URL-based auth for third-party Anthropic endpoints + CI test fixes (#4148 )	2026-03-30 20:36:56 -07:00
test_trajectory_compressor_async.py	fix: create AsyncOpenAI lazily in trajectory_compressor to avoid closed event loop (#4013 )	2026-03-30 13:16:16 -07:00
test_utils_truthy_values.py	Gate tool-gateway behind an env var, so it's not in users' faces until we're ready. Even if users enable it, it'll be blocked server-side for now, until we unlock for non-admin users on tool-gateway.	2026-03-30 13:28:10 +09:00