hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-24 10:52:21 +00:00

History

Teknium 03563dabac fix(gateway): raise session-hygiene hard message limit 400 → 5000 (#50194 ) The gateway pre-compression hygiene valve force-compressed any session crossing 400 messages regardless of token usage. On large-context (1M+) models doing many short, message-dense turns, a healthy session at ~16% token usage could hit 400 messages and get force-compressed — and the compression summary's stale Active Task could then bleed into the next turn. The valve's actual purpose is to break a death spiral: when API calls keep disconnecting on an oversized session, no token-usage data arrives, the token threshold never fires, and the transcript grows unbounded. It's a count-based floor for that pathological case only. 400 was tuned for ~200K-context models and is far too low for modern large-context sessions. Raise the default to 5000 — still well clear of any death spiral, but no longer firing on legitimate long conversations. The value remains fully configurable via compression.hygiene_hard_message_limit.		2026-06-21 08:26:19 -07:00
..
developer-guide	docs: align contributor test checklist with wrapper	2026-06-20 23:23:47 -07:00
getting-started	fix(docs): document curl, xz-utils, and g++ as Linux prerequisites	2026-06-20 23:23:47 -07:00
guides	docs: sync documentation with current implementation	2026-06-20 23:23:47 -07:00
integrations	docs(providers): clarify vllm qwen reasoning output	2026-06-20 23:23:47 -07:00
reference	fix(docs): align slash-command and docker docs	2026-06-20 23:23:47 -07:00
user-guide	fix(gateway): raise session-hygiene hard message limit 400 → 5000 (#50194 )	2026-06-21 08:26:19 -07:00
index.mdx	docs: point desktop download links to site root (deprecate /desktop) (#46795 )	2026-06-15 15:02:24 -04:00
user-stories.mdx	docs(website): add User Stories and Use Cases collage page (#18282 )	2026-04-30 23:56:59 -07:00