hermes-agent/website/docs
Teknium 03563dabac
fix(gateway): raise session-hygiene hard message limit 400 → 5000 (#50194)
The gateway pre-compression hygiene valve force-compressed any session
crossing 400 messages regardless of token usage. On large-context (1M+)
models doing many short, message-dense turns, a healthy session at ~16%
token usage could hit 400 messages and get force-compressed — and the
compression summary's stale Active Task could then bleed into the next
turn.

The valve's actual purpose is to break a death spiral: when API calls
keep disconnecting on an oversized session, no token-usage data arrives,
the token threshold never fires, and the transcript grows unbounded.
It's a count-based floor for that pathological case only. 400 was tuned
for ~200K-context models and is far too low for modern large-context
sessions. Raise the default to 5000: still low enough to break a death
spiral, but high enough that it no longer fires on legitimate long
conversations.
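
A minimal sketch of how the two triggers described above might interact.
The function name, its parameters, and the 0.85 token threshold are
hypothetical and chosen for illustration; only the 400 and 5000 limits
come from this commit, and the real gateway code may be structured
differently.

```python
from typing import Optional

HARD_MESSAGE_LIMIT = 5000  # new default from this commit (previously 400)


def should_force_compress(
    message_count: int,
    token_usage_ratio: Optional[float],
    token_threshold: float = 0.85,  # hypothetical value, illustration only
    hard_message_limit: int = HARD_MESSAGE_LIMIT,
) -> bool:
    """Return True when the session should be compressed before the next turn."""
    # Normal path: token-usage data arrived and crossed the threshold.
    if token_usage_ratio is not None and token_usage_ratio >= token_threshold:
        return True
    # Safety valve: depends on message count alone, so it still fires when
    # API calls keep disconnecting and no token-usage data ever arrives.
    return message_count >= hard_message_limit


# Healthy large-context session: 400 messages at ~16% token usage.
print(should_force_compress(400, 0.16))                          # False
# The same session under the old limit of 400 was force-compressed.
print(should_force_compress(400, 0.16, hard_message_limit=400))  # True
# Death spiral: no usage data, transcript has grown to the hard limit.
print(should_force_compress(5000, None))                         # True
```

The count check is deliberately independent of token data, which is why
it can act as a floor for the disconnect case without affecting sessions
that report usage normally.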

The value remains fully configurable via compression.hygiene_hard_message_limit.
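
As a rough illustration of the override, the sketch below resolves the
setting from an already-loaded config mapping and falls back to the new
default. The helper name and the assumption that the config is a nested
dict are hypothetical; only the key path
compression.hygiene_hard_message_limit and the default of 5000 come from
this commit.

```python
from typing import Any, Mapping

DEFAULT_HYGIENE_HARD_MESSAGE_LIMIT = 5000  # default set by this commit


def hygiene_hard_message_limit(config: Mapping[str, Any]) -> int:
    """Resolve compression.hygiene_hard_message_limit, or the default."""
    # Treat a missing or empty "compression" section as "no overrides".
    compression = config.get("compression") or {}
    value = compression.get(
        "hygiene_hard_message_limit", DEFAULT_HYGIENE_HARD_MESSAGE_LIMIT
    )
    return int(value)


# No override: the new default applies.
print(hygiene_hard_message_limit({}))  # 5000
# Explicit override, e.g. to restore the previous behaviour.
print(hygiene_hard_message_limit(
    {"compression": {"hygiene_hard_message_limit": 400}}
))  # 400
```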
2026-06-21 08:26:19 -07:00
developer-guide    docs: align contributor test checklist with wrapper    2026-06-20 23:23:47 -07:00
getting-started    fix(docs): document curl, xz-utils, and g++ as Linux prerequisites    2026-06-20 23:23:47 -07:00
guides             docs: sync documentation with current implementation    2026-06-20 23:23:47 -07:00
integrations       docs(providers): clarify vllm qwen reasoning output    2026-06-20 23:23:47 -07:00
reference          fix(docs): align slash-command and docker docs    2026-06-20 23:23:47 -07:00
user-guide         fix(gateway): raise session-hygiene hard message limit 400 → 5000 (#50194)    2026-06-21 08:26:19 -07:00
index.mdx          docs: point desktop download links to site root (deprecate /desktop) (#46795)    2026-06-15 15:02:24 -04:00
user-stories.mdx   docs(website): add User Stories and Use Cases collage page (#18282)    2026-04-30 23:56:59 -07:00