hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-30 06:41:51 +00:00

teknium1 8cf6b3da9d fix(opencode-go): cap mimo-v2.5-pro max_tokens at 131072	2026-05-28 20:49:53 -07:00
__init__.py	fix(opencode-go): cap mimo-v2.5-pro max_tokens at 131072	2026-05-28 20:49:53 -07:00
plugin.yaml	feat(providers): make all 33 providers pluggable under plugins/model-providers/	2026-05-05 13:40:01 -07:00

teknium1 8cf6b3da9d fix(opencode-go): cap mimo-v2.5-pro max_tokens at 131072

The opencode-go relay defaults max_tokens to 262144 when none is sent,
but Xiaomi mimo-v2.5-pro only supports 131072 completion tokens, so every
request 400s with "max_tokens is too large: 262144" before the agent
can do anything.

Add a get_max_tokens(model) hook on ProviderProfile (default returns
default_max_tokens) so profiles fronting multiple upstreams can vary
the cap per-model. Wire chat_completions transport through the hook.
Override on OpenCodeGoProfile with mimo-v2.5-pro=131072.

Only mimo-v2.5-pro is capped; other opencode-go models (kimi, glm,
qwen, minimax, other mimo variants) are unchanged.
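
The hook described in this commit message could look roughly like the sketch below. It is not the code from this commit: the class bodies, the model_max_tokens field, and the build_chat_completions_payload helper are hypothetical stand-ins, and exact-match lookup on the model name is an assumption. Only the names ProviderProfile, OpenCodeGoProfile, get_max_tokens, default_max_tokens and the 131072 cap come from the message.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class ProviderProfile:
    """Hypothetical stand-in for the real ProviderProfile (fields assumed)."""
    name: str
    default_max_tokens: Optional[int] = None

    def get_max_tokens(self, model: str) -> Optional[int]:
        # Default hook: every model shares the profile-wide value.
        return self.default_max_tokens


@dataclass
class OpenCodeGoProfile(ProviderProfile):
    """Profile fronting several upstreams, so the cap can vary per model."""
    name: str = "opencode-go"
    # Assumed storage for per-model caps; only mimo-v2.5-pro is listed,
    # matching the commit message.
    model_max_tokens: Dict[str, int] = field(
        default_factory=lambda: {"mimo-v2.5-pro": 131072}
    )

    def get_max_tokens(self, model: str) -> Optional[int]:
        # Fall back to the profile default for models without an override.
        return self.model_max_tokens.get(model, self.default_max_tokens)


def build_chat_completions_payload(
    profile: ProviderProfile, model: str, messages: List[dict]
) -> dict:
    """Hypothetical transport helper that routes max_tokens through the hook."""
    payload = {"model": model, "messages": messages}
    cap = profile.get_max_tokens(model)
    if cap is not None:
        # Send an explicit cap so the relay does not apply its own default.
        payload["max_tokens"] = cap
    return payload
```

With this shape, a request for mimo-v2.5-pro carries max_tokens=131072, while requests for other models omit the field and keep whatever behaviour they had before.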

2026-05-28 20:49:53 -07:00
