hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-24 16:54:43 +00:00

History

islam666 09ec26c66a fix(ollama): set default_max_tokens for custom/Ollama provider The custom/Ollama provider profile had no default_max_tokens, so no max_tokens was sent on requests and Ollama fell back to its internal num_predict=128 — truncating responses after a few tokens with finish_reason='length' (#39281, e.g. gemma4). max_tokens resolution is ephemeral > user model.max_tokens > profile default, so this is only a floor used when the user hasn't set their own cap. Set it to 65536 (matching the qwen-oauth tier) rather than a conservative value, since users can always override per-model. Fixes #39281	2026-06-07 21:50:25 -07:00
..
__init__.py	fix(ollama): set default_max_tokens for custom/Ollama provider	2026-06-07 21:50:25 -07:00
plugin.yaml	feat(providers): make all 33 providers pluggable under plugins/model-providers/	2026-05-05 13:40:01 -07:00

islam666 09ec26c66a fix(ollama): set default_max_tokens for custom/Ollama provider

The custom/Ollama provider profile had no default_max_tokens, so no
max_tokens was sent on requests and Ollama fell back to its internal
num_predict=128 — truncating responses after a few tokens with
finish_reason='length' (#39281, e.g. gemma4).

max_tokens resolution is ephemeral > user model.max_tokens > profile
default, so this is only a floor used when the user hasn't set their own
cap. Set it to 65536 (matching the qwen-oauth tier) rather than a
conservative value, since users can always override per-model.

Fixes #39281

2026-06-07 21:50:25 -07:00

__init__.py

fix(ollama): set default_max_tokens for custom/Ollama provider

2026-06-07 21:50:25 -07:00

plugin.yaml

feat(providers): make all 33 providers pluggable under plugins/model-providers/

2026-05-05 13:40:01 -07:00