Mirror of https://github.com/NousResearch/hermes-agent.git (synced 2026-06-27 11:22:03 +00:00)
Map Hermes xhigh→max to unlock DeepSeek V4's 'Max thinking' tier through Ollama Cloud's OpenAI-compatible /v1/chat/completions endpoint. low/medium/high pass through unchanged; disabled/none suppress reasoning entirely. Empirically confirmed: reasoning_effort:max produces ~2.5× as many thinking tokens as high on deepseek-v4-pro:cloud (1576 vs 642).

| File |
|---|
| test_deepseek_profile.py |
| test_kimi_profile.py |
| test_minimax_profile.py |
| test_ollama_cloud_profile.py |
| test_opencode_go_profile.py |
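
The effort mapping described in the commit message can be sketched roughly as follows. This is an illustration only, not the actual Hermes profile code: `map_reasoning_effort`, `build_payload`, and the two lookup tables are hypothetical names. The commit message does not say how suppression is signalled on the wire, so the sketch assumes the `reasoning_effort` field is simply left out for `disabled`/`none`; dropping unrecognized values the same way is also an assumption.

```python
from typing import Optional

# Hypothetical lookup tables; the real profile in hermes-agent may be organized differently.
_EFFORT_MAP = {
    "low": "low",        # passes through unchanged
    "medium": "medium",  # passes through unchanged
    "high": "high",      # passes through unchanged
    "xhigh": "max",      # Hermes xhigh -> DeepSeek V4 'Max thinking' tier
}
_SUPPRESS = {"disabled", "none"}


def map_reasoning_effort(effort: Optional[str]) -> Optional[str]:
    """Return the value to send as reasoning_effort, or None to send nothing."""
    if effort is None:
        return None
    key = effort.strip().lower()
    if key in _SUPPRESS:
        return None
    # Assumption: unrecognized values are dropped rather than forwarded.
    return _EFFORT_MAP.get(key)


def build_payload(model: str, messages: list, effort: Optional[str] = None) -> dict:
    """Build a request body for an OpenAI-compatible /v1/chat/completions call."""
    payload = {"model": model, "messages": messages}
    mapped = map_reasoning_effort(effort)
    if mapped is not None:
        payload["reasoning_effort"] = mapped
    return payload
```

Under these assumptions, `build_payload("deepseek-v4-pro:cloud", msgs, "xhigh")` would carry `reasoning_effort: "max"`, while `"disabled"` would produce a body with no `reasoning_effort` key at all.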