mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-13 14:02:16 +00:00


helix4u ba9964ff0d fix(custom): pass custom provider extra body

Allow custom OpenAI-compatible providers declared under `custom_providers:` to set provider-specific `extra_body` fields and have Hermes merge them into chat-completions requests when the matching custom endpoint is active. This is a manual per-provider override rather than a model-name heuristic.

OpenAI-compatible Gemma thinking support is real, but the on-wire payload shape is backend-specific: some servers want top-level `enable_thinking`, while vLLM Gemma and NIM-style endpoints expect `chat_template_kwargs`. A per-provider override is safer than picking one assumed payload.

Example config:

```yaml
custom_providers:
  - name: gemma-local
    base_url: http://localhost:8080/v1
    model: google/gemma-4-31b-it
    extra_body:
      enable_thinking: true
      reasoning_effort: high
```

For vLLM Gemma or NIM-style endpoints, use the nested shape those servers expect:

```yaml
extra_body:
  chat_template_kwargs:
    enable_thinking: true
```

Changes:

- `hermes_cli/config.py`: preserve `extra_body` in normalized `custom_providers:` entries and allow it in the validated field set.
- `hermes_cli/runtime_provider.py`: propagate custom-provider `extra_body` as `request_overrides.extra_body` for named custom runtime resolution, including credential-pool paths.
- `agent/agent_init.py`: at agent init, locate the matching custom-provider entry by `base_url` (+ optional model) and merge its `extra_body` into `AIAgent.request_overrides`, with caller-provided overrides winning on conflicting top-level keys.
- `plugins/model-providers/custom/__init__.py`: keep existing CustomProfile behavior (Ollama `num_ctx`, `think=False` when reasoning disabled); user-configured `extra_body` flows through `request_overrides`.
- `website/docs/integrations/providers.md`: document the explicit `extra_body` override and the vLLM/Gemma `chat_template_kwargs` variant.
- Tests cover config normalization, runtime propagation, model matching, trailing-slash equivalence, fallback when no `model` field is set, and caller-override merging precedence.

Verified end-to-end against `CustomProfile` via `ChatCompletionsTransport`: configured `extra_body` reaches `kwargs.extra_body` on the wire request, and coexists with profile-generated entries (Ollama `num_ctx`, `think=False`) without clobber.

Salvaged from #29022 onto current `main`. Cosmetic typing edit in `plugins/model-providers/custom/__init__.py` and a stale-base docs revert in `providers.md` were dropped during cherry-pick.

Closes #29022

2026-05-21 07:48:53 -07:00
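
The lookup-and-merge behavior described in the commit message above can be illustrated with a minimal Python sketch. This is not the code in `agent/agent_init.py`; the function names `find_custom_provider` and `merge_request_overrides` are hypothetical, and the exact matching rules (trailing-slash normalization, an entry without `model` acting as a fallback, a shallow merge of `extra_body` keys) are assumptions inferred from the test list in the message.

```python
from __future__ import annotations

from typing import Any, Optional


def _normalize_url(url: str) -> str:
    # Assumption: "http://host/v1" and "http://host/v1/" name the same endpoint.
    return url.rstrip("/")


def find_custom_provider(
    providers: list[dict[str, Any]],
    base_url: str,
    model: Optional[str] = None,
) -> Optional[dict[str, Any]]:
    """Hypothetical helper: pick the custom_providers entry for an endpoint.

    An entry whose `model` equals the requested model wins; an entry with no
    `model` field is kept as a fallback for any model on that base_url.
    """
    target = _normalize_url(base_url)
    fallback = None
    for entry in providers:
        if _normalize_url(entry.get("base_url", "")) != target:
            continue
        entry_model = entry.get("model")
        if model is None or entry_model == model:
            return entry
        if entry_model is None and fallback is None:
            fallback = entry
    return fallback


def merge_request_overrides(
    configured_extra_body: Optional[dict[str, Any]],
    caller_overrides: Optional[dict[str, Any]],
) -> dict[str, Any]:
    """Hypothetical helper: shallow-merge configured extra_body into overrides.

    Caller-provided extra_body keys win on conflicting top-level keys.
    """
    merged = dict(caller_overrides or {})
    extra_body = dict(configured_extra_body or {})
    extra_body.update(merged.get("extra_body") or {})
    if extra_body:
        merged["extra_body"] = extra_body
    return merged
```

With the `gemma-local` entry from the example config, a caller that passes `{"extra_body": {"reasoning_effort": "low"}}` would, under these assumptions, end up with `enable_thinking: true` from the config and `reasoning_effort: low` from the caller.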
docs	fix(custom): pass custom provider extra body	2026-05-21 07:48:53 -07:00
i18n	docs: add Korean Kanban documentation	2026-05-18 21:42:13 -07:00
scripts	fix(docs): unique sidebar keys for duplicate skill categories (#26726)	2026-05-15 20:29:20 -07:00
src	docs(user-stories): add 116 stories from the Hermes Discord archive (#23436)	2026-05-10 15:21:40 -07:00
static	rebuild model catalog	2026-05-11 09:54:31 -07:00
.gitignore	docs: publish llms.txt and llms-full.txt for agent-friendly ingestion (#18276)	2026-04-30 23:17:14 -07:00
docusaurus.config.ts	docs: add Korean Kanban documentation	2026-05-18 21:42:13 -07:00
package-lock.json	chore(deps): bump mermaid from 11.13.0 to 11.15.0 in /website (#24011)	2026-05-19 09:19:06 -04:00
package.json	feat(profile): shareable profile distributions via git (#20831)	2026-05-08 10:04:32 -07:00
README.md	docs: replace ASCII diagrams with Mermaid/lists, add linting note	2026-03-21 17:58:30 -07:00
sidebars.ts	chore(skills/baoyu-article-illustrator): tighten description, add platforms, regen docs	2026-05-18 18:28:56 -07:00
tsconfig.json	feat: add documentation website (Docusaurus)	2026-03-05 05:24:55 -08:00

README.md

Website

This website is built using Docusaurus, a modern static website generator.

Installation

yarn

Local Development

yarn start

This command starts a local development server and opens up a browser window. Most changes are reflected live without having to restart the server.

Build

yarn build

This command generates static content into the build directory, which can then be served by any static content hosting service.

Deployment

Using SSH:

USE_SSH=true yarn deploy

Not using SSH:

GIT_USER=<Your GitHub username> yarn deploy

If you are using GitHub Pages for hosting, this command is a convenient way to build the website and push to the gh-pages branch.

Diagram Linting

CI runs ascii-guard to lint docs for ASCII box diagrams. Use Mermaid (`mermaid` fenced code blocks) or plain lists/tables instead of ASCII boxes to avoid CI failures.