mirror of
https://github.com/NousResearch/hermes-agent.git
synced 2026-05-18 04:41:56 +00:00
docs: update NovitaAI provider positioning (#25532)
This commit is contained in:
parent
0f0e20ef81
commit
ddb8d8fa84
6 changed files with 34 additions and 30 deletions
|
|
@ -14,7 +14,7 @@
|
||||||
|
|
||||||
**The self-improving AI agent built by [Nous Research](https://nousresearch.com).** It's the only agent with a built-in learning loop — it creates skills from experience, improves them during use, nudges itself to persist knowledge, searches its own past conversations, and builds a deepening model of who you are across sessions. Run it on a $5 VPS, a GPU cluster, or serverless infrastructure that costs nearly nothing when idle. It's not tied to your laptop — talk to it from Telegram while it works on a cloud VM.
|
**The self-improving AI agent built by [Nous Research](https://nousresearch.com).** It's the only agent with a built-in learning loop — it creates skills from experience, improves them during use, nudges itself to persist knowledge, searches its own past conversations, and builds a deepening model of who you are across sessions. Run it on a $5 VPS, a GPU cluster, or serverless infrastructure that costs nearly nothing when idle. It's not tied to your laptop — talk to it from Telegram while it works on a cloud VM.
|
||||||
|
|
||||||
Use any model you want — [Nous Portal](https://portal.nousresearch.com), [OpenRouter](https://openrouter.ai) (200+ models), [NovitaAI](https://novita.ai) (90+ models, pay-per-use), [NVIDIA NIM](https://build.nvidia.com) (Nemotron), [Xiaomi MiMo](https://platform.xiaomimimo.com), [z.ai/GLM](https://z.ai), [Kimi/Moonshot](https://platform.moonshot.ai), [MiniMax](https://www.minimax.io), [Hugging Face](https://huggingface.co), OpenAI, or your own endpoint. Switch with `hermes model` — no code changes, no lock-in.
|
Use any model you want — [Nous Portal](https://portal.nousresearch.com), [OpenRouter](https://openrouter.ai) (200+ models), [NovitaAI](https://novita.ai) (AI-native cloud for Model API, Agent Sandbox, and GPU Cloud), [NVIDIA NIM](https://build.nvidia.com) (Nemotron), [Xiaomi MiMo](https://platform.xiaomimimo.com), [z.ai/GLM](https://z.ai), [Kimi/Moonshot](https://platform.moonshot.ai), [MiniMax](https://www.minimax.io), [Hugging Face](https://huggingface.co), OpenAI, or your own endpoint. Switch with `hermes model` — no code changes, no lock-in.
|
||||||
|
|
||||||
<table>
|
<table>
|
||||||
<tr><td><b>A real terminal interface</b></td><td>Full TUI with multiline editing, slash-command autocomplete, conversation history, interrupt-and-redirect, and streaming tool output.</td></tr>
|
<tr><td><b>A real terminal interface</b></td><td>Full TUI with multiline editing, slash-command autocomplete, conversation history, interrupt-and-redirect, and streaming tool output.</td></tr>
|
||||||
|
|
|
||||||
|
|
@ -913,7 +913,7 @@ class ProviderEntry(NamedTuple):
|
||||||
CANONICAL_PROVIDERS: list[ProviderEntry] = [
|
CANONICAL_PROVIDERS: list[ProviderEntry] = [
|
||||||
ProviderEntry("nous", "Nous Portal", "Nous Portal (Nous Research subscription)"),
|
ProviderEntry("nous", "Nous Portal", "Nous Portal (Nous Research subscription)"),
|
||||||
ProviderEntry("openrouter", "OpenRouter", "OpenRouter (100+ models, pay-per-use)"),
|
ProviderEntry("openrouter", "OpenRouter", "OpenRouter (100+ models, pay-per-use)"),
|
||||||
ProviderEntry("novita", "NovitaAI", "NovitaAI (90+ models, pay-per-use)"),
|
ProviderEntry("novita", "NovitaAI", "NovitaAI (AI-native cloud: Model API, Agent Sandbox, GPU Cloud)"),
|
||||||
ProviderEntry("lmstudio", "LM Studio", "LM Studio (local desktop app with built-in model server)"),
|
ProviderEntry("lmstudio", "LM Studio", "LM Studio (local desktop app with built-in model server)"),
|
||||||
ProviderEntry("anthropic", "Anthropic", "Anthropic (Claude models — API key or Claude Code)"),
|
ProviderEntry("anthropic", "Anthropic", "Anthropic (Claude models — API key or Claude Code)"),
|
||||||
ProviderEntry("openai-codex", "OpenAI Codex", "OpenAI Codex"),
|
ProviderEntry("openai-codex", "OpenAI Codex", "OpenAI Codex"),
|
||||||
|
|
|
||||||
|
|
@ -8,7 +8,7 @@ novita = ProviderProfile(
|
||||||
name="novita",
|
name="novita",
|
||||||
aliases=("novita-ai", "novitaai"),
|
aliases=("novita-ai", "novitaai"),
|
||||||
display_name="NovitaAI",
|
display_name="NovitaAI",
|
||||||
description="NovitaAI — 90+ models, pay-per-use",
|
description="NovitaAI — AI-native cloud for builders and agents",
|
||||||
signup_url="https://novita.ai/settings/key-management",
|
signup_url="https://novita.ai/settings/key-management",
|
||||||
env_vars=("NOVITA_API_KEY", "NOVITA_BASE_URL"),
|
env_vars=("NOVITA_API_KEY", "NOVITA_BASE_URL"),
|
||||||
base_url="https://api.novita.ai/openai/v1",
|
base_url="https://api.novita.ai/openai/v1",
|
||||||
|
|
|
||||||
|
|
@ -1,5 +1,5 @@
|
||||||
name: novita-provider
|
name: novita-provider
|
||||||
kind: model-provider
|
kind: model-provider
|
||||||
version: 1.0.0
|
version: 1.0.0
|
||||||
description: NovitaAI multi-model aggregator
|
description: NovitaAI AI-native cloud for builders and agents
|
||||||
author: Nous Research
|
author: Nous Research
|
||||||
|
|
|
||||||
|
|
@ -20,7 +20,7 @@ You need at least one way to connect to an LLM. Use `hermes model` to switch pro
|
||||||
| **GitHub Copilot ACP** | `hermes model` (spawns local `copilot --acp --stdio`) |
|
| **GitHub Copilot ACP** | `hermes model` (spawns local `copilot --acp --stdio`) |
|
||||||
| **Anthropic** | `hermes model` (Claude Max + extra usage credits via OAuth; also supports Anthropic API key or manual setup-token — see note below) |
|
| **Anthropic** | `hermes model` (Claude Max + extra usage credits via OAuth; also supports Anthropic API key or manual setup-token — see note below) |
|
||||||
| **OpenRouter** | `OPENROUTER_API_KEY` in `~/.hermes/.env` |
|
| **OpenRouter** | `OPENROUTER_API_KEY` in `~/.hermes/.env` |
|
||||||
| **NovitaAI** | `NOVITA_API_KEY` in `~/.hermes/.env` (provider: `novita`, 90+ models, pay-per-use) |
|
| **NovitaAI** | `NOVITA_API_KEY` in `~/.hermes/.env` (provider: `novita`, 200+ models, Model API, Agent Sandbox, GPU Cloud) |
|
||||||
| **AI Gateway** | `AI_GATEWAY_API_KEY` in `~/.hermes/.env` (provider: `ai-gateway`) |
|
| **AI Gateway** | `AI_GATEWAY_API_KEY` in `~/.hermes/.env` (provider: `ai-gateway`) |
|
||||||
| **z.ai / GLM** | `GLM_API_KEY` in `~/.hermes/.env` (provider: `zai`) |
|
| **z.ai / GLM** | `GLM_API_KEY` in `~/.hermes/.env` (provider: `zai`) |
|
||||||
| **Kimi / Moonshot** | `KIMI_API_KEY` in `~/.hermes/.env` (provider: `kimi-coding`) |
|
| **Kimi / Moonshot** | `KIMI_API_KEY` in `~/.hermes/.env` (provider: `kimi-coding`) |
|
||||||
|
|
@ -268,6 +268,10 @@ model:
|
||||||
These providers have built-in support with dedicated provider IDs. Set the API key and use `--provider` to select:
|
These providers have built-in support with dedicated provider IDs. Set the API key and use `--provider` to select:
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
|
# NovitaAI Model API
|
||||||
|
hermes chat --provider novita --model moonshotai/kimi-k2.5
|
||||||
|
# Requires: NOVITA_API_KEY in ~/.hermes/.env
|
||||||
|
|
||||||
# z.ai / ZhipuAI GLM
|
# z.ai / ZhipuAI GLM
|
||||||
hermes chat --provider zai --model glm-5
|
hermes chat --provider zai --model glm-5
|
||||||
# Requires: GLM_API_KEY in ~/.hermes/.env
|
# Requires: GLM_API_KEY in ~/.hermes/.env
|
||||||
|
|
@ -317,7 +321,7 @@ model:
|
||||||
default: "zai-org/GLM-5.1-FP8"
|
default: "zai-org/GLM-5.1-FP8"
|
||||||
```
|
```
|
||||||
|
|
||||||
Base URLs can be overridden with `GLM_BASE_URL`, `KIMI_BASE_URL`, `MINIMAX_BASE_URL`, `MINIMAX_CN_BASE_URL`, `DASHSCOPE_BASE_URL`, `XIAOMI_BASE_URL`, `GMI_BASE_URL`, or `TOKENHUB_BASE_URL` environment variables.
|
Base URLs can be overridden with `NOVITA_BASE_URL`, `GLM_BASE_URL`, `KIMI_BASE_URL`, `MINIMAX_BASE_URL`, `MINIMAX_CN_BASE_URL`, `DASHSCOPE_BASE_URL`, `XIAOMI_BASE_URL`, `GMI_BASE_URL`, or `TOKENHUB_BASE_URL` environment variables.
|
||||||
|
|
||||||
:::note Z.AI Endpoint Auto-Detection
|
:::note Z.AI Endpoint Auto-Detection
|
||||||
When using the Z.AI / GLM provider, Hermes automatically probes multiple endpoints (global, China, coding variants) to find one that accepts your API key. You don't need to set `GLM_BASE_URL` manually — the working endpoint is detected and cached automatically.
|
When using the Z.AI / GLM provider, Hermes automatically probes multiple endpoints (global, China, coding variants) to find one that accepts your API key. You don't need to set `GLM_BASE_URL` manually — the working endpoint is detected and cached automatically.
|
||||||
|
|
@ -333,6 +337,29 @@ No configuration is needed — caching activates automatically when an xAI endpo
|
||||||
|
|
||||||
xAI also ships a dedicated TTS endpoint (`/v1/tts`). Select **xAI TTS** in `hermes tools` → Voice & TTS, or see the [Voice & TTS](../user-guide/features/tts.md#text-to-speech) page for config.
|
xAI also ships a dedicated TTS endpoint (`/v1/tts`). Select **xAI TTS** in `hermes tools` → Voice & TTS, or see the [Voice & TTS](../user-guide/features/tts.md#text-to-speech) page for config.
|
||||||
|
|
||||||
|
### NovitaAI
|
||||||
|
|
||||||
|
[NovitaAI](https://novita.ai) is the AI-native cloud for builders and agents. Its three product lines are Model API for 200+ models, Agent Sandbox for building and running AI agents, and GPU Cloud for scalable compute, all available from one platform.
|
||||||
|
|
||||||
|
```bash
|
||||||
|
# Use any available model
|
||||||
|
hermes chat --provider novita --model moonshotai/kimi-k2.5
|
||||||
|
# Requires: NOVITA_API_KEY in ~/.hermes/.env
|
||||||
|
|
||||||
|
# Short alias
|
||||||
|
hermes chat --provider novita-ai --model deepseek/deepseek-v3-0324
|
||||||
|
```
|
||||||
|
|
||||||
|
Or set it permanently in `config.yaml`:
|
||||||
|
```yaml
|
||||||
|
model:
|
||||||
|
provider: "novita"
|
||||||
|
default: "moonshotai/kimi-k2.5"
|
||||||
|
base_url: "https://api.novita.ai/openai/v1"
|
||||||
|
```
|
||||||
|
|
||||||
|
Get your API key at [novita.ai/settings/key-management](https://novita.ai/settings/key-management). The base URL can be overridden with `NOVITA_BASE_URL`.
|
||||||
|
|
||||||
### Ollama Cloud — Managed Ollama Models, OAuth + API Key
|
### Ollama Cloud — Managed Ollama Models, OAuth + API Key
|
||||||
|
|
||||||
[Ollama Cloud](https://ollama.com/cloud) hosts the same open-weight catalog as local Ollama but without the GPU requirement. Pick it in `hermes model` as **Ollama Cloud**, paste your API key from [ollama.com/settings/keys](https://ollama.com/settings/keys), and Hermes auto-discovers the available models.
|
[Ollama Cloud](https://ollama.com/cloud) hosts the same open-weight catalog as local Ollama but without the GPU requirement. Pick it in `hermes model` as **Ollama Cloud**, paste your API key from [ollama.com/settings/keys](https://ollama.com/settings/keys), and Hermes auto-discovers the available models.
|
||||||
|
|
@ -547,29 +574,6 @@ You can append routing suffixes to model names: `:fastest` (default), `:cheapest
|
||||||
|
|
||||||
The base URL can be overridden with `HF_BASE_URL`.
|
The base URL can be overridden with `HF_BASE_URL`.
|
||||||
|
|
||||||
### NovitaAI
|
|
||||||
|
|
||||||
[NovitaAI](https://novita.ai) is a 90+ model aggregator with pay-per-use pricing. Access models from DeepSeek, Kimi, MiniMax, GLM, Qwen, and more through a unified OpenAI-compatible API.
|
|
||||||
|
|
||||||
```bash
|
|
||||||
# Use any available model
|
|
||||||
hermes chat --provider novita --model moonshotai/kimi-k2.5
|
|
||||||
# Requires: NOVITA_API_KEY in ~/.hermes/.env
|
|
||||||
|
|
||||||
# Short alias
|
|
||||||
hermes chat --provider novita-ai --model deepseek/deepseek-v3-0324
|
|
||||||
```
|
|
||||||
|
|
||||||
Or set it permanently in `config.yaml`:
|
|
||||||
```yaml
|
|
||||||
model:
|
|
||||||
provider: "novita"
|
|
||||||
default: "moonshotai/kimi-k2.5"
|
|
||||||
base_url: "https://api.novita.ai/openai/v1"
|
|
||||||
```
|
|
||||||
|
|
||||||
Get your API key at [novita.ai/settings/key-management](https://novita.ai/settings/key-management). The base URL can be overridden with `NOVITA_BASE_URL`.
|
|
||||||
|
|
||||||
## Custom & Self-Hosted LLM Providers
|
## Custom & Self-Hosted LLM Providers
|
||||||
|
|
||||||
Hermes Agent works with **any OpenAI-compatible API endpoint**. If a server implements `/v1/chat/completions`, you can point Hermes at it. This means you can use local models, GPU inference servers, multi-provider routers, or any third-party API.
|
Hermes Agent works with **any OpenAI-compatible API endpoint**. If a server implements `/v1/chat/completions`, you can point Hermes at it. This means you can use local models, GPU inference servers, multi-provider routers, or any third-party API.
|
||||||
|
|
|
||||||
|
|
@ -67,7 +67,7 @@ All variables go in `~/.hermes/.env`. You can also set them with `hermes config
|
||||||
| `DASHSCOPE_BASE_URL` | Custom DashScope base URL (default: `https://dashscope-intl.aliyuncs.com/compatible-mode/v1`; use `https://dashscope.aliyuncs.com/compatible-mode/v1` for mainland-China region) |
|
| `DASHSCOPE_BASE_URL` | Custom DashScope base URL (default: `https://dashscope-intl.aliyuncs.com/compatible-mode/v1`; use `https://dashscope.aliyuncs.com/compatible-mode/v1` for mainland-China region) |
|
||||||
| `DEEPSEEK_API_KEY` | DeepSeek API key for direct DeepSeek access ([platform.deepseek.com](https://platform.deepseek.com/api_keys)) |
|
| `DEEPSEEK_API_KEY` | DeepSeek API key for direct DeepSeek access ([platform.deepseek.com](https://platform.deepseek.com/api_keys)) |
|
||||||
| `DEEPSEEK_BASE_URL` | Custom DeepSeek API base URL |
|
| `DEEPSEEK_BASE_URL` | Custom DeepSeek API base URL |
|
||||||
| `NOVITA_API_KEY` | NovitaAI API key — 90+ models, pay-per-use ([novita.ai/settings/key-management](https://novita.ai/settings/key-management)) |
|
| `NOVITA_API_KEY` | NovitaAI API key — AI-native cloud for Model API, Agent Sandbox, and GPU Cloud ([novita.ai/settings/key-management](https://novita.ai/settings/key-management)) |
|
||||||
| `NOVITA_BASE_URL` | Override NovitaAI base URL (default: `https://api.novita.ai/openai/v1`) |
|
| `NOVITA_BASE_URL` | Override NovitaAI base URL (default: `https://api.novita.ai/openai/v1`) |
|
||||||
| `NVIDIA_API_KEY` | NVIDIA NIM API key — Nemotron and open models ([build.nvidia.com](https://build.nvidia.com)) |
|
| `NVIDIA_API_KEY` | NVIDIA NIM API key — Nemotron and open models ([build.nvidia.com](https://build.nvidia.com)) |
|
||||||
| `NVIDIA_BASE_URL` | Override NVIDIA base URL (default: `https://integrate.api.nvidia.com/v1`; set to `http://localhost:8000/v1` for a local NIM endpoint) |
|
| `NVIDIA_BASE_URL` | Override NVIDIA base URL (default: `https://integrate.api.nvidia.com/v1`; set to `http://localhost:8000/v1` for a local NIM endpoint) |
|
||||||
|
|
|
||||||
Loading…
Add table
Add a link
Reference in a new issue