mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-27 11:22:03 +00:00


DavidMetcalfe 865a09a610 fix(agent): detect thinking-timeout for reasoning models and surface actionable guidance instead of misleading file-write advice	2026-06-25 19:00:48 -07:00

Two-part fix:

Part 1 (classifier override at agent/error_classifier.py:720-738): A transport disconnect on a reasoning model, even on a large session, now routes to FailoverReason.timeout instead of context_overflow. Without this, large-session reasoning-model disconnects route to the compression branch and silently delete conversation history on a phantom context-length error. The override is strictly targeted: non-reasoning models (gpt-4o, claude-3-5-sonnet, llama-3.3-70b, etc.) still route to context_overflow on large sessions, which is the existing intentional behavior for chat models whose proxy doesn't idle-kill during prefill/generation.

Part 2 (new agent/thinking_timeout_guidance.py + integration at agent/conversation_loop.py:3488-3567): New is_thinking_timeout() and build_thinking_timeout_guidance() helpers. When a known reasoning model (NVIDIA Nemotron 3 Ultra, OpenAI o1/o3, Anthropic Opus 4.x thinking, DeepSeek R1, Qwen QwQ, xAI Grok reasoning) hits a transport-kill on a small session (classifier says timeout directly) or after Part 1 routes correctly (large session), the user now sees reasoning-specific guidance with three actionable workarounds in priority order:

1. Set providers.<provider>.models.<model>.stale_timeout_seconds: 900 in ~/.hermes/config.yaml (Hermes's built-in floor is already 600s for known reasoning models; raise further if upstream is even tighter).
2. Lower reasoning_budget or set reasoning_effort: medium on this model if the provider supports it.
3. Use a smaller / faster reasoning model if the task doesn't require deep thinking.

The new guidance takes precedence via if/elif over the existing _is_stream_drop block, so a reasoning-model user with a transport-kill message sees actionable advice instead of the misleading "try execute_code with Python's open() for large files" advice (which is correct for the unrelated large-file-write stream-drop case but actively wrong for the thinking-timeout case).

Verified:
- 478 tests passing across 9 directly-relevant files (49 new + 429 existing, zero regressions).
- Ruff lint clean on all 4 modified/new files.
- Negative test: 6 parametrized regression guards confirm non-reasoning models still route to context_overflow on large sessions; 4 parametrized gates confirm non-timeout classifier reasons never trigger the guidance; 5 parametrized cases confirm non-transport messages never trigger it.
- Regression guard: new guidance message does NOT contain "execute_code" or "open()"; the misleading advice is fully replaced, not appended alongside.
- Cross-vendor dual review via agy -p:
  - Gemini 3.5 Flash (Medium): passed: true, zero blockers, one SHOULD-FIX (vprint block duplication, fixed by extracting detection into a helper module).
  - GPT-OSS 120B (Medium): passed: true, zero blockers, two nits (test placement, adopted at tests/agent/test_thinking_timeout_guidance.py; primary-model capture, accepted as non-issue per Flash's nit).

Dependency note for maintainers: This PR includes agent/reasoning_timeouts.py (the reasoning-model allowlist module from PR #52238) because the Layer 1 override is load-bearing on get_reasoning_stale_timeout_floor(). After PR #52238 lands on main, this PR's duplicate agent/reasoning_timeouts.py should be rebased away. Either PR can land first; the other rebase is mechanical.

Fixes #52271.
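
The routing rule described in Part 1 of the commit message can be illustrated with a small sketch. This is not the code in agent/error_classifier.py. Only the names FailoverReason.timeout and context_overflow come from the commit message; the token-based allowlist, the helper names, and the small-session fallback for non-reasoning models are assumptions made for illustration.

```python
import re
from enum import Enum
from typing import Optional


class FailoverReason(Enum):
    # Member names taken from the commit message; values are illustrative.
    timeout = "timeout"
    context_overflow = "context_overflow"


# Hypothetical allowlist tokens. The real allowlist is said to live in
# agent/reasoning_timeouts.py and is not reproduced here.
REASONING_TOKENS = {"o1", "o3", "r1", "qwq"}


def is_reasoning_model(model: str) -> bool:
    """Match whole name tokens so 'gpt-4o' is not mistaken for 'o1' or 'o3'."""
    tokens = re.split(r"[^a-z0-9]+", model.lower())
    return any(token in REASONING_TOKENS for token in tokens)


def classify_disconnect(
    model: str, is_transport_disconnect: bool, session_is_large: bool
) -> Optional[FailoverReason]:
    """Route a transport disconnect the way Part 1 describes."""
    if not is_transport_disconnect:
        return None  # other error classes are out of scope for this sketch
    if is_reasoning_model(model):
        # Reasoning models route to timeout regardless of session size.
        return FailoverReason.timeout
    if session_is_large:
        # Non-reasoning chat models keep the existing behavior.
        return FailoverReason.context_overflow
    # Assumption: small sessions on other models are also treated as timeouts.
    return FailoverReason.timeout
```

Matching on whole tokens instead of substrings is a design choice for this sketch only; it keeps names such as gpt-4o and claude-3-5-sonnet out of the reasoning branch.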
.github	fix(ci): run CI on all PRs to anywhere	2026-06-25 09:15:20 -07:00
.plans	Merge PR #724 : feat: --yolo flag to bypass all approval prompts	2026-03-10 20:56:30 -07:00
acp_adapter	feat(moa): expose MoA presets as selectable virtual models (#46081 )	2026-06-25 13:52:06 -07:00
acp_registry	chore: release v0.17.0 (2026.6.19)	2026-06-19 12:38:31 -07:00
agent	fix(agent): detect thinking-timeout for reasoning models and surface actionable guidance instead of misleading file-write advice	2026-06-25 19:00:48 -07:00
apps	Merge pull request #52772 from NousResearch/bb/editor	2026-06-25 20:25:06 -05:00
assets	Update banner image to new version	2026-02-25 11:53:44 -08:00
cron	fix(cron): add default retention to per-run job output (#52383 ) (#52646 )	2026-06-25 16:00:13 -07:00
datagen-config-examples	feat: add WebResearchEnv RL environment for multi-step web research	2026-03-05 14:34:36 +00:00
docker	fix(soul): installers seed the real default persona, upgrade legacy empty templates (#52246 )	2026-06-24 18:56:26 -07:00
docs	docs(chronos): pin hop-1 auth to the hosted-agent bootstrap token	2026-06-24 20:57:43 +10:00
gateway	fix(gateway): defer cross-process cache cleanup off the cache lock (#52197 ) (#52761 )	2026-06-25 18:58:47 -07:00
hermes_cli	fix(cron): detect partial job loss in restore_cron_jobs_if_emptied (#52144 )	2026-06-25 18:49:18 -07:00
locales	feat(i18n): add complete Spanish translation	2026-06-20 23:23:47 -07:00
nix	feat(desktop): wire project settings and shell chrome	2026-06-25 16:40:27 -05:00
optional-mcps	feat(mcp-catalog): add official Unreal Engine 5.8 MCP server	2026-06-18 09:16:40 -07:00
optional-skills	feat(skills): add cloudflare-temporary-deploy optional skill (#50849 )	2026-06-22 12:14:30 -07:00
packaging/homebrew	chore: prepare Hermes for Homebrew packaging (#4099 )	2026-03-30 17:34:43 -07:00
plugins	fix(telegram): heartbeat loop exits cleanly when bot has no get_me	2026-06-25 18:50:11 -07:00
providers	fix(models): pass model.base_url to fetch_models in /model picker	2026-06-16 13:09:40 -07:00
scripts	chore(release): map agt-user noreply email for #48496 salvage	2026-06-25 18:50:11 -07:00
skills	feat(moa): expose MoA presets as selectable virtual models (#46081 )	2026-06-25 13:52:06 -07:00
tests	fix(agent): detect thinking-timeout for reasoning models and surface actionable guidance instead of misleading file-write advice	2026-06-25 19:00:48 -07:00
tools	fix(approval): fold Windows absolute home paths in dangerous-command detection	2026-06-25 17:49:39 -07:00
tui_gateway	fix(desktop): resume latest compression continuation	2026-06-25 16:29:09 -07:00
ui-tui	feat(tui): add width-budgeted "resumes when subagent finishes" status segment	2026-06-25 19:57:58 -05:00
web	feat(moa): expose MoA presets as selectable virtual models (#46081 )	2026-06-25 13:52:06 -07:00
website	feat(moa): expose MoA presets as selectable virtual models (#46081 )	2026-06-25 13:52:06 -07:00
.dockerignore	fix(docker): support WebUI installs from read-only sources (#48541 )	2026-06-19 10:52:16 +10:00
.env.example	docs(.env.example): add HF_BASE_URL placeholder	2026-06-20 23:23:47 -07:00
.envrc	fix(node/nix): consolidate workspace lockfile + update all consumers	2026-06-02 20:28:18 -04:00
.gitattributes	chore: enforce LF line endings for container entrypoints (#12181 )	2026-06-05 09:54:01 +10:00
.gitignore	fix(docker): supervised gateway uses --replace to take over stale holder (NS-505) (#47555 )	2026-06-18 10:49:02 +10:00
.hadolint.yaml	feat(docker): remove gosu from bundled image; s6-setuidgid handles privilege drop	2026-05-24 18:05:33 -07:00
.mailmap	chore: add MestreY0d4-Uninter to AUTHOR_MAP and .mailmap	2026-04-15 15:03:28 -07:00
AGENTS.md	docs(agents): fix stale platform adapter path in token-lock note	2026-06-21 19:59:50 -07:00
batch_runner.py	feat(azure-foundry): add Microsoft Entra ID auth	2026-05-18 10:14:38 -07:00
cli-config.yaml.example	feat(moa): expose MoA presets as selectable virtual models (#46081 )	2026-06-25 13:52:06 -07:00
cli.py	feat(cli): note background delegate_task dispatch in _on_tool_complete	2026-06-25 19:57:58 -05:00
constraints-termux.txt	feat: add tested Termux install path and EOF-aware gh auth	2026-04-09 16:24:53 -07:00
CONTRIBUTING.es.md	feat(i18n): add complete Spanish translation	2026-06-20 23:23:47 -07:00
CONTRIBUTING.md	docs: add missing Prerequisites/How to Run sections to SKILL.md template	2026-06-20 23:23:47 -07:00
docker-compose.windows.yml	feat(docker): add Windows Docker Desktop compatible compose file	2026-05-23 21:52:34 +05:30
docker-compose.yml	docs(compose): update entrypoint comment for s6-overlay	2026-05-24 18:05:33 -07:00
Dockerfile	fix(docker): redirect lazy installs to a durable target so opt-in backends work in the immutable image (#51136 )	2026-06-25 09:20:13 +10:00
flake.lock	fix nix build	2026-04-11 15:30:37 -04:00
flake.nix	feat(nix): declarative plugin installation for NixOS module (#15953 )	2026-04-28 00:18:32 +05:30
hermes	fix: use argparse entrypoint in top-level launcher (#3874 )	2026-03-29 21:54:36 -07:00
hermes-already-has-routines.md	docs: stop recommending pip install; curl installer is the only supported path (#51743 )	2026-06-24 00:14:32 -07:00
hermes_bootstrap.py	fix(docker): redirect lazy installs to a durable target so opt-in backends work in the immutable image (#51136 )	2026-06-25 09:20:13 +10:00
hermes_constants.py	fix(browser): validate agent-browser is runnable, not just present (#51740 )	2026-06-24 00:14:49 -07:00
hermes_logging.py	refactor(gateway): migrate slack/dingtalk/whatsapp/matrix/feishu/telegram/wecom/email/sms adapters to bundled plugins	2026-06-20 10:26:45 -07:00
hermes_state.py	fix(state): exclude delegate/branch/tool children from resume walk + reconcile salvaged fixes	2026-06-25 16:29:09 -07:00
hermes_time.py	fix(managed-scope): honor managed scope in all standalone config loaders	2026-06-19 07:46:33 -07:00
LICENSE	fix: restore missing MIT license file	2026-03-07 13:43:08 -08:00
MANIFEST.in	fix(packaging): ship optional-mcps catalog in wheel and sdist (#39859 )	2026-06-09 14:03:20 -04:00
mcp_serve.py	docs(sessions): clarify sessions.json is the gateway routing index, not the session list (#51726 )	2026-06-23 23:56:36 -07:00
mini_swe_runner.py	fix(swe-runner): move logging.basicConfig out of Runner __init__ into main	2026-06-21 19:02:06 -07:00
model_tools.py	feat(moa): expose MoA presets as selectable virtual models (#46081 )	2026-06-25 13:52:06 -07:00
package-lock.json	feat(desktop): in-app spot editor for the file preview pane	2026-06-25 19:50:25 -05:00
package.json	fix(desktop): pin Electron below the broken native extract-zip install (#47792 )	2026-06-17 14:42:30 -04:00
pyproject.toml	chore: release v0.17.0 (2026.6.19)	2026-06-19 12:38:31 -07:00
README.es.md	feat(i18n): add complete Spanish translation	2026-06-20 23:23:47 -07:00
README.md	feat(i18n): add complete Spanish translation	2026-06-20 23:23:47 -07:00
README.ur-pk.md	docs: add Urdu translation of README (#40578 )	2026-06-08 06:15:27 +05:30
README.zh-CN.md	docs(README.zh-CN): update Windows install from 'not supported' to native PowerShell	2026-06-20 20:42:49 -07:00
run_agent.py	feat(moa): expose MoA presets as selectable virtual models (#46081 )	2026-06-25 13:52:06 -07:00
SECURITY.es.md	feat(i18n): add complete Spanish translation	2026-06-20 23:23:47 -07:00
SECURITY.md	docs(security): enumerate cron job scripts in §2.3 credential scoping	2026-06-20 00:30:42 +05:30
setup-hermes.sh	remove Vercel AI Gateway and Vercel Sandbox (#33067 )	2026-05-27 00:43:32 -07:00
setup.py	fix(docker): support WebUI installs from read-only sources (#48541 )	2026-06-19 10:52:16 +10:00
toolset_distributions.py	feat(moa): expose MoA presets as selectable virtual models (#46081 )	2026-06-25 13:52:06 -07:00
toolsets.py	feat(tools): add project workspace tools	2026-06-25 16:40:27 -05:00
trajectory_compressor.py	fix(compressor): remove logging.basicConfig from library class __init__	2026-06-21 19:02:06 -07:00
utils.py	fix(utils): unify YAML list indent across all config writers (#31999 )	2026-06-25 23:27:44 +05:30
uv.lock	chore: release v0.17.0 (2026.6.19)	2026-06-19 12:38:31 -07:00

README.md

Hermes Agent ☤

Hermes Agent | Hermes Desktop

The self-improving AI agent built by Nous Research. It's the only agent with a built-in learning loop — it creates skills from experience, improves them during use, nudges itself to persist knowledge, searches its own past conversations, and builds a deepening model of who you are across sessions. Run it on a $5 VPS, a GPU cluster, or serverless infrastructure that costs nearly nothing when idle. It's not tied to your laptop — talk to it from Telegram while it works on a cloud VM.

Use any model you want — Nous Portal, OpenRouter (200+ models), NovitaAI (AI-native cloud for Model API, Agent Sandbox, and GPU Cloud), NVIDIA NIM (Nemotron), Xiaomi MiMo, z.ai/GLM, Kimi/Moonshot, MiniMax, Hugging Face, OpenAI, or your own endpoint. Switch with hermes model — no code changes, no lock-in.

A real terminal interface	Full TUI with multiline editing, slash-command autocomplete, conversation history, interrupt-and-redirect, and streaming tool output.
Lives where you do	Telegram, Discord, Slack, WhatsApp, Signal, and CLI — all from a single gateway process. Voice memo transcription, cross-platform conversation continuity.
A closed learning loop	Agent-curated memory with periodic nudges. Autonomous skill creation after complex tasks. Skills self-improve during use. FTS5 session search with LLM summarization for cross-session recall. Honcho dialectic user modeling. Compatible with the agentskills.io open standard.
Scheduled automations	Built-in cron scheduler with delivery to any platform. Daily reports, nightly backups, weekly audits — all in natural language, running unattended.
Delegates and parallelizes	Spawn isolated subagents for parallel workstreams. Write Python scripts that call tools via RPC, collapsing multi-step pipelines into zero-context-cost turns.
Runs anywhere, not just your laptop	Six terminal backends — local, Docker, SSH, Singularity, Modal, and Daytona. Daytona and Modal offer serverless persistence — your agent's environment hibernates when idle and wakes on demand, costing nearly nothing between sessions. Run it on a $5 VPS or a GPU cluster.
Research-ready	Batch trajectory generation, trajectory compression for training the next generation of tool-calling models.

Quick Install

Linux, macOS, WSL2, Termux

curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash

Windows (native, PowerShell)

Heads up: Native Windows runs Hermes without WSL — CLI, gateway, TUI, and tools all work natively. If you'd rather use WSL2, the Linux/macOS one-liner above works there too. Found a bug? Please file issues.

Run this in PowerShell:

iex (irm https://hermes-agent.nousresearch.com/install.ps1)

The installer handles everything: uv, Python 3.11, Node.js, ripgrep, ffmpeg, and a portable Git Bash (MinGit, unpacked to %LOCALAPPDATA%\hermes\git — no admin required, completely isolated from any system Git install). Hermes uses this bundled Git Bash to run shell commands.

If you already have Git installed, the installer detects it and uses that instead. Otherwise a ~45MB MinGit download is all you need — it won't touch or interfere with any system Git.

Android / Termux: The tested manual path is documented in the Termux guide. On Termux, Hermes installs a curated .[termux] extra because the full .[all] extra currently pulls Android-incompatible voice dependencies.

Windows: Native Windows is fully supported. Native installs live under %LOCALAPPDATA%\hermes; WSL2 installs live under ~/.hermes, as on Linux.

After installation:

source ~/.bashrc    # reload shell (or: source ~/.zshrc)
hermes              # start chatting!

Troubleshooting

Windows Defender or antivirus flags `uv.exe` as malware

If your antivirus (Bitdefender, Windows Defender, etc.) quarantines uv.exe from the Hermes bin folder (%LOCALAPPDATA%\hermes\bin\uv.exe), this is a false positive. The file is Astral's uv, the Python package manager (written in Rust) that Hermes bundles to manage its Python environment. ML-based antivirus engines commonly flag unsigned Rust binaries that download and install packages.

To verify your copy is authentic:

# Install GitHub CLI if needed
winget install --id GitHub.cli

# Login to GitHub
gh auth login

# Run verification
$uv = "$env:LOCALAPPDATA\hermes\bin\uv.exe"
$ver = (& $uv --version).Split(' ')[1]
[Net.ServicePointManager]::SecurityProtocol = [Net.SecurityProtocolType]::Tls12
$zip = "$env:TEMP\uv.zip"
Invoke-WebRequest "https://github.com/astral-sh/uv/releases/download/$ver/uv-x86_64-pc-windows-msvc.zip" -OutFile $zip -UseBasicParsing
gh attestation verify $zip --repo astral-sh/uv
Expand-Archive $zip "$env:TEMP\uv_x" -Force
(Get-FileHash "$env:TEMP\uv_x\uv.exe").Hash -eq (Get-FileHash $uv).Hash

If attestation says "Verification succeeded" and the last line prints True, you're good.
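
The final step of the script above compares the hash of the installed binary with the hash of the freshly downloaded one (Get-FileHash defaults to SHA-256). The same comparison can be sketched in Python for anyone who would rather script it outside PowerShell. The function names are illustrative and are not part of Hermes or uv, and this sketch covers only the hash comparison, not the attestation check.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks to bound memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_binary(installed: Path, reference: Path) -> bool:
    """True when both files have identical contents, judged by SHA-256."""
    return sha256_of(installed) == sha256_of(reference)
```

A call such as same_binary(Path(installed_uv), Path(extracted_uv)) plays the role of the last PowerShell line; both paths are placeholders to fill in with your own locations.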

To whitelist Hermes:

Windows Defender: Run PowerShell as Admin → Add-MpPreference -ExclusionPath "$env:LOCALAPPDATA\hermes\bin"
Bitdefender: Add an exception in the Bitdefender console (Protection > Antivirus > Settings > Manage Exceptions)
Whitelist the folder, not the file hash — Hermes updates uv and the hash changes every version

For more context, see the upstream Astral reports: astral-sh/uv#13553, astral-sh/uv#15011, astral-sh/uv#10079.

Getting Started

hermes              # Interactive CLI — start a conversation
hermes model        # Choose your LLM provider and model
hermes tools        # Configure which tools are enabled
hermes config set   # Set individual config values
hermes gateway      # Start the messaging gateway (Telegram, Discord, etc.)
hermes setup        # Run the full setup wizard (configures everything at once)
hermes claw migrate # Migrate from OpenClaw (if coming from OpenClaw)
hermes update       # Update to the latest version
hermes doctor       # Diagnose any issues

📖 Full documentation →

Skip the API-key collection — Nous Portal

Hermes works with whatever provider you want — that's not changing. But if you'd rather not collect five separate API keys for the model, web search, image generation, TTS, and a cloud browser, Nous Portal covers all of them under one subscription:

300+ models — pick any of them with /model <name>
Tool Gateway — web search (Firecrawl), image generation (FAL), text-to-speech (OpenAI), cloud browser (Browser Use), all routed through your sub. No extra accounts.

One command from a fresh install:

hermes setup --portal

That logs you in via OAuth, sets Nous as your provider, and turns on the Tool Gateway. Check what's wired up any time with hermes portal info. Full details on the Tool Gateway docs page.

You can still bring your own keys per-tool whenever you want — the gateway is per-backend, not all-or-nothing.

CLI vs Messaging Quick Reference

Hermes has two entry points: start the terminal UI with hermes, or run the gateway and talk to it from Telegram, Discord, Slack, WhatsApp, Signal, or Email. Once you're in a conversation, many slash commands are shared across both interfaces.

Action	CLI	Messaging platforms
Start chatting	`hermes`	Run `hermes gateway setup` + `hermes gateway start`, then send the bot a message
Start fresh conversation	`/new` or `/reset`	`/new` or `/reset`
Change model	`/model [provider:model]`	`/model [provider:model]`
Set a personality	`/personality [name]`	`/personality [name]`
Retry or undo the last turn	`/retry`, `/undo`	`/retry`, `/undo`
Compress context / check usage	`/compress`, `/usage`, `/insights [--days N]`	`/compress`, `/usage`, `/insights [days]`
Browse skills	`/skills` or `/<skill-name>`	`/<skill-name>`
Interrupt current work	`Ctrl+C` or send a new message	`/stop` or send a new message
Platform-specific status	`/platforms`	`/status`, `/sethome`

For the full command lists, see the CLI guide and the Messaging Gateway guide.

Documentation

All documentation lives at hermes-agent.nousresearch.com/docs:

Section	What's Covered
Quickstart	Install → setup → first conversation in 2 minutes
CLI Usage	Commands, keybindings, personalities, sessions
Configuration	Config file, providers, models, all options
Messaging Gateway	Telegram, Discord, Slack, WhatsApp, Signal, Home Assistant
Security	Command approval, DM pairing, container isolation
Tools & Toolsets	40+ tools, toolset system, terminal backends
Skills System	Procedural memory, Skills Hub, creating skills
Memory	Persistent memory, user profiles, best practices
MCP Integration	Connect any MCP server for extended capabilities
Cron Scheduling	Scheduled tasks with platform delivery
Context Files	Project context that shapes every conversation
Architecture	Project structure, agent loop, key classes
Contributing	Development setup, PR process, code style
CLI Reference	All commands and flags
Environment Variables	Complete env var reference

Migrating from OpenClaw

If you're coming from OpenClaw, Hermes can automatically import your settings, memories, skills, and API keys.

During first-time setup: The setup wizard (hermes setup) automatically detects ~/.openclaw and offers to migrate before configuration begins.

Anytime after install:

hermes claw migrate              # Interactive migration (full preset)
hermes claw migrate --dry-run    # Preview what would be migrated
hermes claw migrate --preset user-data   # Migrate without secrets
hermes claw migrate --overwrite  # Overwrite existing conflicts

What gets imported:

SOUL.md — persona file
Memories — MEMORY.md and USER.md entries
Skills — user-created skills → ~/.hermes/skills/openclaw-imports/
Command allowlist — approval patterns
Messaging settings — platform configs, allowed users, working directory
API keys — allowlisted secrets (Telegram, OpenRouter, OpenAI, Anthropic, ElevenLabs)
TTS assets — workspace audio files
Workspace instructions — AGENTS.md (with --workspace-target)

See hermes claw migrate --help for all options, or use the openclaw-migration skill for an interactive agent-guided migration with dry-run previews.

Contributing

We welcome contributions! See the Contributing Guide for development setup, code style, and PR process.

Quick start for contributors — use the standard installer, then work from the full git checkout it creates at $HERMES_HOME/hermes-agent (usually ~/.hermes/hermes-agent). This matches the layout used by hermes update, the managed venv, lazy dependencies, gateway, and docs tooling.

curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
cd "${HERMES_HOME:-$HOME/.hermes}/hermes-agent"
uv pip install -e ".[all,dev]"
scripts/run_tests.sh
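
The cd line above relies on the shell expansion ${HERMES_HOME:-$HOME/.hermes}, which falls back to ~/.hermes when HERMES_HOME is unset or empty. For tooling that needs the same checkout path from Python, a minimal sketch of that resolution might look like this. The function name is hypothetical and is not a Hermes API.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def hermes_checkout_dir(env: Optional[Mapping[str, str]] = None) -> Path:
    """Mirror "${HERMES_HOME:-$HOME/.hermes}/hermes-agent" from the shell snippet."""
    env = os.environ if env is None else env
    # ":-" in the shell treats an empty value like an unset one, so use "or".
    hermes_home = env.get("HERMES_HOME") or str(
        Path(env.get("HOME", str(Path.home()))) / ".hermes"
    )
    return Path(hermes_home) / "hermes-agent"
```

Passing an explicit mapping instead of reading os.environ directly keeps the helper easy to test without touching the real environment.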

Manual clone fallback (for throwaway clones/CI where you intentionally do not want the managed install layout):

curl -LsSf https://astral.sh/uv/install.sh | sh
uv venv .venv --python 3.11
source .venv/bin/activate
uv pip install -e ".[all,dev]"
scripts/run_tests.sh

Community

💬 Discord
📚 Skills Hub
🐛 Issues
🔌 computer-use-linux — Linux desktop-control MCP server for Hermes and other MCP hosts, with AT-SPI accessibility trees, Wayland/X11 input, screenshots, and compositor window targeting.
🔌 HermesClaw — Community WeChat bridge: Run Hermes Agent and OpenClaw on the same WeChat account.

License

MIT — see LICENSE.

Built by Nous Research.