mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-06-21 10:22:18 +00:00


Siddharth Balyan 9362ce2575 feat(skills): add html-artifact skill, fold in sketch + architecture-diagram + concept-diagrams (#48899)	2026-06-19 08:02:31 +00:00

* feat(skills): add html-artifact skill, fold in sketch + architecture-diagram + concept-diagrams

Adds a unified `html-artifact` creative skill that produces self-contained, single-file HTML artifacts: concept explainers, implementation plans, status/incident reports, code-review walkthroughs, technical + educational SVG diagrams, multi-variant design comparisons, and throwaway editors that export their state back to the clipboard. Grounded in Anthropic's html-effectiveness gallery (MIT); the house style (token block, serif/sans/mono split, hand-rolled diffs, inline-SVG diagrams, graceful degradation) is distilled from reading all 20 reference files.

Supersedes and removes three overlapping skills, folding their unique value in:
- sketch -> the fidelity dial (throwaway vs presentation) + the multi-variant comparison layouts + the browser-vision verify loop (references/fidelity-and-verify.md)
- architecture-diagram -> the dark "infra" token variant + double-rect masking + semantic component palette (references/dark-tech.md, templates/diagram.html infra mode)
- concept-diagrams -> the 9-ramp educational color system + the concept archetype library (references/concept-archetypes.md, the light design system in templates/diagram.html)

Structure:
- SKILL.md (description exactly 60 chars), 6 references, 3 templates
- templates verified by headless-Chrome render + vision inspection
- editor export logic (file://-safe clipboard, Promise-normalized) verified in node

Cross-references updated in claude-design (new disambiguation table row drawing the design-taste vs information-artifact boundary), design-md, pretext, spike, and kanban-video-orchestrator. Website skill docs + catalogs regenerated; stale EN/zh-Hans per-skill pages pruned and i18n cross-refs fixed.

Not folded (intentionally orthogonal): excalidraw (.excalidraw JSON), p5js (generative canvas), claude-design / popular-web-designs / design-md (visual design taste / brand vocab / token spec).

* feat(skills): ship html-effectiveness gallery as fetched reference examples

Add scripts/fetch-examples.sh (idempotent clone/pull of Anthropic's MIT html-effectiveness gallery) + references/examples.md mapping each of the 20 example files to a mode so the agent reads the right worked example. The clone lands in references/examples/ and is gitignored (it's a 384KB upstream repo, not vendored). SKILL.md workflow + reference list now point at it; falls back to the distilled pattern references when offline.

* feat(skills): make reading a gallery example a required authoring step

Reading the matching html-effectiveness example is now workflow step 2 (was an optional aside in step 3): fetch the gallery, read_file the file for your mode, mirror its structure. Models skip optional steps; the examples are the ground truth, so consulting one is mandatory. Added an 'Example' column to the mode->build quick-reference table and a 'don't skip the example' pitfall. Also dogfooded the skill: read 03-code-review-pr.html and 13-flowchart-diagram.html raw and reconciled the distilled references against source: aligned diff-row tint opacity to the source's 0.15 (was 0.18) and added the .ctx/.hunk rows in house-style.md + base.html so they match 03-code-review-pr.html verbatim.

* docs(skills): explain the consolidation + bundled-vs-optional rationale

The supersession note only stated what was folded, not why the prune is sound. Expand SKILL.md's intro into a 'Why this skill exists' section: the three former skills emitted the same artifact and overlapped, so consolidating removes which-one-do-I-load ambiguity; and the optional->bundled promotion of concept-diagrams is footprint-safe because this skill has zero deps (only cost is the 60-char description; everything else is progressive-disclosure). States the bundling dividing line explicitly: zero install cost + broadly useful gets bundled, real install cost (hyperframes: Node+FFmpeg+Chromium) stays optional. Regenerated website per-skill page to match.
.github	feat(billing): /billing terminal billing — interactive TUI + CLI client (#45449 )	2026-06-19 01:53:32 +05:30
.plans	Merge PR #724 : feat: --yolo flag to bypass all approval prompts	2026-03-10 20:56:30 -07:00
acp_adapter	fix(acp): preserve memory provider tools	2026-06-13 04:51:44 -07:00
acp_registry	chore: release v0.16.0 (2026.6.5) (#40206 )	2026-06-05 17:55:43 -07:00
agent	feat(image-gen): add image-to-image / editing to image_generate (#48705 )	2026-06-18 22:13:07 -07:00
apps	feat(model-picker): add Refresh Models control to bust stale model cache (#48691 )	2026-06-18 21:37:41 -07:00
assets	Update banner image to new version	2026-02-25 11:53:44 -08:00
cron	fix: complete cron jobs lock salvage	2026-06-15 06:29:00 -07:00
datagen-config-examples	feat: add WebResearchEnv RL environment for multi-step web research	2026-03-05 14:34:36 +00:00
docker	fix(update): scope install-method stamp to the code tree, not $HERMES_HOME (#48188 )	2026-06-18 14:14:41 +10:00
docs	feat(relay): WS-only inbound on the gateway adapter (Phase 3) (#48294 )	2026-06-19 09:33:15 +10:00
gateway	fix(relay): make hosted gateways actually connect AND complete the inbound/outbound round-trip (#48828 )	2026-06-19 16:30:24 +10:00
hermes_cli	refactor(dashboard): align Slack allowlist validation with gateway parse	2026-06-19 12:22:30 +05:30
locales	feat(status): restore model and context in gateway status	2026-06-15 07:46:34 -07:00
nix	fix(nix): refresh npmDepsHash after the Electron 40.10.2 pin (#47792 ) (#48457 )	2026-06-18 15:00:08 +00:00
optional-mcps	feat(mcp-catalog): add official Unreal Engine 5.8 MCP server	2026-06-18 09:16:40 -07:00
optional-skills	feat(skills): add html-artifact skill, fold in sketch + architecture-diagram + concept-diagrams (#48899 )	2026-06-19 08:02:31 +00:00
packaging/homebrew	chore: prepare Hermes for Homebrew packaging (#4099 )	2026-03-30 17:34:43 -07:00
plans	fix(gemini): tighten native routing and streaming replay	2026-04-19 12:40:08 -07:00
plugins	feat(image-gen): add image-to-image / editing to image_generate (#48705 )	2026-06-18 22:13:07 -07:00
providers	fix(models): pass model.base_url to fetch_models in /model picker	2026-06-16 13:09:40 -07:00
scripts	fix(dashboard): resolve chat TUI argv off event loop (#48561 )	2026-06-18 22:20:52 -04:00
skills	feat(skills): add html-artifact skill, fold in sketch + architecture-diagram + concept-diagrams (#48899 )	2026-06-19 08:02:31 +00:00
tests	refactor(dashboard): align Slack allowlist validation with gateway parse	2026-06-19 12:22:30 +05:30
tools	Merge pull request #48259 from NousResearch/fix/ns501-multipart-upload-salvage	2026-06-19 12:03:58 +05:30
tui_gateway	feat(model-picker): add Refresh Models control to bust stale model cache (#48691 )	2026-06-18 21:37:41 -07:00
ui-tui	refactor(tui): reuse DASHBOARD_TUI_MODE for hosted /exit guard	2026-06-19 12:59:52 +05:30
web	refactor(dashboard): align Slack allowlist validation with gateway parse	2026-06-19 12:22:30 +05:30
website	feat(skills): add html-artifact skill, fold in sketch + architecture-diagram + concept-diagrams (#48899 )	2026-06-19 08:02:31 +00:00
.dockerignore	fix(docker): support WebUI installs from read-only sources (#48541 )	2026-06-19 10:52:16 +10:00
.env.example	Add Hermes desktop app (#20059 )	2026-05-31 17:46:56 -05:00
.envrc	fix(node/nix): consolidate workspace lockfile + update all consumers	2026-06-02 20:28:18 -04:00
.gitattributes	chore: enforce LF line endings for container entrypoints (#12181 )	2026-06-05 09:54:01 +10:00
.gitignore	fix(docker): supervised gateway uses --replace to take over stale holder (NS-505) (#47555 )	2026-06-18 10:49:02 +10:00
.hadolint.yaml	feat(docker): remove gosu from bundled image; s6-setuidgid handles privilege drop	2026-05-24 18:05:33 -07:00
.mailmap	chore: add MestreY0d4-Uninter to AUTHOR_MAP and .mailmap	2026-04-15 15:03:28 -07:00
AGENTS.md	change(tooling): typecheck in CI, update ts to 6	2026-06-10 11:59:34 -04:00
batch_runner.py	feat(azure-foundry): add Microsoft Entra ID auth	2026-05-18 10:14:38 -07:00
cli-config.yaml.example	fix(telegram): edit streamed previews in place as rich (Bot API 10.1) (#46890 )	2026-06-16 05:26:04 -07:00
cli.py	feat(cli): lock hermes worktrees so concurrent processes can't clobber them	2026-06-18 19:15:04 -07:00
constraints-termux.txt	feat: add tested Termux install path and EOF-aware gh auth	2026-04-09 16:24:53 -07:00
CONTRIBUTING.md	docs: recommend standard installer for development (#46646 )	2026-06-15 06:14:57 -07:00
docker-compose.windows.yml	feat(docker): add Windows Docker Desktop compatible compose file	2026-05-23 21:52:34 +05:30
docker-compose.yml	docs(compose): update entrypoint comment for s6-overlay	2026-05-24 18:05:33 -07:00
Dockerfile	fix(update): scope install-method stamp to the code tree, not $HERMES_HOME (#48188 )	2026-06-18 14:14:41 +10:00
flake.lock	fix nix build	2026-04-11 15:30:37 -04:00
flake.nix	feat(nix): declarative plugin installation for NixOS module (#15953 )	2026-04-28 00:18:32 +05:30
hermes	fix: use argparse entrypoint in top-level launcher (#3874 )	2026-03-29 21:54:36 -07:00
hermes-already-has-routines.md	docs: finish Automation Blueprints terminology rebrand (#44470 )	2026-06-11 17:22:22 -04:00
hermes_bootstrap.py	hermes_bootstrap: Windows-only UTF-8 stdio shim for all entry points	2026-05-08 14:27:40 -07:00
hermes_constants.py	fix(cli): detect containerd/CRI cgroup-v2 containers in is_container() (#47131 )	2026-06-17 12:11:31 +10:00
hermes_logging.py	fix(logging): alias RotatingFileHandler to concurrent-log-handler (salvage #44921 ) (#46794 )	2026-06-17 15:39:04 -05:00
hermes_state.py	fix(agent): rebuild base fts without trigram	2026-06-18 19:14:52 -07:00
hermes_time.py	fix(hermes_time): implement reset_cache() referenced in docstrings (#41728 )	2026-06-07 22:08:01 -07:00
LICENSE	fix: restore missing MIT license file	2026-03-07 13:43:08 -08:00
MANIFEST.in	fix(packaging): ship optional-mcps catalog in wheel and sdist (#39859 )	2026-06-09 14:03:20 -04:00
mcp_serve.py	chore: ruff auto-fix PLR6201 — tuple → set in membership tests (#23937 )	2026-05-11 11:13:25 -07:00
mini_swe_runner.py	chore: prune unused imports and duplicate import redefinitions	2026-05-28 22:26:25 -07:00
model_tools.py	fix(dispatch): forward session_id into registry.dispatch (#28479 )	2026-06-14 00:27:59 -04:00
package-lock.json	fix(npm): lock react-simple-icons to 13.11.1	2026-06-18 17:41:58 -04:00
package.json	fix(desktop): pin Electron below the broken native extract-zip install (#47792 )	2026-06-17 14:42:30 -04:00
pyproject.toml	fix(dashboard): clean up upload temp file on client disconnect + pin python-multipart (NS-501)	2026-06-18 11:32:18 +05:30
README.md	docs: recommend standard installer for development (#46646 )	2026-06-15 06:14:57 -07:00
README.ur-pk.md	docs: add Urdu translation of README (#40578 )	2026-06-08 06:15:27 +05:30
README.zh-CN.md	docs: recommend standard installer for development (#46646 )	2026-06-15 06:14:57 -07:00
run_agent.py	fix(agent): summarize structured provider error messages	2026-06-18 21:37:52 -07:00
SECURITY.md	changes from feedback	2026-05-05 22:45:12 -04:00
setup-hermes.sh	remove Vercel AI Gateway and Vercel Sandbox (#33067 )	2026-05-27 00:43:32 -07:00
setup.py	fix(docker): support WebUI installs from read-only sources (#48541 )	2026-06-19 10:52:16 +10:00
toolset_distributions.py	chore: fix 154 f-strings, simplify getattr/URL patterns, remove dead code (#3119 )	2026-03-25 19:47:58 -07:00
toolsets.py	refactor: remove agent-callable send_message tool (#47856 )	2026-06-17 07:11:23 -07:00
trajectory_compressor.py	fix(research): keep tool_call/tool_response pairs intact when compressing trajectories	2026-06-07 05:01:27 -07:00
utils.py	fix(utils): copy fallback for atomic replace across devices (#43852 )	2026-06-13 14:50:05 -07:00
uv.lock	fix(dashboard): clean up upload temp file on client disconnect + pin python-multipart (NS-501)	2026-06-18 11:32:18 +05:30

README.md

Hermes Agent ☤

Hermes Agent | Hermes Desktop

The self-improving AI agent built by Nous Research. It's the only agent with a built-in learning loop — it creates skills from experience, improves them during use, nudges itself to persist knowledge, searches its own past conversations, and builds a deepening model of who you are across sessions. Run it on a $5 VPS, a GPU cluster, or serverless infrastructure that costs nearly nothing when idle. It's not tied to your laptop — talk to it from Telegram while it works on a cloud VM.

Use any model you want — Nous Portal, OpenRouter (200+ models), NovitaAI (AI-native cloud for Model API, Agent Sandbox, and GPU Cloud), NVIDIA NIM (Nemotron), Xiaomi MiMo, z.ai/GLM, Kimi/Moonshot, MiniMax, Hugging Face, OpenAI, or your own endpoint. Switch with hermes model — no code changes, no lock-in.

A real terminal interface	Full TUI with multiline editing, slash-command autocomplete, conversation history, interrupt-and-redirect, and streaming tool output.
Lives where you do	Telegram, Discord, Slack, WhatsApp, Signal, and CLI — all from a single gateway process. Voice memo transcription, cross-platform conversation continuity.
A closed learning loop	Agent-curated memory with periodic nudges. Autonomous skill creation after complex tasks. Skills self-improve during use. FTS5 session search with LLM summarization for cross-session recall. Honcho dialectic user modeling. Compatible with the agentskills.io open standard.
Scheduled automations	Built-in cron scheduler with delivery to any platform. Daily reports, nightly backups, weekly audits — all in natural language, running unattended.
Delegates and parallelizes	Spawn isolated subagents for parallel workstreams. Write Python scripts that call tools via RPC, collapsing multi-step pipelines into zero-context-cost turns.
Runs anywhere, not just your laptop	Six terminal backends — local, Docker, SSH, Singularity, Modal, and Daytona. Daytona and Modal offer serverless persistence — your agent's environment hibernates when idle and wakes on demand, costing nearly nothing between sessions. Run it on a $5 VPS or a GPU cluster.
Research-ready	Batch trajectory generation and trajectory compression for training the next generation of tool-calling models.

Quick Install

Linux, macOS, WSL2, Termux

curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash

Windows (native, PowerShell)

Heads up: Native Windows runs Hermes without WSL — CLI, gateway, TUI, and tools all work natively. If you'd rather use WSL2, the Linux/macOS one-liner above works there too. Found a bug? Please file issues.

Run this in PowerShell:

iex (irm https://hermes-agent.nousresearch.com/install.ps1)

The installer handles everything: uv, Python 3.11, Node.js, ripgrep, ffmpeg, and a portable Git Bash (MinGit, unpacked to %LOCALAPPDATA%\hermes\git — no admin required, completely isolated from any system Git install). Hermes uses this bundled Git Bash to run shell commands.

If you already have Git installed, the installer detects it and uses that instead. Otherwise a ~45MB MinGit download is all you need — it won't touch or interfere with any system Git.

Android / Termux: The tested manual path is documented in the Termux guide. On Termux, Hermes installs a curated .[termux] extra because the full .[all] extra currently pulls Android-incompatible voice dependencies.

Windows install locations: A native Windows install lives under %LOCALAPPDATA%\hermes; a WSL2 install lives under ~/.hermes, as on Linux.

After installation:

source ~/.bashrc    # reload shell (or: source ~/.zshrc)
hermes              # start chatting!
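
If the hermes command is not found after reloading the shell, a quick PATH check can narrow down the cause. The following is a minimal sketch, not part of the installer; have_cmd is a hypothetical helper name introduced here for illustration, and hermes doctor is the diagnostic command listed under Getting Started.

```shell
# have_cmd is an illustrative helper (not part of Hermes): it succeeds
# when the named command resolves on the current PATH.
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

if have_cmd hermes; then
  echo "hermes found at: $(command -v hermes)"
  echo "next: run 'hermes doctor' to diagnose any issues"
else
  echo "hermes is not on PATH yet; reload your shell (source ~/.bashrc or ~/.zshrc)"
fi
```

The check only reports where the command resolves; it does not start Hermes or change any configuration.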

Getting Started

hermes              # Interactive CLI — start a conversation
hermes model        # Choose your LLM provider and model
hermes tools        # Configure which tools are enabled
hermes config set   # Set individual config values
hermes gateway      # Start the messaging gateway (Telegram, Discord, etc.)
hermes setup        # Run the full setup wizard (configures everything at once)
hermes claw migrate # Migrate from OpenClaw (if coming from OpenClaw)
hermes update       # Update to the latest version
hermes doctor       # Diagnose any issues

📖 Full documentation →

Skip the API-key collection — Nous Portal

Hermes works with whatever provider you want — that's not changing. But if you'd rather not collect five separate API keys for the model, web search, image generation, TTS, and a cloud browser, Nous Portal covers all of them under one subscription:

300+ models — pick any of them with /model <name>
Tool Gateway — web search (Firecrawl), image generation (FAL), text-to-speech (OpenAI), cloud browser (Browser Use), all routed through your sub. No extra accounts.

One command from a fresh install:

hermes setup --portal

That logs you in via OAuth, sets Nous as your provider, and turns on the Tool Gateway. Check what's wired up at any time with hermes portal info. Full details are on the Tool Gateway docs page.

You can still bring your own keys per-tool whenever you want — the gateway is per-backend, not all-or-nothing.

CLI vs Messaging Quick Reference

Hermes has two entry points: start the terminal UI with hermes, or run the gateway and talk to it from Telegram, Discord, Slack, WhatsApp, Signal, or Email. Once you're in a conversation, many slash commands are shared across both interfaces.

Action	CLI	Messaging platforms
Start chatting	`hermes`	Run `hermes gateway setup` + `hermes gateway start`, then send the bot a message
Start fresh conversation	`/new` or `/reset`	`/new` or `/reset`
Change model	`/model [provider:model]`	`/model [provider:model]`
Set a personality	`/personality [name]`	`/personality [name]`
Retry or undo the last turn	`/retry`, `/undo`	`/retry`, `/undo`
Compress context / check usage	`/compress`, `/usage`, `/insights [--days N]`	`/compress`, `/usage`, `/insights [days]`
Browse skills	`/skills` or `/<skill-name>`	`/<skill-name>`
Interrupt current work	`Ctrl+C` or send a new message	`/stop` or send a new message
Platform-specific status	`/platforms`	`/status`, `/sethome`

For the full command lists, see the CLI guide and the Messaging Gateway guide.

Documentation

All documentation lives at hermes-agent.nousresearch.com/docs:

Section	What's Covered
Quickstart	Install → setup → first conversation in 2 minutes
CLI Usage	Commands, keybindings, personalities, sessions
Configuration	Config file, providers, models, all options
Messaging Gateway	Telegram, Discord, Slack, WhatsApp, Signal, Home Assistant
Security	Command approval, DM pairing, container isolation
Tools & Toolsets	40+ tools, toolset system, terminal backends
Skills System	Procedural memory, Skills Hub, creating skills
Memory	Persistent memory, user profiles, best practices
MCP Integration	Connect any MCP server for extended capabilities
Cron Scheduling	Scheduled tasks with platform delivery
Context Files	Project context that shapes every conversation
Architecture	Project structure, agent loop, key classes
Contributing	Development setup, PR process, code style
CLI Reference	All commands and flags
Environment Variables	Complete env var reference

Migrating from OpenClaw

If you're coming from OpenClaw, Hermes can automatically import your settings, memories, skills, and API keys.

During first-time setup: The setup wizard (hermes setup) automatically detects ~/.openclaw and offers to migrate before configuration begins.

Anytime after install:

hermes claw migrate              # Interactive migration (full preset)
hermes claw migrate --dry-run    # Preview what would be migrated
hermes claw migrate --preset user-data   # Migrate without secrets
hermes claw migrate --overwrite  # Overwrite existing conflicts

What gets imported:

SOUL.md — persona file
Memories — MEMORY.md and USER.md entries
Skills — user-created skills → ~/.hermes/skills/openclaw-imports/
Command allowlist — approval patterns
Messaging settings — platform configs, allowed users, working directory
API keys — allowlisted secrets (Telegram, OpenRouter, OpenAI, Anthropic, ElevenLabs)
TTS assets — workspace audio files
Workspace instructions — AGENTS.md (with --workspace-target)

See hermes claw migrate --help for all options, or use the openclaw-migration skill for an interactive agent-guided migration with dry-run previews.
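
A cautious migration might preview first and import secrets only later. The sketch below chains the documented --dry-run and --preset user-data flags; migrate_from_openclaw is a hypothetical wrapper name, not a Hermes command, and the ~/.openclaw check simply mirrors the location the setup wizard is described as detecting.

```shell
# migrate_from_openclaw is an illustrative wrapper, not a Hermes command.
# It assumes OpenClaw data lives in ~/.openclaw, the directory the setup
# wizard is documented to detect.
migrate_from_openclaw() {
  if [ ! -d "$HOME/.openclaw" ]; then
    echo "no OpenClaw directory at $HOME/.openclaw; nothing to migrate"
    return 0
  fi
  # Preview what would be migrated; stop if the preview itself fails.
  hermes claw migrate --dry-run || return 1
  # Then migrate without secrets.
  hermes claw migrate --preset user-data
}

# Usage: migrate_from_openclaw
```

Running the full preset afterwards (plain hermes claw migrate) would bring in the allowlisted API keys as well.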

Contributing

We welcome contributions! See the Contributing Guide for development setup, code style, and PR process.

Quick start for contributors — use the standard installer, then work from the full git checkout it creates at $HERMES_HOME/hermes-agent (usually ~/.hermes/hermes-agent). This matches the layout used by hermes update, the managed venv, lazy dependencies, gateway, and docs tooling.

curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
cd "${HERMES_HOME:-$HOME/.hermes}/hermes-agent"
uv pip install -e ".[all,dev]"
scripts/run_tests.sh
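
The cd line above relies on shell default-value expansion to find the checkout. As a small illustration of how that path resolves (hermes_checkout_dir is a hypothetical helper name used only for this sketch):

```shell
# hermes_checkout_dir is an illustrative helper, not part of Hermes.
# ${HERMES_HOME:-$HOME/.hermes} expands to $HERMES_HOME when it is set and
# non-empty, and to $HOME/.hermes otherwise.
hermes_checkout_dir() {
  printf '%s\n' "${HERMES_HOME:-$HOME/.hermes}/hermes-agent"
}

hermes_checkout_dir
```

With HERMES_HOME unset this prints the path under ~/.hermes; exporting HERMES_HOME to another directory changes the result accordingly.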

Manual clone fallback (for throwaway clones/CI where you intentionally do not want the managed install layout):

curl -LsSf https://astral.sh/uv/install.sh | sh
uv venv .venv --python 3.11
source .venv/bin/activate
uv pip install -e ".[all,dev]"
scripts/run_tests.sh

Community

💬 Discord
📚 Skills Hub
🐛 Issues
🔌 computer-use-linux — Linux desktop-control MCP server for Hermes and other MCP hosts, with AT-SPI accessibility trees, Wayland/X11 input, screenshots, and compositor window targeting.
🔌 HermesClaw — Community WeChat bridge: Run Hermes Agent and OpenClaw on the same WeChat account.

License

MIT — see LICENSE.

Built by Nous Research.