hermes-agent/website/docs/reference/skills-catalog.md
Siddharth Balyan 9362ce2575
feat(skills): add html-artifact skill, fold in sketch + architecture-diagram + concept-diagrams (#48899)
* feat(skills): add html-artifact skill, fold in sketch + architecture-diagram + concept-diagrams

Adds a unified `html-artifact` creative skill that produces self-contained,
single-file HTML artifacts — concept explainers, implementation plans,
status/incident reports, code-review walkthroughs, technical + educational
SVG diagrams, multi-variant design comparisons, and throwaway editors that
export their state back to the clipboard. Grounded in Anthropic's
html-effectiveness gallery (MIT); the house style (token block, serif/sans/
mono split, hand-rolled diffs, inline-SVG diagrams, graceful degradation) is
distilled from reading all 20 reference files.

Supersedes and removes three overlapping skills, folding their unique value in:
- sketch              -> the fidelity dial (throwaway vs presentation) + the
                         multi-variant comparison layouts + the browser-vision
                         verify loop (references/fidelity-and-verify.md)
- architecture-diagram-> the dark "infra" token variant + double-rect masking +
                         semantic component palette (references/dark-tech.md,
                         templates/diagram.html infra mode)
- concept-diagrams    -> the 9-ramp educational color system + the concept
                         archetype library (references/concept-archetypes.md,
                         the light design system in templates/diagram.html)

Structure:
- SKILL.md (description exactly 60 chars), 6 references, 3 templates
- templates verified by headless-Chrome render + vision inspection
- editor export logic (file://-safe clipboard, Promise-normalized) verified in node

Cross-references updated in claude-design (new disambiguation table row drawing
the design-taste vs information-artifact boundary), design-md, pretext, spike,
and kanban-video-orchestrator. Website skill docs + catalogs regenerated;
stale EN/zh-Hans per-skill pages pruned and i18n cross-refs fixed.

Not folded (intentionally orthogonal): excalidraw (.excalidraw JSON), p5js
(generative canvas), claude-design / popular-web-designs / design-md (visual
design taste / brand vocab / token spec).

* feat(skills): ship html-effectiveness gallery as fetched reference examples

Add scripts/fetch-examples.sh (idempotent clone/pull of Anthropic's MIT
html-effectiveness gallery) + references/examples.md mapping each of the 20
example files to a mode so the agent reads the right worked example. The clone
lands in references/examples/ and is gitignored (it's a 384KB upstream repo,
not vendored). SKILL.md workflow + reference list now point at it; falls back to
the distilled pattern references when offline.

* feat(skills): make reading a gallery example a required authoring step

Reading the matching html-effectiveness example is now workflow step 2 (was an
optional aside in step 3): fetch the gallery, read_file the file for your mode,
mirror its structure. Models skip optional steps; the examples are the ground
truth, so consulting one is mandatory. Added an 'Example' column to the
mode->build quick-reference table and a 'don't skip the example' pitfall.

Also dogfooded the skill: read 03-code-review-pr.html and 13-flowchart-diagram.html
raw and reconciled the distilled references against source — aligned diff-row tint
opacity to the source's 0.15 (was 0.18) and added the .ctx/.hunk rows in
house-style.md + base.html so they match 03-code-review-pr.html verbatim.

* docs(skills): explain the consolidation + bundled-vs-optional rationale

The supersession note only stated *what* was folded, not *why* the prune is
sound. Expand SKILL.md's intro into a 'Why this skill exists' section: the three
former skills emitted the same artifact and overlapped, so consolidating removes
which-one-do-I-load ambiguity; and the optional->bundled promotion of
concept-diagrams is footprint-safe because this skill has zero deps (only cost is
the 60-char description; everything else is progressive-disclosure). States the
bundling dividing line explicitly: zero install cost + broadly useful gets
bundled, real install cost (hyperframes: Node+FFmpeg+Chromium) stays optional.

Regenerated website per-skill page to match.
2026-06-19 08:02:31 +00:00

16 KiB

sidebar_position title description
5 Bundled Skills Catalog Catalog of bundled skills that ship with Hermes Agent

Bundled Skills Catalog

Hermes ships with a large built-in skill library copied into ~/.hermes/skills/ on install. Each skill below links to a dedicated page with its full definition, setup, and usage.

Hermes also syncs bundled skills on hermes update, but the sync manifest respects local deletions and user edits. If a skill listed here is missing from your profile's ~/.hermes/skills/ tree, it is still shipped with Hermes; restore it with hermes skills reset <name> --restore.

If a skill is missing from this list but present in the repo, the catalog is regenerated by website/scripts/generate-skill-docs.py.

apple

Skill Description Path
apple-notes Manage Apple Notes via memo CLI: create, search, edit. apple/apple-notes
apple-reminders Apple Reminders via remindctl: add, list, complete. apple/apple-reminders
findmy Track Apple devices/AirTags via FindMy.app on macOS. apple/findmy
imessage Send and receive iMessages/SMS via the imsg CLI on macOS. apple/imessage
macos-computer-use Drive the macOS desktop in the background — screenshots, mouse, keyboard, scroll, drag — without stealing the user's cursor, keyboard focus, or Space. Works with any tool-capable model. Load this skill whenever the computer_use tool is... apple/macos-computer-use

autonomous-ai-agents

Skill Description Path
claude-code Delegate coding to Claude Code CLI (features, PRs). autonomous-ai-agents/claude-code
codex Delegate coding to OpenAI Codex CLI (features, PRs). autonomous-ai-agents/codex
hermes-agent Configure, extend, or contribute to Hermes Agent. autonomous-ai-agents/hermes-agent
opencode Delegate coding to OpenCode CLI (features, PR review). autonomous-ai-agents/opencode

creative

Skill Description Path
ascii-art ASCII art: pyfiglet, cowsay, boxes, image-to-ascii. creative/ascii-art
ascii-video ASCII video: convert video/audio to colored ASCII MP4/GIF. creative/ascii-video
baoyu-infographic Infographics: 21 layouts x 21 styles (信息图, 可视化). creative/baoyu-infographic
claude-design Design one-off HTML artifacts (landing, deck, prototype). creative/claude-design
comfyui Generate images, video, and audio with ComfyUI — install, launch, manage nodes/models, run workflows with parameter injection. Uses the official comfy-cli for lifecycle and direct REST/WebSocket API for execution. creative/comfyui
design-md Author/validate/export Google's DESIGN.md token spec files. creative/design-md
excalidraw Hand-drawn Excalidraw JSON diagrams (arch, flow, seq). creative/excalidraw
html-artifact Build self-contained HTML files to explain, plan, or review. creative/html-artifact
humanizer Humanize text: strip AI-isms and add real voice. creative/humanizer
manim-video Manim CE animations: 3Blue1Brown math/algo videos. creative/manim-video
p5js p5.js sketches: gen art, shaders, interactive, 3D. creative/p5js
popular-web-designs 54 real design systems (Stripe, Linear, Vercel) as HTML/CSS. creative/popular-web-designs
pretext Use when building creative browser demos with @chenglou/pretext — DOM-free text layout for ASCII art, typographic flow around obstacles, text-as-geometry games, kinetic typography, and text-powered generative art. Produces single-file HT... creative/pretext
songwriting-and-ai-music Songwriting craft and Suno AI music prompts. creative/songwriting-and-ai-music
touchdesigner-mcp Control a running TouchDesigner instance via twozero MCP — create operators, set parameters, wire connections, execute Python, build real-time visuals. 36 native tools. creative/touchdesigner-mcp

data-science

Skill Description Path
jupyter-live-kernel Iterative Python via live Jupyter kernel (hamelnb). data-science/jupyter-live-kernel

devops

Skill Description Path
kanban-orchestrator Decomposition playbook + anti-temptation rules for an orchestrator profile routing work through Kanban. The "don't do the work yourself" rule and the basic lifecycle are auto-injected into every kanban worker's system prompt; this skill... devops/kanban-orchestrator
kanban-worker Pitfalls, examples, and edge cases for Hermes Kanban workers. The lifecycle itself is auto-injected into every worker's system prompt as KANBAN_GUIDANCE (from agent/prompt_builder.py); this skill is what you load when you want deeper det... devops/kanban-worker

dogfood

Skill Description Path
dogfood Exploratory QA of web apps: find bugs, evidence, reports. dogfood

email

Skill Description Path
himalaya Himalaya CLI: IMAP/SMTP email from terminal. email/himalaya

github

Skill Description Path
codebase-inspection Inspect codebases w/ pygount: LOC, languages, ratios. github/codebase-inspection
github-auth GitHub auth setup: HTTPS tokens, SSH keys, gh CLI login. github/github-auth
github-code-review Review PRs: diffs, inline comments via gh or REST. github/github-code-review
github-issues Create, triage, label, assign GitHub issues via gh or REST. github/github-issues
github-pr-workflow GitHub PR lifecycle: branch, commit, open, CI, merge. github/github-pr-workflow
github-repo-management Clone/create/fork repos; manage remotes, releases. github/github-repo-management

media

Skill Description Path
gif-search Search/download GIFs from Tenor via curl + jq. media/gif-search
heartmula HeartMuLa: Suno-like song generation from lyrics + tags. media/heartmula
songsee Audio spectrograms/features (mel, chroma, MFCC) via CLI. media/songsee
youtube-content YouTube transcripts to summaries, threads, blogs. media/youtube-content

mlops

Skill Description Path
audiocraft-audio-generation AudioCraft: MusicGen text-to-music, AudioGen text-to-sound. mlops/models/audiocraft
huggingface-hub HuggingFace hf CLI: search/download/upload models, datasets. mlops/huggingface-hub
llama-cpp llama.cpp local GGUF inference + HF Hub model discovery. mlops/inference/llama-cpp
evaluating-llms-harness lm-eval-harness: benchmark LLMs (MMLU, GSM8K, etc.). mlops/evaluation/lm-evaluation-harness
segment-anything-model SAM: zero-shot image segmentation via points, boxes, masks. mlops/models/segment-anything
serving-llms-vllm vLLM: high-throughput LLM serving, OpenAI API, quantization. mlops/inference/vllm
weights-and-biases W&B: log ML experiments, sweeps, model registry, dashboards. mlops/evaluation/weights-and-biases

note-taking

Skill Description Path
obsidian Read, search, create, and edit notes in the Obsidian vault. note-taking/obsidian

productivity

Skill Description Path
airtable Airtable REST API via curl. Records CRUD, filters, upserts. productivity/airtable
google-workspace Gmail, Calendar, Drive, Docs, Sheets via gws CLI or Python. productivity/google-workspace
maps Geocode, POIs, routes, timezones via OpenStreetMap/OSRM. productivity/maps
nano-pdf Edit PDF text/typos/titles via nano-pdf CLI (NL prompts). productivity/nano-pdf
notion Notion API + ntn CLI: pages, databases, markdown, Workers. productivity/notion
ocr-and-documents Extract text from PDFs/scans (pymupdf, marker-pdf). productivity/ocr-and-documents
powerpoint Create, read, edit .pptx decks, slides, notes, templates. productivity/powerpoint
teams-meeting-pipeline Operate the Teams meeting summary pipeline via Hermes CLI — summarize meetings, inspect pipeline status, replay jobs, manage Microsoft Graph subscriptions. productivity/teams-meeting-pipeline

research

Skill Description Path
arxiv Search arXiv papers by keyword, author, category, or ID. research/arxiv
blogwatcher Monitor blogs and RSS/Atom feeds via blogwatcher-cli tool. research/blogwatcher
llm-wiki Karpathy's LLM Wiki: build/query interlinked markdown KB. research/llm-wiki
polymarket Query Polymarket: markets, prices, orderbooks, history. research/polymarket
research-paper-writing Write ML papers for NeurIPS/ICML/ICLR: design→submit. research/research-paper-writing

smart-home

Skill Description Path
openhue Control Philips Hue lights, scenes, rooms via OpenHue CLI. smart-home/openhue

social-media

Skill Description Path
xurl X/Twitter via xurl CLI: post, search, DM, media, v2 API. social-media/xurl

software-development

Skill Description Path
hermes-agent-skill-authoring Author in-repo SKILL.md: frontmatter, validator, structure. software-development/hermes-agent-skill-authoring
node-inspect-debugger Debug Node.js via --inspect + Chrome DevTools Protocol CLI. software-development/node-inspect-debugger
plan Plan mode: write an actionable markdown plan to .hermes/plans/, no execution. Bite-sized tasks, exact paths, complete code. software-development/plan
python-debugpy Debug Python: pdb REPL + debugpy remote (DAP). software-development/python-debugpy
requesting-code-review Pre-commit review: security scan, quality gates, auto-fix. software-development/requesting-code-review
simplify-code Parallel 3-agent cleanup of recent code changes. software-development/simplify-code
spike Throwaway experiments to validate an idea before build. software-development/spike
systematic-debugging 4-phase root cause debugging: understand bugs before fixing. software-development/systematic-debugging
test-driven-development TDD: enforce RED-GREEN-REFACTOR, tests before code. software-development/test-driven-development

yuanbao

Skill Description Path
yuanbao Yuanbao (元宝) groups: @mention users, query info/members. yuanbao