mirror of
https://github.com/NousResearch/hermes-agent.git
synced 2026-06-21 10:22:18 +00:00
* feat(skills): add html-artifact skill, fold in sketch + architecture-diagram + concept-diagrams
Adds a unified `html-artifact` creative skill that produces self-contained,
single-file HTML artifacts — concept explainers, implementation plans,
status/incident reports, code-review walkthroughs, technical + educational
SVG diagrams, multi-variant design comparisons, and throwaway editors that
export their state back to the clipboard. Grounded in Anthropic's
html-effectiveness gallery (MIT); the house style (token block, serif/sans/
mono split, hand-rolled diffs, inline-SVG diagrams, graceful degradation) is
distilled from reading all 20 reference files.
Supersedes and removes three overlapping skills, folding their unique value in:
- sketch -> the fidelity dial (throwaway vs presentation) + the
multi-variant comparison layouts + the browser-vision
verify loop (references/fidelity-and-verify.md)
- architecture-diagram-> the dark "infra" token variant + double-rect masking +
semantic component palette (references/dark-tech.md,
templates/diagram.html infra mode)
- concept-diagrams -> the 9-ramp educational color system + the concept
archetype library (references/concept-archetypes.md,
the light design system in templates/diagram.html)
Structure:
- SKILL.md (description exactly 60 chars), 6 references, 3 templates
- templates verified by headless-Chrome render + vision inspection
- editor export logic (file://-safe clipboard, Promise-normalized) verified in node
Cross-references updated in claude-design (new disambiguation table row drawing
the design-taste vs information-artifact boundary), design-md, pretext, spike,
and kanban-video-orchestrator. Website skill docs + catalogs regenerated;
stale EN/zh-Hans per-skill pages pruned and i18n cross-refs fixed.
Not folded (intentionally orthogonal): excalidraw (.excalidraw JSON), p5js
(generative canvas), claude-design / popular-web-designs / design-md (visual
design taste / brand vocab / token spec).
* feat(skills): ship html-effectiveness gallery as fetched reference examples
Add scripts/fetch-examples.sh (idempotent clone/pull of Anthropic's MIT
html-effectiveness gallery) + references/examples.md mapping each of the 20
example files to a mode so the agent reads the right worked example. The clone
lands in references/examples/ and is gitignored (it's a 384KB upstream repo,
not vendored). SKILL.md workflow + reference list now point at it; falls back to
the distilled pattern references when offline.
* feat(skills): make reading a gallery example a required authoring step
Reading the matching html-effectiveness example is now workflow step 2 (was an
optional aside in step 3): fetch the gallery, read_file the file for your mode,
mirror its structure. Models skip optional steps; the examples are the ground
truth, so consulting one is mandatory. Added an 'Example' column to the
mode->build quick-reference table and a 'don't skip the example' pitfall.
Also dogfooded the skill: read 03-code-review-pr.html and 13-flowchart-diagram.html
raw and reconciled the distilled references against source — aligned diff-row tint
opacity to the source's 0.15 (was 0.18) and added the .ctx/.hunk rows in
house-style.md + base.html so they match 03-code-review-pr.html verbatim.
* docs(skills): explain the consolidation + bundled-vs-optional rationale
The supersession note only stated *what* was folded, not *why* the prune is
sound. Expand SKILL.md's intro into a 'Why this skill exists' section: the three
former skills emitted the same artifact and overlapped, so consolidating removes
which-one-do-I-load ambiguity; and the optional->bundled promotion of
concept-diagrams is footprint-safe because this skill has zero deps (only cost is
the 60-char description; everything else is progressive-disclosure). States the
bundling dividing line explicitly: zero install cost + broadly useful gets
bundled, real install cost (hyperframes: Node+FFmpeg+Chromium) stays optional.
Regenerated website per-skill page to match.
16 KiB
16 KiB
| sidebar_position | title | description |
|---|---|---|
| 5 | Bundled Skills Catalog | Catalog of bundled skills that ship with Hermes Agent |
Bundled Skills Catalog
Hermes ships with a large built-in skill library copied into ~/.hermes/skills/ on install. Each skill below links to a dedicated page with its full definition, setup, and usage.
Hermes also syncs bundled skills on hermes update, but the sync manifest respects local deletions and user edits. If a skill listed here is missing from your profile's ~/.hermes/skills/ tree, it is still shipped with Hermes; restore it with hermes skills reset <name> --restore.
If a skill is missing from this list but present in the repo, the catalog is regenerated by website/scripts/generate-skill-docs.py.
apple
| Skill | Description | Path |
|---|---|---|
apple-notes |
Manage Apple Notes via memo CLI: create, search, edit. | apple/apple-notes |
apple-reminders |
Apple Reminders via remindctl: add, list, complete. | apple/apple-reminders |
findmy |
Track Apple devices/AirTags via FindMy.app on macOS. | apple/findmy |
imessage |
Send and receive iMessages/SMS via the imsg CLI on macOS. | apple/imessage |
macos-computer-use |
Drive the macOS desktop in the background — screenshots, mouse, keyboard, scroll, drag — without stealing the user's cursor, keyboard focus, or Space. Works with any tool-capable model. Load this skill whenever the computer_use tool is... |
apple/macos-computer-use |
autonomous-ai-agents
| Skill | Description | Path |
|---|---|---|
claude-code |
Delegate coding to Claude Code CLI (features, PRs). | autonomous-ai-agents/claude-code |
codex |
Delegate coding to OpenAI Codex CLI (features, PRs). | autonomous-ai-agents/codex |
hermes-agent |
Configure, extend, or contribute to Hermes Agent. | autonomous-ai-agents/hermes-agent |
opencode |
Delegate coding to OpenCode CLI (features, PR review). | autonomous-ai-agents/opencode |
creative
| Skill | Description | Path |
|---|---|---|
ascii-art |
ASCII art: pyfiglet, cowsay, boxes, image-to-ascii. | creative/ascii-art |
ascii-video |
ASCII video: convert video/audio to colored ASCII MP4/GIF. | creative/ascii-video |
baoyu-infographic |
Infographics: 21 layouts x 21 styles (信息图, 可视化). | creative/baoyu-infographic |
claude-design |
Design one-off HTML artifacts (landing, deck, prototype). | creative/claude-design |
comfyui |
Generate images, video, and audio with ComfyUI — install, launch, manage nodes/models, run workflows with parameter injection. Uses the official comfy-cli for lifecycle and direct REST/WebSocket API for execution. | creative/comfyui |
design-md |
Author/validate/export Google's DESIGN.md token spec files. | creative/design-md |
excalidraw |
Hand-drawn Excalidraw JSON diagrams (arch, flow, seq). | creative/excalidraw |
html-artifact |
Build self-contained HTML files to explain, plan, or review. | creative/html-artifact |
humanizer |
Humanize text: strip AI-isms and add real voice. | creative/humanizer |
manim-video |
Manim CE animations: 3Blue1Brown math/algo videos. | creative/manim-video |
p5js |
p5.js sketches: gen art, shaders, interactive, 3D. | creative/p5js |
popular-web-designs |
54 real design systems (Stripe, Linear, Vercel) as HTML/CSS. | creative/popular-web-designs |
pretext |
Use when building creative browser demos with @chenglou/pretext — DOM-free text layout for ASCII art, typographic flow around obstacles, text-as-geometry games, kinetic typography, and text-powered generative art. Produces single-file HT... | creative/pretext |
songwriting-and-ai-music |
Songwriting craft and Suno AI music prompts. | creative/songwriting-and-ai-music |
touchdesigner-mcp |
Control a running TouchDesigner instance via twozero MCP — create operators, set parameters, wire connections, execute Python, build real-time visuals. 36 native tools. | creative/touchdesigner-mcp |
data-science
| Skill | Description | Path |
|---|---|---|
jupyter-live-kernel |
Iterative Python via live Jupyter kernel (hamelnb). | data-science/jupyter-live-kernel |
devops
| Skill | Description | Path |
|---|---|---|
kanban-orchestrator |
Decomposition playbook + anti-temptation rules for an orchestrator profile routing work through Kanban. The "don't do the work yourself" rule and the basic lifecycle are auto-injected into every kanban worker's system prompt; this skill... | devops/kanban-orchestrator |
kanban-worker |
Pitfalls, examples, and edge cases for Hermes Kanban workers. The lifecycle itself is auto-injected into every worker's system prompt as KANBAN_GUIDANCE (from agent/prompt_builder.py); this skill is what you load when you want deeper det... | devops/kanban-worker |
dogfood
| Skill | Description | Path |
|---|---|---|
dogfood |
Exploratory QA of web apps: find bugs, evidence, reports. | dogfood |
| Skill | Description | Path |
|---|---|---|
himalaya |
Himalaya CLI: IMAP/SMTP email from terminal. | email/himalaya |
github
| Skill | Description | Path |
|---|---|---|
codebase-inspection |
Inspect codebases w/ pygount: LOC, languages, ratios. | github/codebase-inspection |
github-auth |
GitHub auth setup: HTTPS tokens, SSH keys, gh CLI login. | github/github-auth |
github-code-review |
Review PRs: diffs, inline comments via gh or REST. | github/github-code-review |
github-issues |
Create, triage, label, assign GitHub issues via gh or REST. | github/github-issues |
github-pr-workflow |
GitHub PR lifecycle: branch, commit, open, CI, merge. | github/github-pr-workflow |
github-repo-management |
Clone/create/fork repos; manage remotes, releases. | github/github-repo-management |
media
| Skill | Description | Path |
|---|---|---|
gif-search |
Search/download GIFs from Tenor via curl + jq. | media/gif-search |
heartmula |
HeartMuLa: Suno-like song generation from lyrics + tags. | media/heartmula |
songsee |
Audio spectrograms/features (mel, chroma, MFCC) via CLI. | media/songsee |
youtube-content |
YouTube transcripts to summaries, threads, blogs. | media/youtube-content |
mlops
| Skill | Description | Path |
|---|---|---|
audiocraft-audio-generation |
AudioCraft: MusicGen text-to-music, AudioGen text-to-sound. | mlops/models/audiocraft |
huggingface-hub |
HuggingFace hf CLI: search/download/upload models, datasets. | mlops/huggingface-hub |
llama-cpp |
llama.cpp local GGUF inference + HF Hub model discovery. | mlops/inference/llama-cpp |
evaluating-llms-harness |
lm-eval-harness: benchmark LLMs (MMLU, GSM8K, etc.). | mlops/evaluation/lm-evaluation-harness |
segment-anything-model |
SAM: zero-shot image segmentation via points, boxes, masks. | mlops/models/segment-anything |
serving-llms-vllm |
vLLM: high-throughput LLM serving, OpenAI API, quantization. | mlops/inference/vllm |
weights-and-biases |
W&B: log ML experiments, sweeps, model registry, dashboards. | mlops/evaluation/weights-and-biases |
note-taking
| Skill | Description | Path |
|---|---|---|
obsidian |
Read, search, create, and edit notes in the Obsidian vault. | note-taking/obsidian |
productivity
| Skill | Description | Path |
|---|---|---|
airtable |
Airtable REST API via curl. Records CRUD, filters, upserts. | productivity/airtable |
google-workspace |
Gmail, Calendar, Drive, Docs, Sheets via gws CLI or Python. | productivity/google-workspace |
maps |
Geocode, POIs, routes, timezones via OpenStreetMap/OSRM. | productivity/maps |
nano-pdf |
Edit PDF text/typos/titles via nano-pdf CLI (NL prompts). | productivity/nano-pdf |
notion |
Notion API + ntn CLI: pages, databases, markdown, Workers. | productivity/notion |
ocr-and-documents |
Extract text from PDFs/scans (pymupdf, marker-pdf). | productivity/ocr-and-documents |
powerpoint |
Create, read, edit .pptx decks, slides, notes, templates. | productivity/powerpoint |
teams-meeting-pipeline |
Operate the Teams meeting summary pipeline via Hermes CLI — summarize meetings, inspect pipeline status, replay jobs, manage Microsoft Graph subscriptions. | productivity/teams-meeting-pipeline |
research
| Skill | Description | Path |
|---|---|---|
arxiv |
Search arXiv papers by keyword, author, category, or ID. | research/arxiv |
blogwatcher |
Monitor blogs and RSS/Atom feeds via blogwatcher-cli tool. | research/blogwatcher |
llm-wiki |
Karpathy's LLM Wiki: build/query interlinked markdown KB. | research/llm-wiki |
polymarket |
Query Polymarket: markets, prices, orderbooks, history. | research/polymarket |
research-paper-writing |
Write ML papers for NeurIPS/ICML/ICLR: design→submit. | research/research-paper-writing |
smart-home
| Skill | Description | Path |
|---|---|---|
openhue |
Control Philips Hue lights, scenes, rooms via OpenHue CLI. | smart-home/openhue |
social-media
| Skill | Description | Path |
|---|---|---|
xurl |
X/Twitter via xurl CLI: post, search, DM, media, v2 API. | social-media/xurl |
software-development
| Skill | Description | Path |
|---|---|---|
hermes-agent-skill-authoring |
Author in-repo SKILL.md: frontmatter, validator, structure. | software-development/hermes-agent-skill-authoring |
node-inspect-debugger |
Debug Node.js via --inspect + Chrome DevTools Protocol CLI. | software-development/node-inspect-debugger |
plan |
Plan mode: write an actionable markdown plan to .hermes/plans/, no execution. Bite-sized tasks, exact paths, complete code. | software-development/plan |
python-debugpy |
Debug Python: pdb REPL + debugpy remote (DAP). | software-development/python-debugpy |
requesting-code-review |
Pre-commit review: security scan, quality gates, auto-fix. | software-development/requesting-code-review |
simplify-code |
Parallel 3-agent cleanup of recent code changes. | software-development/simplify-code |
spike |
Throwaway experiments to validate an idea before build. | software-development/spike |
systematic-debugging |
4-phase root cause debugging: understand bugs before fixing. | software-development/systematic-debugging |
test-driven-development |
TDD: enforce RED-GREEN-REFACTOR, tests before code. | software-development/test-driven-development |
yuanbao
| Skill | Description | Path |
|---|---|---|
yuanbao |
Yuanbao (元宝) groups: @mention users, query info/members. | yuanbao |