hermes-agent/website/docs/reference/optional-skills-catalog.md
Teknium 9a57aa2b1f
fix(docs): unbreak docs-site-checks — ascii-guard diagram + MDX <1% (#12984)
* fix(docs): unbreak ascii-guard lint on github-pr-review-agent diagram

The intro diagram used 4 side-by-side boxes in one row. ascii-guard can't
parse that layout — it reads the whole thing as one 80-wide outer box and
flags the inner box borders at columns 17/39/60 as 'extra characters after
right border'. Per the ascii-guard-lint-fixing skill, the only fix is to
merge into a single outer box.

Rewritten as one 69-char outer box with four labeled regions separated by
arrows. Same semantic content, lint-clean.

Was blocking docs-site-checks CI as 'action_required' across multiple PRs
(see e.g. run 24661820677).

* fix(docs): backtick-wrap `<1%` to avoid MDX JSX parse error

Docusaurus MDX parses `<1%` as the start of a JSX tag, but `1` isn't a
valid tag-name start so compilation fails with 'Unexpected character `1`
(U+0031) before name'. Wrap in backticks so MDX treats it as literal code
text.

Found by running Build Docusaurus step on the PR that unblocked the
ascii-guard step; full docs tree scanned for other `<digit>` patterns
outside backticks/fences, only this one was unsafe.
2026-04-20 04:29:02 -07:00

12 KiB
Raw Blame History

sidebar_position title description
9 Optional Skills Catalog Official optional skills shipped with hermes-agent — install via hermes skills install official/<category>/<skill>

Optional Skills Catalog

Official optional skills ship with the hermes-agent repository under optional-skills/ but are not active by default. Install them explicitly:

hermes skills install official/<category>/<skill>

For example:

hermes skills install official/blockchain/solana
hermes skills install official/mlops/flash-attention

Once installed, the skill appears in the agent's skill list and can be loaded automatically when relevant tasks are detected.

To uninstall:

hermes skills uninstall <skill-name>

Autonomous AI Agents

Skill Description
blackbox Delegate coding tasks to Blackbox AI CLI agent. Multi-model agent with built-in judge that runs tasks through multiple LLMs and picks the best result.
honcho Configure and use Honcho memory with Hermes — cross-session user modeling, multi-profile peer isolation, observation config, and dialectic reasoning.

Blockchain

Skill Description
base Query Base (Ethereum L2) blockchain data with USD pricing — wallet balances, token info, transaction details, gas analysis, contract inspection, whale detection, and live network stats. No API key required.
solana Query Solana blockchain data with USD pricing — wallet balances, token portfolios, transaction details, NFTs, whale detection, and live network stats. No API key required.

Communication

Skill Description
one-three-one-rule Structured communication framework for proposals and decision-making.

Creative

Skill Description
blender-mcp Control Blender directly from Hermes via socket connection to the blender-mcp addon. Create 3D objects, materials, animations, and run arbitrary Blender Python (bpy) code.
concept-diagrams Generate flat, minimal light/dark-aware SVG diagrams as standalone HTML files, using a unified educational visual language (9 semantic color ramps, automatic dark mode). Best for physics setups, chemistry mechanisms, math curves, physical objects (aircraft, turbines, smartphones), floor plans, cross-sections, lifecycle/process narratives, and hub-spoke system diagrams. Ships with 15 example diagrams.
meme-generation Generate real meme images by picking a template and overlaying text with Pillow. Produces actual .png meme files.
touchdesigner-mcp Control a running TouchDesigner instance via the twozero MCP plugin — create operators, set parameters, wire connections, execute Python, build real-time audio-reactive visuals and GLSL networks. 36 native tools.

DevOps

Skill Description
cli Run 150+ AI apps via inference.sh CLI (infsh) — image generation, video creation, LLMs, search, 3D, and social automation.
docker-management Manage Docker containers, images, volumes, networks, and Compose stacks — lifecycle ops, debugging, cleanup, and Dockerfile optimization.

Email

Skill Description
agentmail Give the agent its own dedicated email inbox via AgentMail. Send, receive, and manage email autonomously using agent-owned email addresses.

Health

Skill Description
fitness-nutrition Gym workout planner and nutrition tracker. Search 690+ exercises by muscle, equipment, or category via wger. Look up macros and calories for 380,000+ foods via USDA FoodData Central. Computes BMI, TDEE, one-rep max, macro splits, and body fat — pure Python, no pip installs.
neuroskill-bci Brain-Computer Interface (BCI) integration for neuroscience research workflows.

MCP

Skill Description
fastmcp Build, test, inspect, install, and deploy MCP servers with FastMCP in Python. Covers wrapping APIs or databases as MCP tools, exposing resources or prompts, and deployment.
mcporter The mcporter CLI — list, configure, auth, and call MCP servers/tools directly (HTTP or stdio) from the terminal. Useful for ad-hoc MCP interactions; for always-on tool discovery use the built-in native-mcp client instead.

Migration

Skill Description
openclaw-migration Migrate a user's OpenClaw customization footprint into Hermes Agent. Imports memories, SOUL.md, command allowlists, user skills, and selected workspace assets.

MLOps

The largest optional category — covers the full ML pipeline from data curation to production inference.

Skill Description
accelerate Simplest distributed training API. 4 lines to add distributed support to any PyTorch script. Unified API for DeepSpeed/FSDP/Megatron/DDP.
chroma Open-source embedding database. Store embeddings and metadata, perform vector and full-text search. Simple 4-function API for RAG and semantic search.
clip OpenAI's vision-language model connecting images and text. Zero-shot image classification, image-text matching, and cross-modal retrieval. Trained on 400M image-text pairs. Use for image search, content moderation, or vision-language tasks without fine-tuning.
faiss Facebook's library for efficient similarity search and clustering of dense vectors. Supports billions of vectors, GPU acceleration, and various index types (Flat, IVF, HNSW).
flash-attention Optimize transformer attention with Flash Attention for 2-4x speedup and 10-20x memory reduction. Supports PyTorch SDPA, flash-attn library, H100 FP8, and sliding window.
guidance Control LLM output with regex and grammars, guarantee valid JSON/XML/code generation, enforce structured formats, and build multi-step workflows with Guidance — Microsoft Research's constrained generation framework.
hermes-atropos-environments Build, test, and debug Hermes Agent RL environments for Atropos training. Covers the HermesAgentBaseEnv interface, reward functions, agent loop integration, and evaluation.
huggingface-tokenizers Fast Rust-based tokenizers for research and production. Tokenizes 1GB in under 20 seconds. Supports BPE, WordPiece, and Unigram algorithms.
instructor Extract structured data from LLM responses with Pydantic validation, retry failed extractions automatically, and stream partial results.
lambda-labs Reserved and on-demand GPU cloud instances for ML training and inference. SSH access, persistent filesystems, and multi-node clusters.
llava Large Language and Vision Assistant — visual instruction tuning and image-based conversations combining CLIP vision with LLaMA language models.
modal Serverless GPU cloud platform for running ML workloads. On-demand GPU access without infrastructure management, ML model deployment as APIs, or batch jobs with automatic scaling.
nemo-curator GPU-accelerated data curation for LLM training. Fuzzy deduplication (16x faster), quality filtering (30+ heuristics), semantic dedup, PII redaction. Scales with RAPIDS.
peft-fine-tuning Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Train <1% of parameters with minimal accuracy loss for 7B70B models on limited GPU memory. HuggingFace's official PEFT library.
pinecone Managed vector database for production AI. Auto-scaling, hybrid search (dense + sparse), metadata filtering, and low latency (under 100ms p95).
pytorch-fsdp Expert guidance for Fully Sharded Data Parallel training with PyTorch FSDP — parameter sharding, mixed precision, CPU offloading, FSDP2.
pytorch-lightning High-level PyTorch framework with Trainer class, automatic distributed training (DDP/FSDP/DeepSpeed), callbacks, and minimal boilerplate.
qdrant High-performance vector similarity search engine. Rust-powered with fast nearest neighbor search, hybrid search with filtering, and scalable vector storage.
saelens Train and analyze Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features.
simpo Simple Preference Optimization — reference-free alternative to DPO with better performance (+6.4 pts on AlpacaEval 2.0). No reference model needed.
slime LLM post-training with RL using Megatron+SGLang framework. Custom data generation workflows and tight Megatron-LM integration for RL scaling.
stable-diffusion-image-generation State-of-the-art text-to-image generation with Stable Diffusion via HuggingFace Diffusers. Text-to-image, image-to-image translation, inpainting, and custom diffusion pipelines.
tensorrt-llm Optimize LLM inference with NVIDIA TensorRT for maximum throughput. 10-100x faster than PyTorch on A100/H100 with quantization (FP8/INT4) and in-flight batching.
torchtitan PyTorch-native distributed LLM pretraining with 4D parallelism (FSDP2, TP, PP, CP). Scale from 8 to 512+ GPUs with Float8 and torch.compile.
whisper OpenAI's general-purpose speech recognition. 99 languages, transcription, translation to English, and language ID. Six model sizes from tiny (39M) to large (1550M). Best for robust multilingual ASR.

Productivity

Skill Description
canvas Canvas LMS integration — fetch enrolled courses and assignments using API token authentication.
memento-flashcards Spaced repetition flashcard system for learning and knowledge retention.
siyuan SiYuan Note API for searching, reading, creating, and managing blocks and documents in a self-hosted knowledge base.
telephony Give Hermes phone capabilities — provision a Twilio number, send/receive SMS/MMS, make calls, and place AI-driven outbound calls through Bland.ai or Vapi.

Research

Skill Description
bioinformatics Gateway to 400+ bioinformatics skills from bioSkills and ClawBio. Covers genomics, transcriptomics, single-cell, variant calling, pharmacogenomics, metagenomics, and structural biology.
domain-intel Passive domain reconnaissance using Python stdlib. Subdomain discovery, SSL certificate inspection, WHOIS lookups, DNS records, and bulk multi-domain analysis. No API keys required.
duckduckgo-search Free web search via DuckDuckGo — text, news, images, videos. No API key needed.
gitnexus-explorer Index a codebase with GitNexus and serve an interactive knowledge graph via web UI and Cloudflare tunnel.
parallel-cli Vendor skill for Parallel CLI — agent-native web search, extraction, deep research, enrichment, and monitoring.
qmd Search personal knowledge bases, notes, docs, and meeting transcripts locally using qmd — a hybrid retrieval engine with BM25, vector search, and LLM reranking.
scrapling Web scraping with Scrapling — HTTP fetching, stealth browser automation, Cloudflare bypass, and spider crawling via CLI and Python.

Security

Skill Description
1password Set up and use 1Password CLI (op). Install the CLI, enable desktop app integration, sign in, and read/inject secrets for commands.
oss-forensics Open-source software forensics — analyze packages, dependencies, and supply chain risks.
sherlock OSINT username search across 400+ social networks. Hunt down social media accounts by username.

Contributing Optional Skills

To add a new optional skill to the repository:

  1. Create a directory under optional-skills/<category>/<skill-name>/
  2. Add a SKILL.md with standard frontmatter (name, description, version, author)
  3. Include any supporting files in references/, templates/, or scripts/ subdirectories
  4. Submit a pull request — the skill will appear in this catalog once merged