hermes-agent/optional-skills/autonomous-ai-agents/blackbox/SKILL.md
Teknium db22efbe88 feat(optional-skills): declare platforms frontmatter for all 63 undeclared skills
Extends the Windows-gating work to the optional-skills/ tree. Every
SKILL.md that previously omitted the platforms: field now carries an
explicit declaration, which Hermes's loader (agent.skill_utils.
skill_matches_platform) honors to skip-load on incompatible OSes.

58 skills declared cross-platform (platforms: [linux, macos, windows]):
  autonomous-ai-agents/blackbox, autonomous-ai-agents/honcho
  blockchain/base, blockchain/solana
  communication/one-three-one-rule
  creative/blender-mcp, creative/concept-diagrams, creative/hyperframes,
  creative/kanban-video-orchestrator, creative/meme-generation
  devops/cli (inference-sh-cli), devops/docker-management
  dogfood/adversarial-ux-test
  email/agentmail
  finance/3-statement-model, finance/comps-analysis, finance/dcf-model,
  finance/excel-author, finance/lbo-model, finance/merger-model,
  finance/pptx-author
  health/fitness-nutrition, health/neuroskill-bci
  mcp/fastmcp, mcp/mcporter
  migration/openclaw-migration
  mlops/accelerate, mlops/chroma, mlops/clip, mlops/guidance,
  mlops/hermes-atropos-environments, mlops/huggingface-tokenizers,
  mlops/instructor, mlops/lambda-labs, mlops/llava, mlops/modal,
  mlops/peft, mlops/pinecone, mlops/pytorch-lightning, mlops/qdrant,
  mlops/saelens, mlops/simpo, mlops/stable-diffusion
  productivity/canvas, productivity/shop-app, productivity/shopify,
  productivity/siyuan, productivity/telephony
  research/domain-intel, research/drug-discovery, research/duckduckgo-search,
  research/gitnexus-explorer, research/parallel-cli, research/scrapling
  security/1password, security/oss-forensics, security/sherlock
  web-development/page-agent

5 skills gated from Windows (platforms: [linux, macos]):
  mlops/flash-attention   - Flash Attention wheels are Linux-first; Windows
                            install requires building from source with CUDA
  mlops/faiss             - faiss-gpu has no Windows wheel; gate rather than
                            leak partial (faiss-cpu) support
  mlops/nemo-curator      - NVIDIA NeMo ecosystem has no first-class Windows path
  mlops/slime             - Megatron+SGLang RL stack is Linux-only in practice
  mlops/whisper           - openai-whisper + ffmpeg setup on Windows is
                            non-trivial; gate until Windows install stanza lands

Methodology: scanned every SKILL.md for Windows-hostile signals
(apt-get, brew, systemd, osascript, ptrace, X11 binaries, POSIX-only
Python APIs, Docker POSIX $(pwd) bind-mounts, explicit 'linux-only' /
'macos-only' text). 3 skills flagged as having hard signals on review:
docker-management and qdrant only had POSIX $(pwd) docker examples and
the tools themselves (Docker Desktop, Qdrant) run fine on Windows —
declared ALL. whisper had an apt/brew ffmpeg install path and nothing
else but the openai-whisper Windows install story is rough enough to
warrant gating.

Strict-over-lenient policy: when in doubt, gate. Easier to un-gate after
verified Windows support lands than to leak partial support that
manifests as mid-task failures for Windows users.
2026-05-08 14:27:40 -07:00

5.3 KiB

name description version author license platforms metadata
blackbox Delegate coding tasks to Blackbox AI CLI agent. Multi-model agent with built-in judge that runs tasks through multiple LLMs and picks the best result. Requires the blackbox CLI and a Blackbox AI API key. 1.0.0 Hermes Agent (Nous Research) MIT
linux
macos
windows
hermes
tags related_skills
Coding-Agent
Blackbox
Multi-Agent
Judge
Multi-Model
claude-code
codex
hermes-agent

Blackbox CLI

Delegate coding tasks to Blackbox AI via the Hermes terminal. Blackbox is a multi-model coding agent CLI that dispatches tasks to multiple LLMs (Claude, Codex, Gemini, Blackbox Pro) and uses a judge to select the best implementation.

The CLI is open-source (GPL-3.0, TypeScript, forked from Gemini CLI) and supports interactive sessions, non-interactive one-shots, checkpointing, MCP, and vision model switching.

Prerequisites

  • Node.js 20+ installed
  • Blackbox CLI installed: npm install -g @blackboxai/cli
  • Or install from source:
    git clone https://github.com/blackboxaicode/cli.git
    cd cli && npm install && npm install -g .
    
  • API key from app.blackbox.ai/dashboard
  • Configured: run blackbox configure and enter your API key
  • Use pty=true in terminal calls — Blackbox CLI is an interactive terminal app

One-Shot Tasks

terminal(command="blackbox --prompt 'Add JWT authentication with refresh tokens to the Express API'", workdir="/path/to/project", pty=true)

For quick scratch work:

terminal(command="cd $(mktemp -d) && git init && blackbox --prompt 'Build a REST API for todos with SQLite'", pty=true)

Background Mode (Long Tasks)

For tasks that take minutes, use background mode so you can monitor progress:

# Start in background with PTY
terminal(command="blackbox --prompt 'Refactor the auth module to use OAuth 2.0'", workdir="~/project", background=true, pty=true)
# Returns session_id

# Monitor progress
process(action="poll", session_id="<id>")
process(action="log", session_id="<id>")

# Send input if Blackbox asks a question
process(action="submit", session_id="<id>", data="yes")

# Kill if needed
process(action="kill", session_id="<id>")

Checkpoints & Resume

Blackbox CLI has built-in checkpoint support for pausing and resuming tasks:

# After a task completes, Blackbox shows a checkpoint tag
# Resume with a follow-up task:
terminal(command="blackbox --resume-checkpoint 'task-abc123-2026-03-06' --prompt 'Now add rate limiting to the endpoints'", workdir="~/project", pty=true)

Session Commands

During an interactive session, use these commands:

Command Effect
/compress Shrink conversation history to save tokens
/clear Wipe history and start fresh
/stats View current token usage
Ctrl+C Cancel current operation

PR Reviews

Clone to a temp directory to avoid modifying the working tree:

terminal(command="REVIEW=$(mktemp -d) && git clone https://github.com/user/repo.git $REVIEW && cd $REVIEW && gh pr checkout 42 && blackbox --prompt 'Review this PR against main. Check for bugs, security issues, and code quality.'", pty=true)

Parallel Work

Spawn multiple Blackbox instances for independent tasks:

terminal(command="blackbox --prompt 'Fix the login bug'", workdir="/tmp/issue-1", background=true, pty=true)
terminal(command="blackbox --prompt 'Add unit tests for auth'", workdir="/tmp/issue-2", background=true, pty=true)

# Monitor all
process(action="list")

Multi-Model Mode

Blackbox's unique feature is running the same task through multiple models and judging the results. Configure which models to use via blackbox configure — select multiple providers to enable the Chairman/judge workflow where the CLI evaluates outputs from different models and picks the best one.

Key Flags

Flag Effect
--prompt "task" Non-interactive one-shot execution
--resume-checkpoint "tag" Resume from a saved checkpoint
--yolo Auto-approve all actions and model switches
blackbox session Start interactive chat session
blackbox configure Change settings, providers, models
blackbox info Display system information

Vision Support

Blackbox automatically detects images in input and can switch to multimodal analysis. VLM modes:

  • "once" — Switch model for current query only
  • "session" — Switch for entire session
  • "persist" — Stay on current model (no switch)

Token Limits

Control token usage via .blackboxcli/settings.json:

{
  "sessionTokenLimit": 32000
}

Rules

  1. Always use pty=true — Blackbox CLI is an interactive terminal app and will hang without a PTY
  2. Use workdir — keep the agent focused on the right directory
  3. Background for long tasks — use background=true and monitor with process tool
  4. Don't interfere — monitor with poll/log, don't kill sessions because they're slow
  5. Report results — after completion, check what changed and summarize for the user
  6. Credits cost money — Blackbox uses a credit-based system; multi-model mode consumes credits faster
  7. Check prerequisites — verify blackbox CLI is installed before attempting delegation