mirror of
https://github.com/NousResearch/hermes-agent.git
synced 2026-05-15 04:12:25 +00:00
Extends the Windows-gating work to the optional-skills/ tree. Every
SKILL.md that previously omitted the platforms: field now carries an
explicit declaration, which Hermes's loader
(agent.skill_utils.skill_matches_platform) honors by skipping the skill
on incompatible OSes.
58 skills declared cross-platform (platforms: [linux, macos, windows]):
autonomous-ai-agents/blackbox, autonomous-ai-agents/honcho
blockchain/base, blockchain/solana
communication/one-three-one-rule
creative/blender-mcp, creative/concept-diagrams, creative/hyperframes,
creative/kanban-video-orchestrator, creative/meme-generation
devops/cli (inference-sh-cli), devops/docker-management
dogfood/adversarial-ux-test
email/agentmail
finance/3-statement-model, finance/comps-analysis, finance/dcf-model,
finance/excel-author, finance/lbo-model, finance/merger-model,
finance/pptx-author
health/fitness-nutrition, health/neuroskill-bci
mcp/fastmcp, mcp/mcporter
migration/openclaw-migration
mlops/accelerate, mlops/chroma, mlops/clip, mlops/guidance,
mlops/hermes-atropos-environments, mlops/huggingface-tokenizers,
mlops/instructor, mlops/lambda-labs, mlops/llava, mlops/modal,
mlops/peft, mlops/pinecone, mlops/pytorch-lightning, mlops/qdrant,
mlops/saelens, mlops/simpo, mlops/stable-diffusion
productivity/canvas, productivity/shop-app, productivity/shopify,
productivity/siyuan, productivity/telephony
research/domain-intel, research/drug-discovery, research/duckduckgo-search,
research/gitnexus-explorer, research/parallel-cli, research/scrapling
security/1password, security/oss-forensics, security/sherlock
web-development/page-agent
5 skills gated from Windows (platforms: [linux, macos]):
mlops/flash-attention - Flash Attention wheels are Linux-first; Windows
install requires building from source with CUDA
mlops/faiss - faiss-gpu has no Windows wheel; gate rather than
leak partial (faiss-cpu) support
mlops/nemo-curator - NVIDIA NeMo ecosystem has no first-class Windows path
mlops/slime - Megatron+SGLang RL stack is Linux-only in practice
mlops/whisper - openai-whisper + ffmpeg setup on Windows is
non-trivial; gate until Windows install stanza lands
Methodology: scanned every SKILL.md for Windows-hostile signals
(apt-get, brew, systemd, osascript, ptrace, X11 binaries, POSIX-only
Python APIs, Docker POSIX $(pwd) bind-mounts, explicit 'linux-only' /
'macos-only' text). 3 skills were flagged with hard signals and reviewed
by hand: docker-management and qdrant only had POSIX $(pwd) docker
examples, and the tools themselves (Docker Desktop, Qdrant) run fine on
Windows, so both are declared cross-platform. whisper only had an
apt/brew ffmpeg install path, but the openai-whisper Windows install
story is rough enough to warrant gating.
Strict-over-lenient policy: when in doubt, gate. Easier to un-gate after
verified Windows support lands than to leak partial support that
manifests as mid-task failures for Windows users.
144 lines
5.3 KiB
Markdown
---
name: blackbox
description: Delegate coding tasks to Blackbox AI CLI agent. Multi-model agent with built-in judge that runs tasks through multiple LLMs and picks the best result. Requires the blackbox CLI and a Blackbox AI API key.
version: 1.0.0
author: Hermes Agent (Nous Research)
license: MIT
platforms: [linux, macos, windows]
metadata:
  hermes:
    tags: [Coding-Agent, Blackbox, Multi-Agent, Judge, Multi-Model]
    related_skills: [claude-code, codex, hermes-agent]
---

# Blackbox CLI

Delegate coding tasks to [Blackbox AI](https://www.blackbox.ai/) via the Hermes terminal. Blackbox is a multi-model coding agent CLI that dispatches tasks to multiple LLMs (Claude, Codex, Gemini, Blackbox Pro) and uses a judge to select the best implementation.

The CLI is [open-source](https://github.com/blackboxaicode/cli) (GPL-3.0, TypeScript, forked from Gemini CLI) and supports interactive sessions, non-interactive one-shots, checkpointing, MCP, and vision model switching.

## Prerequisites

- Node.js 20+ installed
- Blackbox CLI installed: `npm install -g @blackboxai/cli`
- Or install from source:
  ```
  git clone https://github.com/blackboxaicode/cli.git
  cd cli && npm install && npm install -g .
  ```
- API key from [app.blackbox.ai/dashboard](https://app.blackbox.ai/dashboard)
- Configured: run `blackbox configure` and enter your API key
- Use `pty=true` in terminal calls: Blackbox CLI is an interactive terminal app

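The first two prerequisites above can be checked before delegating anything. The following is a minimal sketch, not part of the skill itself: the helper names `parse_major` and `missing_prerequisites` are hypothetical, and it only checks that `node` and `blackbox` are on `PATH` and that Node.js reports major version 20 or later. It does not verify that an API key has been configured.

```python
import shutil
import subprocess


def parse_major(version_text):
    """Return the major version from text like 'v20.11.1', or None if unparseable."""
    head = version_text.strip().lstrip("v").split(".")[0]
    return int(head) if head.isdigit() else None


def missing_prerequisites(min_node_major=20):
    """Return a list of human-readable problems; an empty list means ready."""
    problems = []

    node = shutil.which("node")
    if node is None:
        problems.append("node not found on PATH")
    else:
        # `node --version` prints a string such as 'v20.11.1'.
        out = subprocess.run([node, "--version"], capture_output=True, text=True)
        major = parse_major(out.stdout)
        if major is None or major < min_node_major:
            found = out.stdout.strip() or "unknown"
            problems.append(f"Node.js {min_node_major}+ required, found {found}")

    # shutil.which also resolves npm's .cmd shims on Windows via PATHEXT.
    if shutil.which("blackbox") is None:
        problems.append("blackbox CLI not found on PATH")

    return problems


if __name__ == "__main__":
    for problem in missing_prerequisites():
        print(problem)
```

An empty result only means the binaries are present; a missing or invalid API key will still surface when `blackbox` first runs.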
## One-Shot Tasks

```
terminal(command="blackbox --prompt 'Add JWT authentication with refresh tokens to the Express API'", workdir="/path/to/project", pty=true)
```

For quick scratch work:
```
terminal(command="cd $(mktemp -d) && git init && blackbox --prompt 'Build a REST API for todos with SQLite'", pty=true)
```

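The examples above wrap the prompt in single quotes, which breaks as soon as a prompt contains an apostrophe. One way to build the command string safely is Python's standard `shlex.quote`. This is a sketch for POSIX shells only (it does not produce valid quoting for `cmd.exe` or PowerShell), and `blackbox_command` is a hypothetical helper name, not something Hermes or Blackbox provides.

```python
import shlex


def blackbox_command(prompt):
    """Build a POSIX-shell command line for a one-shot Blackbox run."""
    # shlex.quote escapes embedded single quotes and other shell metacharacters.
    return "blackbox --prompt " + shlex.quote(prompt)


if __name__ == "__main__":
    print(blackbox_command("Fix the user's login bug"))
```

The resulting string can then be passed as the `command` argument of a `terminal(...)` call in place of a hand-quoted prompt.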
## Background Mode (Long Tasks)

For tasks that take minutes, use background mode so you can monitor progress:

```
# Start in background with PTY
terminal(command="blackbox --prompt 'Refactor the auth module to use OAuth 2.0'", workdir="~/project", background=true, pty=true)
# Returns session_id

# Monitor progress
process(action="poll", session_id="<id>")
process(action="log", session_id="<id>")

# Send input if Blackbox asks a question
process(action="submit", session_id="<id>", data="yes")

# Kill if needed
process(action="kill", session_id="<id>")
```

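The poll-and-wait pattern above can be sketched as a loop with a growing interval and an overall timeout. Everything about the `poll` callable here is an assumption made for illustration: it stands in for `process(action="poll", ...)` and is assumed to return a dict whose `"status"` is `"running"` while the task is still active. The real return shape of the `process` tool is not documented in this skill.

```python
import time


def wait_for_session(poll, session_id, timeout_s=1800.0, interval_s=5.0,
                     max_interval_s=60.0, sleep=time.sleep, clock=time.monotonic):
    """Poll until the session stops running or the timeout elapses.

    `poll` is a hypothetical callable wrapping process(action="poll", ...).
    Returns the last poll result, or None if the timeout was reached.
    """
    deadline = clock() + timeout_s
    while clock() < deadline:
        result = poll(session_id)
        if result.get("status") != "running":
            return result
        sleep(interval_s)
        # Back off so a slow task is not polled more often than needed.
        interval_s = min(interval_s * 2, max_interval_s)
    return None
```

On timeout the sketch returns `None` rather than killing the session, which leaves the decision to stop a slow task with the caller.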
## Checkpoints & Resume

Blackbox CLI has built-in checkpoint support for pausing and resuming tasks:

```
# After a task completes, Blackbox shows a checkpoint tag
# Resume with a follow-up task:
terminal(command="blackbox --resume-checkpoint 'task-abc123-2026-03-06' --prompt 'Now add rate limiting to the endpoints'", workdir="~/project", pty=true)
```

## Session Commands

During an interactive session, use these commands:

| Command | Effect |
|---------|--------|
| `/compress` | Shrink conversation history to save tokens |
| `/clear` | Wipe history and start fresh |
| `/stats` | View current token usage |
| `Ctrl+C` | Cancel current operation |

## PR Reviews

Clone to a temp directory to avoid modifying the working tree:

```
terminal(command="REVIEW=$(mktemp -d) && git clone https://github.com/user/repo.git $REVIEW && cd $REVIEW && gh pr checkout 42 && blackbox --prompt 'Review this PR against main. Check for bugs, security issues, and code quality.'", pty=true)
```

## Parallel Work

Spawn multiple Blackbox instances for independent tasks:

```
terminal(command="blackbox --prompt 'Fix the login bug'", workdir="/tmp/issue-1", background=true, pty=true)
terminal(command="blackbox --prompt 'Add unit tests for auth'", workdir="/tmp/issue-2", background=true, pty=true)

# Monitor all
process(action="list")
```

## Multi-Model Mode

Blackbox's unique feature is running the same task through multiple models and judging the results. Configure which models to use via `blackbox configure`: select multiple providers to enable the Chairman/judge workflow, in which the CLI evaluates outputs from different models and picks the best one.

## Key Flags

| Flag or command | Effect |
|------|--------|
| `--prompt "task"` | Non-interactive one-shot execution |
| `--resume-checkpoint "tag"` | Resume from a saved checkpoint |
| `--yolo` | Auto-approve all actions and model switches |
| `blackbox session` | Start interactive chat session |
| `blackbox configure` | Change settings, providers, models |
| `blackbox info` | Display system information |

## Vision Support

Blackbox automatically detects images in input and can switch to multimodal analysis. VLM modes:
- `"once"`: Switch model for current query only
- `"session"`: Switch for entire session
- `"persist"`: Stay on current model (no switch)

## Token Limits

Control token usage via `.blackboxcli/settings.json`:
```json
{
  "sessionTokenLimit": 32000
}
```

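If the settings file already holds other options, overwriting it with the snippet above would discard them. The sketch below updates only `sessionTokenLimit` and keeps any existing keys. It assumes the file is plain JSON without comments and that `.blackboxcli/` lives inside the project directory you pass in; `set_session_token_limit` is a hypothetical helper, not part of the Blackbox CLI.

```python
import json
from pathlib import Path


def set_session_token_limit(project_dir, limit):
    """Set sessionTokenLimit in <project_dir>/.blackboxcli/settings.json, keeping other keys."""
    path = Path(project_dir) / ".blackboxcli" / "settings.json"
    # Start from the existing settings when the file is already there.
    settings = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    settings["sessionTokenLimit"] = int(limit)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(settings, indent=2) + "\n", encoding="utf-8")
    return path
```

Calling `set_session_token_limit(".", 32000)` from the project root would produce the same file as the JSON example when no settings exist yet.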
## Rules

1. **Always use `pty=true`**: Blackbox CLI is an interactive terminal app and will hang without a PTY
2. **Use `workdir`**: keep the agent focused on the right directory
3. **Background for long tasks**: use `background=true` and monitor with the `process` tool
4. **Don't interfere**: monitor with `poll`/`log`, and don't kill sessions just because they're slow
5. **Report results**: after completion, check what changed and summarize for the user
6. **Credits cost money**: Blackbox uses a credit-based system; multi-model mode consumes credits faster
7. **Check prerequisites**: verify the `blackbox` CLI is installed before attempting delegation