mirror of
https://github.com/NousResearch/hermes-agent.git
synced 2026-05-12 03:42:08 +00:00
Completes the Windows-gating coverage for the built-in skills/ tree. Every
bundled SKILL.md now carries an explicit platforms: declaration so the
loader (agent.skill_utils.skill_matches_platform) can skip-load skills
that don't fit the current OS.
74 skills declared cross-platform (platforms: [linux, macos, windows]):
Creative (16): ascii-art, ascii-video, architecture-diagram, baoyu-comic,
baoyu-infographic, claude-design, creative-ideation, design-md,
excalidraw, humanizer, manim-video, p5js, pixel-art,
popular-web-designs, pretext, sketch, songwriting-and-ai-music,
touchdesigner-mcp
Autonomous agents: claude-code, codex, hermes-agent, opencode
Data/devops: jupyter-live-kernel, kanban-orchestrator, kanban-worker,
webhook-subscriptions, dogfood, codebase-inspection
GitHub: github-auth, github-code-review, github-issues,
github-pr-workflow, github-repo-management
Media: gif-search, heartmula, songsee, spotify, youtube-content
MCP / email / gaming / notes / smart-home: native-mcp, himalaya,
pokemon-player, obsidian, openhue
mlops (non-broken): weights-and-biases, huggingface-hub, llama-cpp,
outlines, segment-anything-model, dspy, trl-fine-tuning
Productivity: airtable, google-workspace, linear, maps, nano-pdf,
notion, ocr-and-documents, powerpoint
Red-teaming / research: godmode, arxiv, blogwatcher, llm-wiki,
polymarket
Software-dev: debugging-hermes-tui-commands, hermes-agent-skill-authoring,
node-inspect-debugger, plan, requesting-code-review, spike,
subagent-driven-development, systematic-debugging,
test-driven-development, writing-plans
Misc: yuanbao
5 skills gated from Windows (platforms: [linux, macos]):
mlops/inference/vllm (serving-llms-vllm)
vLLM is officially Linux-only; Windows requires WSL.
mlops/training/axolotl
Axolotl's flash-attn + deepspeed + bitsandbytes stack is Linux-first.
mlops/training/unsloth
Requires Triton + xformers + flash-attn — Linux only in practice.
mlops/models/audiocraft (audiocraft-audio-generation)
torchaudio ffmpeg backend + encodec dependencies are Linux-first.
mlops/inference/obliteratus
Research abliteration workflow; relies on Linux-focused pytorch
kernels and MLX — no first-class Windows path.
Same strict-over-lenient policy as the optional-skills sweep: when the
underlying tool's Windows support is rough, missing, or WSL-only, gate the
skill. Easier to un-gate after verified Windows support lands than to leak
partial support that manifests as mid-task failures.
Combined with prior commits in this branch, every bundled SKILL.md
(skills/ + optional-skills/) now has a platforms: declaration.
161 lines
8.4 KiB
Markdown
161 lines
8.4 KiB
Markdown
---
|
|
name: kanban-worker
|
|
description: Pitfalls, examples, and edge cases for Hermes Kanban workers. The lifecycle itself is auto-injected into every worker's system prompt as KANBAN_GUIDANCE (from agent/prompt_builder.py); this skill is what you load when you want deeper detail on specific scenarios.
|
|
version: 2.0.0
|
|
platforms: [linux, macos, windows]
|
|
metadata:
|
|
hermes:
|
|
tags: [kanban, multi-agent, collaboration, workflow, pitfalls]
|
|
related_skills: [kanban-orchestrator]
|
|
---
|
|
|
|
# Kanban Worker — Pitfalls and Examples
|
|
|
|
> You're seeing this skill because the Hermes Kanban dispatcher spawned you as a worker with `--skills kanban-worker` — it's loaded automatically for every dispatched worker. The **lifecycle** (6 steps: orient → work → heartbeat → block/complete) also lives in the `KANBAN_GUIDANCE` block that's auto-injected into your system prompt. This skill is the deeper detail: good handoff shapes, retry diagnostics, edge cases.
|
|
|
|
## Workspace handling
|
|
|
|
Your workspace kind determines how you should behave inside `$HERMES_KANBAN_WORKSPACE`:
|
|
|
|
| Kind | What it is | How to work |
|
|
|---|---|---|
|
|
| `scratch` | Fresh tmp dir, yours alone | Read/write freely; it gets GC'd when the task is archived. |
|
|
| `dir:<path>` | Shared persistent directory | Other runs will read what you write. Treat it like long-lived state. Path is guaranteed absolute (the kernel rejects relative paths). |
|
|
| `worktree` | Git worktree at the resolved path | If `.git` doesn't exist, run `git worktree add <path> <branch>` from the main repo first, then cd and work normally. Commit work here. |
|
|
|
|
## Tenant isolation
|
|
|
|
If `$HERMES_TENANT` is set, the task belongs to a tenant namespace. When reading or writing persistent memory, prefix memory entries with the tenant so context doesn't leak across tenants:
|
|
|
|
- Good: `business-a: Acme is our biggest customer`
|
|
- Bad (leaks): `Acme is our biggest customer`
|
|
|
|
## Good summary + metadata shapes
|
|
|
|
The `kanban_complete(summary=..., metadata=...)` handoff is how downstream workers read what you did. Patterns that work:
|
|
|
|
**Coding task:**
|
|
```python
|
|
kanban_complete(
|
|
summary="shipped rate limiter — token bucket, keys on user_id with IP fallback, 14 tests pass",
|
|
metadata={
|
|
"changed_files": ["rate_limiter.py", "tests/test_rate_limiter.py"],
|
|
"tests_run": 14,
|
|
"tests_passed": 14,
|
|
"decisions": ["user_id primary, IP fallback for unauthenticated requests"],
|
|
},
|
|
)
|
|
```
|
|
|
|
**Research task:**
|
|
```python
|
|
kanban_complete(
|
|
summary="3 competing libraries reviewed; vLLM wins on throughput, SGLang on latency, Tensorrt-LLM on memory efficiency",
|
|
metadata={
|
|
"sources_read": 12,
|
|
"recommendation": "vLLM",
|
|
"benchmarks": {"vllm": 1.0, "sglang": 0.87, "trtllm": 0.72},
|
|
},
|
|
)
|
|
```
|
|
|
|
**Review task:**
|
|
```python
|
|
kanban_complete(
|
|
summary="reviewed PR #123; 2 blocking issues found (SQL injection in /search, missing CSRF on /settings)",
|
|
metadata={
|
|
"pr_number": 123,
|
|
"findings": [
|
|
{"severity": "critical", "file": "api/search.py", "line": 42, "issue": "raw SQL concat"},
|
|
{"severity": "high", "file": "api/settings.py", "issue": "missing CSRF middleware"},
|
|
],
|
|
"approved": False,
|
|
},
|
|
)
|
|
```
|
|
|
|
Shape `metadata` so downstream parsers (reviewers, aggregators, schedulers) can use it without re-reading your prose.
|
|
|
|
## Claiming cards you actually created
|
|
|
|
If your run produced new kanban tasks (via `kanban_create`), pass the ids in `created_cards` on `kanban_complete`. The kernel verifies each id exists and was created by your profile; any phantom id blocks the completion with an error listing what went wrong, and the rejected attempt is permanently recorded on the task's event log. **Only list ids you captured from a successful `kanban_create` return value — never invent ids from prose, never paste ids from earlier runs, never claim cards another worker created.**
|
|
|
|
```python
|
|
# GOOD — capture return values, then claim them.
|
|
c1 = kanban_create(title="remediate SQL injection", assignee="security-worker")
|
|
c2 = kanban_create(title="fix CSRF middleware", assignee="web-worker")
|
|
|
|
kanban_complete(
|
|
summary="Review done; spawned remediations for both findings.",
|
|
metadata={"pr_number": 123, "approved": False},
|
|
created_cards=[c1["task_id"], c2["task_id"]],
|
|
)
|
|
```
|
|
|
|
```python
|
|
# BAD — claiming ids you don't have captured return values for.
|
|
kanban_complete(
|
|
summary="Created remediation cards t_a1b2c3d4, t_deadbeef", # hallucinated
|
|
created_cards=["t_a1b2c3d4", "t_deadbeef"], # → gate rejects
|
|
)
|
|
```
|
|
|
|
If a `kanban_create` call fails (exception, tool_error), the card was NOT created — do not include a phantom id for it. Retry the create, or omit the id and mention the failure in your summary. The prose-scan pass also catches `t_<hex>` references in your free-form summary that don't resolve; these don't block the completion but show up as advisory warnings on the task in the dashboard.
|
|
|
|
## Block reasons that get answered fast
|
|
|
|
Bad: `"stuck"` — the human has no context.
|
|
|
|
Good: one sentence naming the specific decision you need. Leave longer context as a comment instead.
|
|
|
|
```python
|
|
kanban_comment(
|
|
task_id=os.environ["HERMES_KANBAN_TASK"],
|
|
body="Full context: I have user IPs from Cloudflare headers but some users are behind NATs with thousands of peers. Keying on IP alone causes false positives.",
|
|
)
|
|
kanban_block(reason="Rate limit key choice: IP (simple, NAT-unsafe) or user_id (requires auth, skips anonymous endpoints)?")
|
|
```
|
|
|
|
The block message is what appears in the dashboard / gateway notifier. The comment is the deeper context a human reads when they open the task.
|
|
|
|
## Heartbeats worth sending
|
|
|
|
Good heartbeats name progress: `"epoch 12/50, loss 0.31"`, `"scanned 1.2M/2.4M rows"`, `"uploaded 47/120 videos"`.
|
|
|
|
Bad heartbeats: `"still working"`, empty notes, sub-second intervals. Every few minutes max; skip entirely for tasks under ~2 minutes.
|
|
|
|
## Retry scenarios
|
|
|
|
If you open the task and `kanban_show` returns `runs: [...]` with one or more closed runs, you're a retry. The prior runs' `outcome` / `summary` / `error` tell you what didn't work. Don't repeat that path. Typical retry diagnostics:
|
|
|
|
- `outcome: "timed_out"` — the previous attempt hit `max_runtime_seconds`. You may need to chunk the work or shorten it.
|
|
- `outcome: "crashed"` — OOM or segfault. Reduce memory footprint.
|
|
- `outcome: "spawn_failed"` + `error: "..."` — usually a profile config issue (missing credential, bad PATH). Ask the human via `kanban_block` instead of retrying blindly.
|
|
- `outcome: "reclaimed"` + `summary: "task archived..."` — operator archived the task out from under the previous run; you probably shouldn't be running at all, check status carefully.
|
|
- `outcome: "blocked"` — a previous attempt blocked; the unblock comment should be in the thread by now.
|
|
|
|
## Do NOT
|
|
|
|
- Call `delegate_task` as a substitute for `kanban_create`. `delegate_task` is for short reasoning subtasks inside YOUR run; `kanban_create` is for cross-agent handoffs that outlive one API loop.
|
|
- Modify files outside `$HERMES_KANBAN_WORKSPACE` unless the task body says to.
|
|
- Create follow-up tasks assigned to yourself — assign to the right specialist.
|
|
- Complete a task you didn't actually finish. Block it instead.
|
|
|
|
## Pitfalls
|
|
|
|
**Task state can change between dispatch and your startup.** Between when the dispatcher claimed and when your process actually booted, the task may have been blocked, reassigned, or archived. Always `kanban_show` first. If it reports `blocked` or `archived`, stop — you shouldn't be running.
|
|
|
|
**Workspace may have stale artifacts.** Especially `dir:` and `worktree` workspaces can have files from previous runs. Read the comment thread — it usually explains why you're running again and what state the workspace is in.
|
|
|
|
**Don't rely on the CLI when the guidance is available.** The `kanban_*` tools work across all terminal backends (Docker, Modal, SSH). `hermes kanban <verb>` from your terminal tool will fail in containerized backends because the CLI isn't installed there. When in doubt, use the tool.
|
|
|
|
## CLI fallback (for scripting)
|
|
|
|
Every tool has a CLI equivalent for human operators and scripts:
|
|
- `kanban_show` ↔ `hermes kanban show <id> --json`
|
|
- `kanban_complete` ↔ `hermes kanban complete <id> --summary "..." --metadata '{...}'`
|
|
- `kanban_block` ↔ `hermes kanban block <id> "reason"`
|
|
- `kanban_create` ↔ `hermes kanban create "title" --assignee <profile> [--parent <id>]`
|
|
- etc.
|
|
|
|
Use the tools from inside an agent; the CLI exists for the human at the terminal.
|