hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-07-25 17:18:11 +00:00

History

Teknium fdc90346ea chore(skills): move red-team skills (godmode, obliteratus) to optional-skills — Anthropic classifier (#43221 ) * chore(skills): remove red-team skills (godmode, obliteratus) from bundled catalog Anthropic's output classifier on claude-fable-5 (and likely other Claude models served through it) intermittently returns empty content for sessions whose system prompt advertises these skills. The bundled skills-catalog block is injected into every session's system prompt, so the descriptions - red-teaming/godmode 'Jailbreak LLMs: Parseltongue, GODMODE, ULTRAPLINIAN' - mlops/inference/obliteratus 'OBLITERATUS: abliterate LLM refusals (diff-in-means)' trip the classifier on EVERY session regardless of which skill is actually loaded, killing unrelated legitimate work (PR review, codebase audits, etc.). Measured impact (controlled, interleaved A/B, claude-fable-5 via OpenRouter, prompts differing only by the ~204 chars of these catalog lines, N=20 each): catalog lines present -> 19/20 (95%) blocked catalog lines absent -> 5/20 (25%) blocked Removing them ~quartered the block rate. Rewording the descriptions was not enough; the skills must leave the bundled catalog. - Delete skills/red-teaming/godmode and skills/mlops/inference/obliteratus - Drop their generated doc pages + catalog/sidebar entries (EN + zh-Hans) - Drop the godmode hand-written-page exception in generate-skill-docs.py * chore(skills): relocate godmode + obliteratus to optional-skills Rather than deleting outright, move both into optional-skills/ so they remain installable via `hermes skills install` while leaving the always-injected bundled catalog (which is what tripped Anthropic's classifier). - optional-skills/security/godmode (was skills/red-teaming/godmode) - optional-skills/mlops/obliteratus (was skills/mlops/inference/obliteratus) - regenerate optional-skills catalog + sidebar entries		2026-06-09 21:41:00 -07:00
..
apple	docs(skills): clarify Reminders alarm timing	2026-05-29 04:01:01 -07:00
autonomous-ai-agents	docs(codex): document --sandbox danger-full-access for gateway bubblewrap failures (#40619 )	2026-06-07 18:36:18 -07:00
creative	refactor(skills): clean up bundled skill set + add environments: relevance gate (#39028 )	2026-06-04 06:11:22 -07:00
data-science	feat(skills): declare platforms frontmatter for all 79 undeclared built-in skills	2026-05-08 14:27:40 -07:00
devops	refactor(skills): clean up bundled skill set + add environments: relevance gate (#39028 )	2026-06-04 06:11:22 -07:00
dogfood	feat(skills): declare platforms frontmatter for all 79 undeclared built-in skills	2026-05-08 14:27:40 -07:00
email	docs(email): clarify gateway vs Himalaya setup	2026-05-28 05:42:09 -07:00
github	feat(skills): declare platforms frontmatter for all 79 undeclared built-in skills	2026-05-08 14:27:40 -07:00
index-cache	Release set of skills	2026-02-25 05:21:17 -08:00
media	refactor(skills): clean up bundled skill set + add environments: relevance gate (#39028 )	2026-06-04 06:11:22 -07:00
mlops	chore(skills): move red-team skills (godmode, obliteratus) to optional-skills — Anthropic classifier (#43221 )	2026-06-09 21:41:00 -07:00
note-taking	feat(skills): declare platforms frontmatter for all 79 undeclared built-in skills	2026-05-08 14:27:40 -07:00
productivity	fix(google-workspace): fall back to uv when venv has no pip (#39516 )	2026-06-05 13:30:02 +10:00
research	docs: fix separate typo; hyphenate built-in trust wording	2026-05-29 12:06:22 -07:00
smart-home	feat(skills): declare platforms frontmatter for all 79 undeclared built-in skills	2026-05-08 14:27:40 -07:00
social-media	fix(skills): document xurl X Article ingestion	2026-06-03 15:11:57 -07:00
software-development	feat(skills): add simplify-code skill — parallel 3-agent code review and cleanup (#41691 )	2026-06-07 22:02:41 -07:00
yuanbao	feat(skills): declare platforms frontmatter for all 79 undeclared built-in skills	2026-05-08 14:27:40 -07:00