hermes-agent

mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-04-25 00:51:20 +00:00

Author	SHA1	Message	Date
Teknium	73bccc94c7	skills: consolidate mlops redundancies (gguf+llama-cpp, grpo+trl, guidance→optional) (#11965 ) Three tightly-scoped built-in skill consolidations to reduce redundancy in the available_skills listing injected into every system prompt: 1. gguf-quantization → llama-cpp (merged) GGUF is llama.cpp's format; two skills covered the same toolchain. The merged llama-cpp skill keeps the full K-quant table + imatrix workflow from gguf and the ROCm/benchmarks/supported-models sections from the original llama-cpp. All 5 reference files preserved. 2. grpo-rl-training → fine-tuning-with-trl (folded in) GRPO isn't a framework, it's a trainer inside TRL. Moved the 17KB deep-dive SKILL.md to references/grpo-training.md and the working template to templates/basic_grpo_training.py. TRL's GRPO workflow section now points to both. Atropos skill's related_skills updated. 3. guidance → optional-skills/mlops/ Dropped from built-in. Outlines (still built-in) covers the same structured-generation ground with wider adoption. Listed in the optional catalog for users who specifically want Guidance. Net: 3 fewer built-in skill lines in every system prompt, zero content loss. Contributor authorship preserved via git rename detection.	2026-04-17 21:36:40 -07:00
Teknium	fdefd98aa3	docs(skills): make descriptions self-contained, not cross-dependent Previous pass assumed both skills would always be loaded together, so each description pointed at the other ('use concept-diagrams instead'). That breaks when only one skill is active — the agent reads 'use the other skill' and there is no other skill. Now each skill's description and scope section is fully self-contained: - States what it's best suited for - Lists subjects where a more specialized skill (if available) would be a better fit, naming them only as 'consider X if available' - Explicitly offers itself as a general SVG diagram fallback when no more specialized skill exists An agent loading either skill alone gets unambiguous guidance; an agent with both loaded still gets useful routing via the 'consider X if available' hints and the related_skills metadata.	2026-04-16 20:39:55 -07:00
Teknium	7d535969ff	docs(skills): make architecture-diagram vs concept-diagrams routing explicit Both skills generate SVG system diagrams, but for very different subjects and aesthetics. The old descriptions didn't make the split clear, so an agent loading either one couldn't confidently pick. Changes: - Rewrote both frontmatter descriptions to state the scope up front plus an explicit 'for X, use the other skill instead' pointer. - Added a symmetric 'When to use this skill vs <other>' decision table to the top of each SKILL.md body, so the guidance is visible whether the agent is reading frontmatter or full content. - Added architecture-diagram <-> concept-diagrams to each other's related_skills metadata. Rule of thumb baked into both skills: software/cloud infra -> architecture-diagram physical / scientific / educational -> concept-diagrams	2026-04-16 20:39:55 -07:00
Teknium	80855f964e	fix: stop hermes update from nagging about llm-wiki's wiki.path (#11222 ) llm-wiki was the only shipped skill using metadata.hermes.config, which caused 'hermes update' and 'hermes config migrate' to prompt for a wiki directory on every run — even for users who have never touched the skill — because 'enabled' is opt-out (all shipped skills count as enabled unless explicitly disabled). Declining the prompt didn't persist anything, so the nag fired again on every update. Switch llm-wiki to the env var + runtime default pattern that obsidian and google-workspace already use: WIKI_PATH env var, default $HOME/wiki. No prompting infrastructure, no config.yaml touch, no nag loop. Changes: - skills/research/llm-wiki/SKILL.md: remove metadata.hermes.config, document WIKI_PATH env var in the Wiki Location section, update the orientation snippet and initialization guidance. - Docs: replace llm-wiki's wiki.path examples with a generic 'myplugin.path' placeholder across configuration.md, features/skills.md, and creating-skills.md so users don't try to set skills.config.wiki.path expecting llm-wiki to use it. - skills-catalog.md: mention WIKI_PATH instead of skills.config.wiki.path. E2E verified: discover_all_skill_config_vars() and get_missing_skill_config_vars() both return 0 entries after this change, so the prompt branch in migrate_config() no longer fires. The metadata.hermes.config feature stays in place for third-party skills that genuinely need structured config, but built-ins now prefer env vars.	2026-04-16 13:34:16 -07:00
LeonSGP43	daef0519e9	fix(google-workspace): normalize authorized user token writes	2026-04-16 04:22:16 -07:00
Teknium	5ef0fe1665	docs: fix stale hermes login references in hermes-agent skill (#10603 ) Follow-up to #10471 — replace remaining 'hermes login --provider' references with current 'hermes auth' flow.	2026-04-15 17:43:54 -07:00
Teknium	e402906d48	fix: five HERMES_HOME profile-isolation leaks (#10570 ) * fix: show correct env var name in provider API key error (#9506) The error message for missing provider API keys dynamically built the env var name as PROVIDER_API_KEY (e.g. ALIBABA_API_KEY), but some providers use different names (alibaba uses DASHSCOPE_API_KEY). Users following the error message set the wrong variable. Fix: look up the actual env var from PROVIDER_REGISTRY before building the error. Falls back to the dynamic name if the registry lookup fails. Closes #9506 * fix: five HERMES_HOME profile-isolation leaks (#5947) Bug A: Thread session_title from session_db to memory provider init kwargs so honcho can derive chat-scoped session keys instead of falling back to cwd-based naming that merges all gateway users into one session. Bug B: Replace 14 hardcoded ~/.hermes/skills/ paths across 10 skill files with HERMES_HOME-aware alternatives (${HERMES_HOME:-$HOME/.hermes} in shell, os.environ.get('HERMES_HOME', ...) in Python). Bug C: install.sh now respects HERMES_HOME env var and adds --hermes-home flag. Previously --dir only set INSTALL_DIR while HERMES_HOME was always hardcoded to $HOME/.hermes. Bug D: Remove hardcoded ~/.hermes/honcho.json fallback in resolve_config_path(). Non-default profiles no longer silently inherit the default profile's honcho config. Falls through to ~/.honcho/config.json (global) instead. Bug E: Guard _edit_skill, _patch_skill, _delete_skill, _write_file, and _remove_file against writing to skills found in external_dirs. Skills outside the local SKILLS_DIR are now read-only from the agent's perspective. Closes #5947	2026-04-15 17:09:41 -07:00
Misturi	dedc4600dd	fix(skills): handle missing fields in Google Workspace token file gracefully instead of crashing with KeyError	2026-04-15 03:45:09 -07:00
Misturi	8bc9b5a0b4	fix(skills): use `is None` check for coordinates in find-nearby to avoid dropping valid 0.0 values	2026-04-15 03:45:09 -07:00
Teknium	ef04de3e98	docs: update tool-adding instructions for auto-discovery - AGENTS.md: 3 files → 2 files, remove _discover_tools() step - adding-tools.md: remove Step 3, note auto-discovery - architecture.md: update discovery description - tools-runtime.md: replace manual list with discover_builtin_tools() docs - hermes-agent skill: remove manual import step	2026-04-14 21:12:29 -07:00
Teknium	b24e5ee4b0	feat(google-workspace): add --from flag for custom sender display name (#9931 ) Adds --from flag to gmail send and gmail reply commands, allowing agents to customize the From header display name when sharing the same email account. Usage: --from '"Agent Name" <user@example.com>' Also syncs repo google_api.py with the deployed standalone implementation (replaces outdated gws_bridge thin wrapper), adds dedicated docs page under Features > Skills, and updates sidebar navigation. Requested by community user @Maxime44.	2026-04-14 16:55:34 -07:00
Teknium	55ce76b372	feat: add architecture-diagram skill (Cocoon AI port) (#9906 ) Port of Cocoon AI's architecture-diagram-generator (MIT) as a Hermes skill. Generates professional dark-themed system architecture diagrams as standalone HTML/SVG files. Self-contained output, no dependencies. - SKILL.md with design system specs, color palette, layout rules - HTML template with all component types, arrow styles, legend examples - Fits alongside excalidraw in creative/ category Source: https://github.com/Cocoon-AI/architecture-diagram-generator	2026-04-14 16:10:18 -07:00
Teknium	3804556cd9	fix: restore clarify toolset row removed in cherry-pick	2026-04-13 02:49:11 -07:00
Haoqing Wang	8e0ae66520	fix(skills): correct TTS/STT providers, add missing platforms/commands in hermes-agent skill Fixes verified via 5-container parallel testing against v0.8.0 codebase. Critical fixes: - TTS providers: replace nonexistent kokoro/fish with actual minimax/mistral/neutts - STT providers: add missing mistral (Voxtral Transcribe) - Testing section: remove `source venv/bin/activate` (no venv dir in project) Expanded coverage: - Provider table: 13 → 22 entries (add Gemini, xAI, Xiaomi, Qwen OAuth, MiniMax CN, etc.) - Platform list: add BlueBubbles (iMessage) and Weixin (WeChat), clarify Open WebUI - Slash commands: add 14 undocumented commands (/approve, /deny, /branch, /fast, etc.) - Toolsets: add 4 missing (messaging, search, todo, rl) - Troubleshooting: expand from 6 to 10 sections with practical deployment fixes (Copilot OAuth 403, gateway linger, WSL2 systemd, Discord intents, etc.) Minor fixes: - agent/ directory description expanded - delegation config keys completed - /restart noted as gateway-only - hermes honcho noted as plugin-dependent	2026-04-13 02:49:11 -07:00
Teknium	b22663ea69	docs: restore Orchestra Research attribution in research-paper-writing skill (#8800 ) PR #4654 replaced ml-paper-writing with research-paper-writing, preserving the writing philosophy and reference files but dropping the dedicated 'Sources Behind This Guidance' attribution table from the SKILL.md body. Re-adds: - The researcher attribution table (Nanda, Farquhar, Gopen & Swan, Lipton, Steinhardt, Perez, Karpathy) with affiliations and links to SKILL.md - Orchestra Research credit as original compiler of the writing philosophy - 'Origin & Attribution' section in sources.md documenting the full chain: Nanda blog → Orchestra skill → teknium integration → SHL0MS expansion	2026-04-12 22:03:18 -07:00
SHL0MS	801a26c014	feat(skills): add creative ideation — constraint-driven project generation Generate project ideas through creative constraints. Constraint + direction = creativity. Core skill (SKILL.md, 147 lines): - 15 curated constraints across 3 categories: developers, makers, anyone - Developer-focused prompts: 'solve your own itch', 'the CLI tool that should exist', 'automate the annoying thing', 'nothing new except glue' - Matching table: maps user mood/intent to appropriate constraints - Complete worked example with 3 concrete project ideas - Output format for consistent, actionable idea presentation Extended library (references/full-prompt-library.md, 110 lines): - 30+ additional constraints: communication, screens, philosophy, transformation, identity, scale, starting points Constraint approach inspired by wttdotm.com/prompts.html. Adapted for software development and general-purpose ideation.	2026-04-11 01:44:36 -04:00
SHL0MS	941608cdde	feat(skills): add creative divergence strategies for experimental output Adds opt-in creative thinking frameworks to ascii-video, p5js, and manim-video skills, based on Lluminate (joelsimon.net/lluminate). Only engaged when the user explicitly asks for creative, experimental, or unconventional output. Straightforward requests are unaffected. Each skill gets 2-3 strategies matched to its domain: - ascii-video: Forced Connections, Conceptual Blending, Oblique Strategies - p5js: Conceptual Blending, SCAMPER, Distance Association - manim-video: SCAMPER, Assumption Reversal Strategies sourced from creativity research (Boden, Eno, de Bono, Koestler, Fauconnier & Turner, Osborn), formalized for LLM prompting by Lluminate.	2026-04-09 21:40:16 -04:00
Teknium	e9168f917e	fix: handle HTTP errors gracefully in gws_bridge token refresh Instead of crashing with a raw urllib traceback on refresh failure, print a clean error message and suggest re-running setup.py.	2026-04-09 14:28:35 -07:00
Teknium	73eb59db8d	fix: follow-up fixes for google-workspace gws migration - Fix npm package name: @anthropic -> @googleworkspace/cli - Add Homebrew install option - Fix calendar_list to respect --start/--end args (uses raw Calendar API for date ranges, +agenda helper for default 7-day view) - Improve check_auth partial scope output (list missing scopes) - Add output format documentation with key JSON shapes - Use npm install in troubleshooting (no Rust toolchain needed) Follow-up to cherry-picked PR #6713	2026-04-09 14:28:35 -07:00
spideystreet	127b4caf0d	feat(skills): migrate google-workspace to gws CLI backend Migrate the google-workspace skill from custom Python API wrappers (google-api-python-client) to Google's official Rust CLI gws (googleworkspace/cli). Add gws_bridge.py for headless-compatible token refresh. Fix partial OAuth scope handling. Co-authored-by: spideystreet <dhicham.pro@gmail.com> Cherry-picked from PR #6713	2026-04-09 14:28:35 -07:00
Teknium	1780ad24b1	fix: normalize remaining reasoning effort orderings and add missing 'minimal' Follow-up to cherry-picked PR #6698. Fixes spots the original PR missed: - hermes_constants.py: VALID_REASONING_EFFORTS tuple ordering - gateway/run.py: _load_reasoning_config docstring + validation tuple - configuration.md and batch-processing.md: docs ordering - hermes-agent skill: /reasoning usage hint was missing 'minimal'	2026-04-09 14:20:16 -07:00
Teknium	2772d99085	fix: remove /prompt slash command — footgun via prefix expansion (#6752 ) /pr <anything> silently resolved to /prompt via the shortest-match tiebreaker in prefix expansion, permanently overwriting the system prompt and persisting to config. The command's functionality (setting agent.system_prompt) is available via config.yaml and /personality covers the common use case. Removes: CommandDef, dispatch branch, _handle_prompt_command handler, docs references, and updates subcommand extraction test.	2026-04-09 11:27:27 -07:00
Teknium	d0ffb111c2	refactor: codebase-wide lint cleanup — unused imports, dead code, and inefficient patterns (#5821 ) Comprehensive cleanup across 80 files based on automated (ruff, pyflakes, vulture) and manual analysis of the entire codebase. Changes by category: Unused imports removed (~95 across 55 files): - Removed genuinely unused imports from all major subsystems - agent/, hermes_cli/, tools/, gateway/, plugins/, cron/ - Includes imports in try/except blocks that were truly unused (vs availability checks which were left alone) Unused variables removed (~25): - Removed dead variables: connected, inner, channels, last_exc, source, new_server_names, verify, pconfig, default_terminal, result, pending_handled, temperature, loop - Dropped unused argparse subparser assignments in hermes_cli/main.py (12 instances of add_parser() where result was never used) Dead code removed: - run_agent.py: Removed dead ternary (None if False else None) and surrounding unreachable branch in identity fallback - run_agent.py: Removed write-only attribute _last_reported_tool - hermes_cli/providers.py: Removed dead @property decorator on module-level function (decorator has no effect outside a class) - gateway/run.py: Removed unused MCP config load before reconnect - gateway/platforms/slack.py: Removed dead SessionSource construction Undefined name bugs fixed (would cause NameError at runtime): - batch_runner.py: Added missing logger = logging.getLogger(__name__) - tools/environments/daytona.py: Added missing Dict and Path imports Unnecessary global statements removed (14): - tools/terminal_tool.py: 5 functions declared global for dicts they only mutated via .pop()/[key]=value (no rebinding) - tools/browser_tool.py: cleanup thread loop only reads flag - tools/rl_training_tool.py: 4 functions only do dict mutations - tools/mcp_oauth.py: only reads the global - hermes_time.py: only reads cached values Inefficient patterns fixed: - startswith/endswith tuple form: 15 instances of x.startswith('a') or x.startswith('b') consolidated to x.startswith(('a', 'b')) - len(x)==0 / len(x)>0: 13 instances replaced with pythonic truthiness checks (not x / bool(x)) - in dict.keys(): 5 instances simplified to in dict - Redefined unused name: removed duplicate _strip_mdv2 import in send_message_tool.py Other fixes: - hermes_cli/doctor.py: Replaced undefined logger.debug() with pass - hermes_cli/config.py: Consolidated chained .endswith() calls Test results: 3934 passed, 17 failed (all pre-existing on main), 19 skipped. Zero regressions.	2026-04-07 10:25:31 -07:00
SHL0MS	902a02e3d5	Merge pull request #5791 from leotrs/manim-ce-reference-improvements Expand Manim CE reference docs: geometry, animations, and LaTeX environments	2026-04-07 12:15:59 -04:00
Teknium	8b861b77c1	refactor: remove browser_close tool — auto-cleanup handles it (#5792 ) * refactor: remove browser_close tool — auto-cleanup handles it The browser_close tool was called in only 9% of browser sessions (13/144 navigations across 66 sessions), always redundantly — cleanup_browser() already runs via _cleanup_task_resources() at conversation end, and the background inactivity reaper catches anything else. Removing it saves one tool schema slot in every browser-enabled API call. Also fixes a latent bug: cleanup_browser() now handles Camofox sessions too (previously only Browserbase). Camofox sessions were never auto-cleaned per-task because they live in a separate dict from _active_sessions. Files changed (13): - tools/browser_tool.py: remove function, schema, registry entry; add camofox cleanup to cleanup_browser() - toolsets.py, model_tools.py, prompt_builder.py, display.py, acp_adapter/tools.py: remove browser_close from all tool lists - tests/: remove browser_close test, update toolset assertion - docs/skills: remove all browser_close references * fix: repeat browser_scroll 5x per call for meaningful page movement Most backends scroll ~100px per call — barely visible on a typical viewport. Repeating 5x gives ~500px (~half a viewport), making each scroll tool call actually useful. Backend-agnostic approach: works across all 7+ browser backends without needing to configure each one's scroll amount individually. Breaks early on error for the agent-browser path. * feat: auto-return compact snapshot from browser_navigate Every browser session starts with navigate → snapshot. Now navigate returns the compact accessibility tree snapshot inline, saving one tool call per browser task. The snapshot captures the full page DOM (not viewport-limited), so scroll position doesn't affect it. browser_snapshot remains available for refreshing after interactions or getting full=true content. Both Browserbase and Camofox paths auto-snapshot. If the snapshot fails for any reason, navigation still succeeds — the snapshot is a bonus, not a requirement. Schema descriptions updated to guide models: navigate mentions it returns a snapshot, snapshot mentions it's for refresh/full content. * refactor: slim cronjob tool schema — consolidate model/provider, drop unused params Session data (151 calls across 67 sessions) showed several schema properties were never used by models. Consolidated and cleaned up: Removed from schema (still work via backend/CLI): - skill (singular): use skills array instead - reason: pause-only, unnecessary - include_disabled: now defaults to true - base_url: extreme edge case, zero usage - provider (standalone): merged into model object Consolidated: - model + provider → single 'model' object with {model, provider} fields. If provider is omitted, the current main provider is pinned at creation time so the job stays stable even if the user changes their default. Kept: - script: useful data collection feature - skills array: standard interface for skill loading Schema shrinks from 14 to 10 properties. All backend functionality preserved — the Python function signature and handler lambda still accept every parameter. * fix: remove mixture_of_agents from core toolsets — opt-in only via hermes tools MoA was in _HERMES_CORE_TOOLS and composite toolsets (hermes-cli, hermes-messaging, safe), which meant it appeared in every session for anyone with OPENROUTER_API_KEY set. The _DEFAULT_OFF_TOOLSETS gate only works after running 'hermes tools' explicitly. Now MoA only appears when a user explicitly enables it via 'hermes tools'. The moa toolset definition and check_fn remain unchanged — it just needs to be opted into.	2026-04-07 03:28:44 -07:00
Leo Torres	e8f6854cab	docs: expand Manim CE reference docs with additional API coverage Add geometry mobjects, movement/creation animations, and LaTeX environments to the skill's reference docs. All verified against Manim CE v0.20.1. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-07 11:36:13 +02:00
Teknium	f609bf277d	feat: update blogwatcher skill to JulienTant's fork (#5759 ) Replace Hyaxia/blogwatcher with JulienTant/blogwatcher-cli fork which adds: - Docker support with BLOGWATCHER_DB env var for persistent storage - SQL injection prevention - SSRF protection (blocks private IPs/metadata endpoints) - HTML scraping fallback when RSS unavailable - OPML import from Feedly/Inoreader/NewsBlur - Category filtering for articles - Direct binary downloads (no Go required) - Migration guide from original blogwatcher Binary name changed: blogwatcher -> blogwatcher-cli Community contribution by Ao (JulienTant). Closes discussion about Docker compatibility.	2026-04-06 23:59:26 -07:00
Zainan Victor Zhou	190471fdc0	docs: use HERMES_HOME in google-workspace skill examples - avoid hard-coded ~/.hermes paths in the setup and API shorthands - prefer HERMES_HOME with a sane default to /Users/peteradams/.hermes - keep the examples aligned with profile-aware Hermes installs	2026-04-06 16:50:07 -07:00
Zainan Victor Zhou	83df001d01	fix: allow google-workspace skill scripts to run directly - fall back to adding the repo root to sys.path when hermes_constants is not importable - fixes direct execution of setup.py and google_api.py from the repo checkout - keeps the upstream PR scoped to the google-workspace compatibility fix	2026-04-06 16:50:07 -07:00
Teknium	539629923c	docs(llm-wiki): add Obsidian Headless setup for servers (#5660 ) Adds obsidian-headless (npm) setup guide to the Obsidian Integration section — Node 22+, ob login, sync-create-remote, sync-setup, systemd service for continuous background sync. Covers the full headless workflow for agents running on servers syncing to Obsidian desktop on other devices.	2026-04-06 16:37:14 -07:00
Teknium	150f70f821	feat(skills): add skill config interface + llm-wiki skill (#5635 ) Skills can now declare config.yaml settings via metadata.hermes.config in their SKILL.md frontmatter. Values are stored under skills.config.* namespace, prompted during hermes config migrate, shown in hermes config show, and injected into the skill context at load time. Also adds the llm-wiki skill (Karpathy's LLM Wiki pattern) as the first skill to use the new config interface, declaring wiki.path. Skill config interface (new): - agent/skill_utils.py: extract_skill_config_vars(), discover_all_skill_config_vars(), resolve_skill_config_values(), SKILL_CONFIG_PREFIX - agent/skill_commands.py: _inject_skill_config() injects resolved values into skill messages as [Skill config: ...] block - hermes_cli/config.py: get_missing_skill_config_vars(), skill config prompting in migrate_config(), Skill Settings in show_config() LLM Wiki skill (skills/research/llm-wiki/SKILL.md): - Three-layer architecture (raw sources, wiki pages, schema) - Three operations (ingest, query, lint) - Session orientation, page thresholds, tag taxonomy, update policy, scaling guidance, log rotation, archiving workflow Docs: creating-skills.md, configuration.md, skills.md, skills-catalog.md Closes #5100	2026-04-06 13:49:13 -07:00
SHL0MS	17e2a27c51	feat(skills): add p5js creative coding skill Production pipeline for interactive and generative visual art using p5.js. Covers 7 modes: generative art, data visualization, interactive experiences, animation/motion graphics, 3D scenes, image processing, and audio-reactive. Includes: - SKILL.md with creative standard, pipeline, and critical implementation notes - 10 reference files covering core API, shapes, visual effects (noise, flow fields, particles, domain warp, attractors, L-systems, circle packing, bloom, reaction-diffusion), animation (easing, springs, state machines, scene transitions), typography, color systems, WebGL/3D/shaders, interaction, and comprehensive export pipeline - Deterministic headless frame capture via Puppeteer (noLoop + redraw) - ffmpeg render pipeline for MP4 video export - Per-clip architecture for multi-scene video production - Interactive viewer template with seed navigation and parameter controls - Performance guidance: FES disable, Math.* hot loops, per-pixel budgets - Addon library coverage: p5.brush, p5.grain, CCapture.js, p5.js-svg - fxhash/Art Blocks generative platform conventions - p5.js 2.0 migration guide (async setup, OKLCH, splineVertex, shader.modify) - 13 documented common mistakes and troubleshooting patterns 17 files, ~5,900 lines.	2026-04-06 14:39:00 -04:00
SHL0MS	a912cd4568	docs(manim-video): add 5 new reference files — design thinking, updaters, paper explainer, decorations, production quality Five new reference files expanding the skill from rendering knowledge into production methodology: animation-design-thinking.md (161 lines): When to animate vs show static, concept decomposition into visual beats, pacing rules, narration sync, equation reveal strategies, architecture diagram patterns, common design mistakes. updaters-and-trackers.md (260 lines): Deep ValueTracker mental model, lambda/time-based/always_redraw updaters, DecimalNumber and Variable live displays, animation-based updaters, 4 complete practical patterns (dot tracing, live area, connected diagram, parameter exploration). paper-explainer.md (255 lines): Full workflow for turning research papers into animations. Audience selection, 5-minute template, pre-code gates (narration, scene list, style contract), equation reveal strategies, architecture diagram building, results animation, domain-specific patterns for ML/physics/ biomedical papers. decorations.md (202 lines): SurroundingRectangle, BackgroundRectangle, Brace, arrows (straight, curved, labeled), DashedLine, Angle/RightAngle, Cross, Underline, color highlighting workflows, annotation lifecycle pattern. production-quality.md (190 lines): Pre-code, pre-render, post-render checklists. Text overlap prevention, spatial layout coordinate budget, max simultaneous elements, animation variety audit, tempo curve, color consistency, data viz minimums. Total skill now: 14 reference files, 2614 lines.	2026-04-06 13:51:36 -04:00
SHL0MS	95a044a2e0	feat(research-paper-writing): fill coverage gaps and integrate patterns from AI-Scientist, GPT-Researcher Fix duplicate step numbers (5.3, 7.3) and missing 7.5. Add coverage for human evaluation, theory/survey/benchmark/position papers, ethics/broader impact, arXiv strategy, code packaging, negative results, workshop papers, multi-author coordination, compute budgeting, and post-acceptance deliverables. Integrate ensemble reviewing with meta-reviewer and negative bias, pre-compilation validation pipeline, experiment journal with tree structure, breadth/depth literature search, context management for large projects, two-pass refinement, VLM visual review, and claim verification. New references: human-evaluation.md, paper-types.md.	2026-04-06 01:12:32 -04:00
SHL0MS	b26e7fd43a	fix(manim-video): recommend monospace fonts — proportional fonts have broken kerning in Pango Manim's Pango text renderer produces broken kerning with proportional fonts (Helvetica, Inter, SF Pro, Arial) at all sizes and resolutions. Characters overlap and spacing is inconsistent. This is a fundamental Pango limitation. Changes: - Recommend Menlo (monospace) as the default font for ALL text - Proportional fonts only acceptable for large titles (>=48, short strings) - Set minimum font_size=18 for readability - Update all code examples to use MONO='Menlo' pattern - Remove Inter/Helvetica/SF Pro from recommendations	2026-04-06 00:35:43 -04:00
SHL0MS	447ec076a4	docs(manim-video): expand references with comprehensive Manim CE and 3b1b patterns Adds 601 lines across 6 reference files, sourced from deep review of: - Manim CE v0.20.1 full reference manual - 3b1b/manim example_scenes.py and source modules - 3b1b/videos production CLAUDE.md and workflow patterns - Manim CE thematic guides (voiceover, text, configuration) animations.md: always_redraw, TracedPath, FadeTransform, TransformFromCopy, ApplyMatrix, squish_rate_func, ShowIncreasingSubsets, ShowPassingFlash, expanded rate functions mobjects.md: SVGMobject, ImageMobject, Variable, BulletedList, DashedLine, Angle/RightAngle, boolean ops, LabeledArrow, t2c/t2f/t2s/t2w per-substring styling, backstroke for readability, apply_complex_function with prepare_for_nonlinear_transform equations.md: substrings_to_isolate, multi-line equations, TransformMatchingTex with matched_keys and key_map, set_color_by_tex graphs-and-data.md: Graph/DiGraph with layout algorithms, ArrowVectorField/StreamLines, ComplexPlane/PolarPlane camera-and-3d.md: ZoomedScene with inset zoom, LinearTransformationScene for 3b1b-style linear algebra rendering.md: manim.cfg project config, self.next_section() chapter markers, manim-voiceover plugin with ElevenLabs/GTTS integration and bookmark-based audio sync	2026-04-06 00:08:17 -04:00
SHL0MS	1572956fdc	Merge pull request #4930 from SHL0MS/feat/manim-video-skill-v2 feat(skills): add manim-video skill for mathematical and technical animations	2026-04-05 16:10:30 -07:00
SHL0MS	9d885b266c	feat(skills): add manim-video skill for mathematical and technical animations Production pipeline for creating 3Blue1Brown-style animated videos using Manim Community Edition. The agent handles the full workflow: creative planning, Python code generation, rendering, scene stitching, audio muxing, and iterative refinement. Modes: concept explainers, equation derivations, algorithm visualizations, data stories, architecture diagrams, paper explainers, 3D visualizations. 9 reference files, setup verification script, README. All API references verified against ManimCommunity/manim source.	2026-04-05 19:09:37 -04:00
Teknium	cb63b5f381	feat(skills): add popular-web-designs skill with 54 website design systems (#5194 ) Curated collection of production-quality design system specifications extracted from real websites (sourced from VoltAgent/awesome-design-md). Each template captures a site's complete visual language: colors, typography, components, layout, shadows, responsive behavior, and agent-ready CSS values. Hermes-specific adaptations in every template: - Google Fonts CDN link tags for proprietary font substitutes - CSS font-family stacks with proper fallbacks - Integration notes for write_file + generative-widgets workflow - browser_vision verification reminders SKILL.md includes categorized catalog, font substitution reference table, HTML generation pattern, and design-to-use-case matching guide. Sites: Airbnb, Airtable, Apple, BMW, Cal.com, Claude, Clay, ClickHouse, Cohere, Coinbase, Composio, Cursor, ElevenLabs, Expo, Figma, Framer, HashiCorp, IBM, Intercom, Kraken, Linear, Lovable, Minimax, Mintlify, Miro, Mistral AI, MongoDB, Notion, NVIDIA, Ollama, OpenCode, Pinterest, PostHog, Raycast, Replicate, Resend, Revolut, RunwayML, Sanity, Sentry, SpaceX, Spotify, Stripe, Supabase, Superhuman, Together AI, Uber, Vercel, VoltAgent, Warp, Webflow, Wise, xAI, Zapier	2026-04-05 00:42:55 -07:00
Teknium	0fd3de2674	docs(skill): claude-code v2.2 — add cheat sheet commands, env vars, rules, advanced features (#5158 ) Expands the claude-code skill with content from official docs and community cheat sheets that was missing from v2.0: Slash commands: /cost, /btw, /plan, /loop, /batch, /security-review, /resume, /effort (with auto level), /mcp, /release-notes, /voice details Keyboard shortcuts: Alt+P (model), Alt+T (thinking), Alt+O (fast mode), Ctrl+V (paste image), Ctrl+O (transcript), Ctrl+G (external editor) Ultrathink keyword for max reasoning on a specific turn Rules directory: .claude/rules/.md and ~/.claude/rules/.md Auto-memory: ~/.claude/projects/<proj>/memory/ (25KB/200 lines limit) Environment variables: CLAUDE_CODE_EFFORT_LEVEL, MAX_THINKING_TOKENS, CLAUDE_CODE_NO_FLICKER, CLAUDE_CODE_SUBPROCESS_ENV_SCRUB MCP limits: 2KB tool desc cap, maxResultSizeChars 500K, transport types Reorganized slash commands into Session/Development/Configuration groups Reorganized keyboard shortcuts into Controls/Toggles/Multiline groups	2026-04-04 19:15:57 -07:00
Teknium	ff544526cd	docs(skill): comprehensive claude-code skill rewrite v2.0 (#5155 ) Major rewrite of the claude-code orchestration skill from 94 to 460 lines. Based on official docs research, community guides, and live experimentation. Key additions: - Two orchestration modes: Print mode (-p) vs Interactive PTY via tmux - Detailed PTY dialog handling (trust + permissions bypass patterns) - Print mode deep dive: JSON output, piped input, session resumption, --json-schema, --bare mode for CI - Complete flag reference (20+ flags organized by category) - Interactive session patterns with tmux send-keys/capture-pane - Claude's slash commands and keyboard shortcuts reference - CLAUDE.md, hooks, custom subagents, MCP, custom commands docs - Cost/performance tips (effort levels, budget caps, context mgmt) - 10 specific pitfalls discovered through live testing - 10 rules for Hermes agents orchestrating Claude Code	2026-04-04 19:00:50 -07:00
Teknium	34308e4de9	docs: improve youtube-content skill structure and workflow Clearer workflow with validation/chunking steps, expanded description with trigger terms for better agent matching, tightened error handling. Fixed stray pipe character in original PR diff. Based on PR #4778 by fernandezbaptiste. Co-authored-by: fernandezbaptiste <fernandezbaptiste@users.noreply.github.com>	2026-04-03 22:18:00 -07:00
kshitijk4poor	5359921199	refactor: simplify scope validation helpers in google workspace scripts Fix double file read bug in google_api.py _missing_scopes(), consolidate redundant _normalize_scope_values into callers, merge duplicate except blocks.	2026-04-03 17:49:18 -07:00
kshitijk4poor	37e2ef6c3f	fix: protect profile-scoped google workspace oauth tokens	2026-04-03 17:49:18 -07:00
Teknium	52ddd6bc64	refactor(skills): consolidate code verification skills into one (#4854 ) * chore: release v0.7.0 (2026.4.3) 168 merged PRs, 223 commits, 46 resolved issues, 40+ contributors. Highlights: pluggable memory providers, credential pools, Camofox browser, inline diff previews, API server session continuity, ACP MCP registration, gateway hardening, secret exfiltration blocking. * refactor(skills): consolidate code-review + verify-code-changes into requesting-code-review Merge the passive code-review checklist and the automated verification pipeline (from PR #4459 by @MorAlekss) into a single requesting-code-review skill. This eliminates model confusion between three overlapping skills. Now includes: - Static security scan (grep on diff lines) - Baseline-aware quality gates (only flag NEW failures) - Multi-language tool detection (Python, Node, Rust, Go) - Independent reviewer subagent with fail-closed JSON verdict - Auto-fix loop with separate fixer agent (max 2 attempts) - Git checkpoint and [verified] commit convention Deletes: skills/software-development/code-review/ (absorbed) Closes: #406 (independent code verification)	2026-04-03 14:13:27 -07:00
SHL0MS	b86647c295	Replace ml-paper-writing with research-paper-writing: full research pipeline skill Replaces the writing-focused ml-paper-writing skill (940 lines) with a complete end-to-end research paper pipeline (1,599 lines SKILL.md + 3,184 lines across 7 reference files). New content: - Full 8-phase pipeline: project setup, literature review, experiment design, execution/monitoring, analysis, paper drafting, review/revision, submission preparation - Iterative refinement strategy guide from autoreason research (when to use autoreason vs critique-and-revise vs single-pass, model selection) - Hermes agent integration: delegate_task parallel drafting, cronjob monitoring, memory/todo state management, skill composition - Professional LaTeX tooling: microtype, siunitx, TikZ diagram patterns, algorithm2e, subcaption, latexdiff, SciencePlots - Human evaluation design: annotation protocols, inter-annotator agreement, crowdsourcing platforms - Title, Figure 1, conclusion, appendix strategy, page budget management - Anonymization checklist, rebuttal writing, camera-ready preparation - AAAI and COLM venue coverage (checklists, reviewer guidelines) Preserved from ml-paper-writing: - All writing philosophy (Nanda, Farquhar, Gopen & Swan, Lipton, Perez) - Citation verification workflow (5-step mandatory process) - All 6 conference templates (NeurIPS, ICML, ICLR, ACL, AAAI, COLM) - Conference requirements, format conversion workflow - Proactivity/collaboration guidance Bug fixes in inherited reference files: - BibLaTeX recommendation now correctly says natbib for conferences - Bare except clauses fixed to except Exception - Jinja2 template tags removed from citation-workflow.md - Stale date caveats added to reviewer-guidelines.md	2026-04-02 16:13:26 -04:00
analista	3400098481	fix: update fetch_transcript.py for youtube-transcript-api v1.x The library removed the static get_transcript() method in v1.0. Migrate to the new instance-based fetch() API and normalize FetchedTranscriptSnippet objects back to dicts for compatibility with the rest of the script.	2026-04-01 10:49:24 -07:00
Teknium	85e96a4638	fix(skills): move unified hermes-agent skill into autonomous-ai-agents category (#4435 ) The unified skill from PR #4332 was placed at a top-level skills/hermes-agent/ directory, creating a redundant standalone category. Move it to skills/autonomous-ai-agents/hermes-agent/ alongside claude-code, codex, and opencode where it belongs.	2026-04-01 03:39:25 -07:00
Teknium	b118f607b2	feat(skills): unify hermes-agent and hermes-agent-setup into single skill (#4332 ) Merges the hermes-agent-spawning skill (autonomous-ai-agents/) and hermes-agent-setup skill (dogfood/) into a single comprehensive skills/hermes-agent/ skill. The unified skill covers: - What Hermes Agent is and how it compares to Claude Code/Codex/OpenClaw - Complete CLI reference (all subcommands and flags) - Slash command reference - Configuration guide (providers, toolsets, config sections) - Voice/STT/TTS setup - Spawning additional agent instances (one-shot and interactive PTY) - Multi-agent coordination patterns - Troubleshooting guide - Where-to-find-things lookup table with docs links - Concise contributor quick reference Removes: - skills/autonomous-ai-agents/hermes-agent/ (hermes-agent-spawning) - skills/dogfood/hermes-agent-setup/	2026-03-31 14:49:20 -07:00
SHL0MS	ab62614a89	ascii-video: add text readability techniques and external layout oracle pattern - composition.md: add text backdrop (gaussian dark mask behind glyphs) and external layout oracle pattern (browser-based text layout → JSON → Python renderer pipeline for obstacle-aware text reflow) - shaders.md: add reverse vignette shader (center-darkening for text readability) - troubleshooting.md: add diagnostic entries for text-over-busy-background readability and kaleidoscope-destroys-text pitfall	2026-03-30 18:48:22 -04:00

1 2 3 4

158 commits