Mirror of https://github.com/NousResearch/hermes-agent.git, synced 2026-04-30 01:41:43 +00:00
docs(skills): salvage dropped trigger content into skill bodies
For 14 of 74 compressed skills, the original description contained trigger keywords, technique counts, attribution, or use-case phrases not covered by the existing body content. Prepends a 'When to use' / 'What's inside' block near the top so the agent still has the full context when the skill is loaded.

Skills salvaged:
- codex, ascii-video, creative-ideation, excalidraw, manim-video, p5js
- gif-search, heartmula, youtube-content
- lm-evaluation-harness, obliteratus, vllm, axolotl
- powerpoint

The remaining 60 skills were verified to already cover the dropped content in their existing body sections (When to Use, overview, intro prose) or had short descriptions fully captured by the new compressed form.
parent e3921e7ca4
commit 9f1b1977bc

14 changed files with 66 additions and 1 deletion
@@ -13,6 +13,10 @@ metadata:
 
 # lm-evaluation-harness - LLM Benchmarking
 
+## What's inside
+
+Evaluates LLMs across 60+ academic benchmarks (MMLU, HumanEval, GSM8K, TruthfulQA, HellaSwag). Use when benchmarking model quality, comparing models, reporting academic results, or tracking training progress. Industry standard used by EleutherAI, HuggingFace, and major labs. Supports HuggingFace, vLLM, APIs.
+
 ## Quick start
 
 lm-evaluation-harness evaluates LLMs across 60+ academic benchmarks using standardized prompts and metrics.
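For context on the tool this salvaged blurb describes: below is a minimal sketch of an lm-evaluation-harness run using its documented CLI. The model and task names are illustrative examples only, and it assumes the package has been installed from PyPI as `lm-eval`.

```bash
# Sketch: evaluate a HuggingFace model on one benchmark task with
# lm-evaluation-harness. Model/task names here are illustrative.
pip install lm-eval

lm_eval --model hf \
  --model_args pretrained=EleutherAI/pythia-160m \
  --tasks hellaswag \
  --batch_size 8
```

The harness reports standardized per-task metrics, which is what makes the cross-model comparisons mentioned in the "What's inside" block meaningful.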