mirror of https://github.com/NousResearch/hermes-agent.git synced 2026-05-06 02:41:48 +00:00

James 20859cc408 docs(skill): sync hyperframes skill with upstream changes

Pulls the hyperframes skill up to the latest state of heygen-com/hyperframes
skill content. Opened 2026-04-17; upstream has shipped CLI, layout, and path
changes since.

- SKILL.md: promote the visual-style check to a proper HARD-GATE
  (DESIGN.md > named style > ask 3 questions, with the #333/#3b82f6/Roboto
  tells); expand Step 6 to cover audio-reactive (mandatory per-frame
  tl.call() sampling loop — a single long tween does NOT react to audio),
  caption exit guarantee (hard tl.set kill after group.end), marker
  highlighting, and scene transitions; add the animation-map script to
  Verification; link the new features.md.

- references/cli.md: add capture and validate (both shipped commands, both
  referenced from the workflow but missing from the reference). Add
  --lang to tts with the voice-prefix auto-inference table and espeak-ng
  dependency note (heygen-com/hyperframes#351, 2026-04-20 — after this
  PR opened).

- references/website-to-video.md: update all paths to the capture/
  subfolder layout introduced in heygen-com/hyperframes#345
  (capture/screenshots/, capture/assets/, capture/extracted/tokens.json).
  Old captured/ prefix was broken — agents following the skill were
  looking for files in wrong locations.

- references/features.md (new): distilled coverage for captions (language
  rule, tone table, word grouping, fitTextFontSize, exit guarantee), TTS
  (multilingual phonemization, speed tuning), audio-reactive (data
  format, mapping table, sampling pattern), marker highlighting
  (highlight/circle/burst/scribble/sketchout), and transitions (energy/
  mood tables, presets, shader-compatible CSS rules). Five topics the
  original PR didn't cover.

2026-05-04 14:13:17 -07:00

5.1 KiB

Raw Blame History

Website to Video

Capture a website, produce a professional video from it. Use when the user provides a URL and wants a video — social ad, product tour, 30-second promo, etc.

The workflow has 7 steps. Each produces an artifact that gates the next. Do not skip steps — each artifact prevents a downstream failure mode.

Step 1: Capture & Understand

npx hyperframes capture https://example.com -o example-video

Produces example-video/capture/ with:

capture/screenshots/ — above-the-fold + section screenshots (up to --max-screenshots)
capture/assets/ — logos, hero images, background video (if any)
capture/extracted/tokens.json — colors, fonts, and spacing tokens
capture/extracted/visible-text.txt — extracted headings, paragraphs, CTAs
capture/extracted/fonts.json — font families and stacks detected in computed styles
capture/asset-descriptions.md — auto-generated asset catalog

All subsequent steps read from the capture/ subfolder — capture/extracted/tokens.json, capture/assets/hero.png, etc. Never strip the capture/ prefix when referencing these files.

Gate: Print a site summary — name, top 3 colors, primary + display fonts, hero asset path, one-sentence vibe. Keep it in your context — don't re-capture.

Step 2: Write DESIGN.md

Small brand reference at the project root. 6 sections, ~90 lines. This is the cheat sheet — not the creative plan.

# DESIGN

## Brand
- Name: Example Co.
- One-line mission: "…"

## Colors
- Background: #0B0F14
- Primary: #00E0A4 (accent, CTA)
- Secondary: #7A8B9B (body text)
- Text: #FFFFFF

## Typography
- Display: "Inter Tight", 700, tight letter-spacing
- Body: "Inter", 400

## Motion
- Mood: precise, technical, confident
- Eases: `power3.out` for entrances, `expo.in` for exits

## Assets
- Logo: `capture/assets/logo.svg`
- Hero image: `capture/assets/hero.png`

## What NOT to Do
- No purple, no pastels, no serif body
- No playful/bubbly eases (`elastic`, `bounce`)
- No drop shadows on text

Gate: DESIGN.md exists in the project directory.

Step 3: Write SCRIPT.md

Narration script. Story backbone. Scene durations come from the narration, not from guessing.

# SCRIPT

## Scene 1 — Hook (0:00–0:04)
"What if your dashboards wrote themselves?"

## Scene 2 — Problem (0:04–0:11)
"Teams spend hours stitching together queries, charts, and callouts — every Monday."

## Scene 3 — Solution (0:11–0:22)
"Example Co. watches your data streams and proposes the dashboard you'd have built — in seconds."

## Scene 4 — CTA (0:22–0:28)
"Try it free at example.com."

Run npx hyperframes tts SCRIPT.md --voice af_nova --output narration.wav to generate TTS audio. Note the exact duration — that's the video's duration.

Gate: SCRIPT.md + narration.wav exist and durations match the plan (±0.3s).

Step 4: Storyboard

Text-only scene plan: for each scene, describe the hero frame — what's on screen at the scene's most-visible moment.

# STORYBOARD

## Scene 1 (0:00–0:04) — Hook
Hero frame: giant "WHAT IF YOUR DASHBOARDS WROTE THEMSELVES?" in display font, centered, on near-black. Logo top-left at 40% opacity.
Entrance: each word staggers in, 0.08s apart.
Transition out: flash-through-white into Scene 2.

One paragraph per scene. Do NOT skip this step — it's where you catch narrative gaps before writing HTML.

Gate: STORYBOARD.md exists. Each scene has: hero frame, entrance, transition.

Step 5: Composition

Write index.html scene-by-scene:

Each scene is a <div class="scene scene-N"> positioned absolutely, full-bleed.
Static HTML+CSS for the hero frame first (no GSAP).
Layer the narration <audio> at data-start="0" on a high track index.
Add a transitions component (flash-through-white, liquid-wipe, etc.) between each scene.
THEN add GSAP entrances (gsap.from()), no exits — transitions own the exit.
Register window.__timelines["root"] = tl.

Install transitions as needed:

npx hyperframes add flash-through-white

Step 6: Render

npx hyperframes lint --strict          # must pass
npx hyperframes validate               # WCAG contrast audit
npx hyperframes render --quality draft --output draft.mp4

Watch the draft. Note issues in a REVIEW.md bullet list (scene, timestamp, issue). Fix, re-render.

When happy:

npx hyperframes render --quality high --output final.mp4

Step 7: Deliver

Report file path + duration + file size to the user.
If the user wants a vertical cut, re-render with a 9:16 composition (data-width="1080" data-height="1920") — typically requires a separate index-vertical.html with tighter typography and re-stacked scene layout.

Common Failure Modes

Skipped DESIGN.md → colors drift scene-to-scene; output feels like "AI slides."
Skipped STORYBOARD.md → scenes overlap or hero frames collide with transitions.
Exit animations before transitions → empty frames when the transition fires.
Narration longer than data-duration → audio clips mid-sentence. Update the composition's data-duration to match the TTS output length + 0.5s buffer.

5.1 KiB Raw Blame History Unescape Escape