Page prompts are written in Step 5 from the text descriptions in characters/characters.md — the PNG sheet generated in Step 7.1 cannot be used to write them. Reposition the PNG as a human-facing review artifact (and reference for later regenerations / manual edits), and drop the confusing "Character sheet | Strategy" tables since the embedding rule is uniform.
5.6 KiB
Port Notes — baoyu-comic
Ported from JimLiu/baoyu-skills v1.56.1.
Changes from upstream
SKILL.md adaptations
| Change | Upstream | Hermes |
|---|---|---|
| Metadata namespace | openclaw |
hermes (with tags + homepage) |
| Trigger | Slash commands / CLI flags | Natural language skill matching |
| User config | EXTEND.md file (project/user/XDG paths) | Removed — not part of Hermes infra |
| User prompts | AskUserQuestion (batched) |
clarify tool (one question at a time) |
| Image generation | baoyu-imagine (Bun/TypeScript, supports --ref) |
image_generate — prompt-only, returns a URL; no reference image input; agent must download the URL to the output directory |
| PDF assembly | scripts/merge-to-pdf.ts (Bun + pdf-lib) |
Removed — the PDF merge step is out of scope for this port; pages are delivered as PNGs only |
| Platform support | Linux/macOS/Windows/WSL/PowerShell | Linux/macOS only |
| File operations | Generic instructions | Hermes file tools (write_file, read_file) |
Structural removals
references/config/directory (removed entirely):first-time-setup.md— blocking first-time setup flow for EXTEND.mdpreferences-schema.md— EXTEND.md YAML schemawatermark-guide.md— watermark config (tied to EXTEND.md)
scripts/directory (removed entirely): upstream'smerge-to-pdf.tsdepended onpdf-lib, which is not declared anywhere in the Hermes repo. Rather than add a new dependency, the port drops PDF assembly and delivers per-page PNGs.- Workflow Step 8 (Merge to PDF) removed from
workflow.md; Step 9 (Completion report) renumbered to Step 8. - Workflow Step 1.1 — "Load Preferences (EXTEND.md)" section removed from
workflow.md; steps 1.2/1.3 renumbered to 1.1/1.2. - Generic "User Input Tools" and "Image Generation Tools" preambles — SKILL.md no longer lists fallback rules for multiple possible tools; it references
clarifyandimage_generatedirectly.
Image generation strategy changes
image_generate's schema accepts only prompt and aspect_ratio (landscape | portrait | square). Upstream's reference-image flow (--ref characters.png for character consistency, plus user-supplied refs for style/palette/scene) does not map to this tool, so the workflow was restructured:
- Character sheet PNG is still generated for multi-page comics, but it is repositioned as a human-facing review artifact (for visual verification) and a reference for later regenerations / manual prompt edits. Page prompts themselves are built from the text descriptions in
characters/characters.md(embedded inline during Step 5).image_generatenever sees the PNG as a visual input. - User-supplied reference images are reduced to
style/palette/scenetrait extraction — traits are embedded in the prompt body; the image files themselves are kept only for provenance underrefs/. - Page prompts now mandate that character descriptions are embedded inline (copied from
characters/characters.md) — this is the only mechanism left to enforce cross-page character consistency. - Download step — after every
image_generatecall, the returned URL is fetched to disk (e.g.,curl -fsSL "<url>" -o <target>.png) and verified before the workflow advances.
SKILL.md reductions
- CLI option columns (
--art,--tone,--layout,--aspect,--lang,--ref,--storyboard-only,--prompts-only,--images-only,--regenerate) converted to plain-English option descriptions. - Preset files (
presets/*.md) andohmsha-guide.md:`--style X`/`--art X --tone Y`shorthand rewritten toart=X, tone=Y+ natural-language references. partial-workflows.md: per-skill slash command invocations rewritten as user-intent cues; PDF-related outputs removed.auto-selection.md: priority order dropped the EXTEND.md tier.analysis-framework.md: language-priority comment updated (user option → conversation → source).
File naming convention
Source content pasted by the user is saved as source-{slug}.md, where {slug} is the kebab-case topic slug used for the output directory. Backups follow the same pattern with a -backup-YYYYMMDD-HHMMSS suffix. SKILL.md and workflow.md now agree on this single convention.
What was preserved verbatim
- All 6 art-style definitions (
references/art-styles/) - All 7 tone definitions (
references/tones/) - All 7 layout definitions (
references/layouts/) - Core templates:
character-template.md,storyboard-template.md,base-prompt.md - Preset bodies (only the first few intro lines adapted; special rules unchanged)
- Author, version, homepage attribution
Syncing with upstream
To pull upstream updates:
# Compare versions
curl -sL https://raw.githubusercontent.com/JimLiu/baoyu-skills/main/skills/baoyu-comic/SKILL.md | head -5
# Look for the version: line
# Diff a reference file
diff <(curl -sL https://raw.githubusercontent.com/JimLiu/baoyu-skills/main/skills/baoyu-comic/references/art-styles/manga.md) \
references/art-styles/manga.md
Art-style, tone, and layout reference files can usually be overwritten directly (they're upstream-verbatim). SKILL.md, references/workflow.md, references/partial-workflows.md, references/auto-selection.md, references/analysis-framework.md, references/ohmsha-guide.md, and references/presets/*.md must be manually merged since they contain Hermes-specific adaptations.
If upstream adds a Hermes-compatible PDF merge step (no extra npm deps), restore scripts/ and reintroduce Step 8 in workflow.md.