mirror of
https://github.com/NousResearch/hermes-agent.git
synced 2026-04-25 00:51:20 +00:00
fix(skills): clarify baoyu-comic character sheet role
Page prompts are written in Step 5 from the text descriptions in characters/characters.md — the PNG sheet generated in Step 7.1 cannot be used to write them. Reposition the PNG as a human-facing review artifact (and reference for later regenerations / manual edits), and drop the confusing "Character sheet | Strategy" tables since the embedding rule is uniform.
This commit is contained in:
parent
fe025425cb
commit
83a7a005aa
3 changed files with 7 additions and 17 deletions
|
|
@ -32,7 +32,7 @@ Ported from [JimLiu/baoyu-skills](https://github.com/JimLiu/baoyu-skills) v1.56.
|
||||||
|
|
||||||
`image_generate`'s schema accepts only `prompt` and `aspect_ratio` (`landscape` | `portrait` | `square`). Upstream's reference-image flow (`--ref characters.png` for character consistency, plus user-supplied refs for style/palette/scene) does not map to this tool, so the workflow was restructured:
|
`image_generate`'s schema accepts only `prompt` and `aspect_ratio` (`landscape` | `portrait` | `square`). Upstream's reference-image flow (`--ref characters.png` for character consistency, plus user-supplied refs for style/palette/scene) does not map to this tool, so the workflow was restructured:
|
||||||
|
|
||||||
- **Character sheet** is still generated, but it is an **agent-facing** reference used when writing each page's prompt text. `image_generate` never sees it as a visual input.
|
- **Character sheet PNG** is still generated for multi-page comics, but it is repositioned as a **human-facing review artifact** (for visual verification) and a reference for later regenerations / manual prompt edits. Page prompts themselves are built from the **text descriptions** in `characters/characters.md` (embedded inline during Step 5). `image_generate` never sees the PNG as a visual input.
|
||||||
- **User-supplied reference images** are reduced to `style` / `palette` / `scene` trait extraction — traits are embedded in the prompt body; the image files themselves are kept only for provenance under `refs/`.
|
- **User-supplied reference images** are reduced to `style` / `palette` / `scene` trait extraction — traits are embedded in the prompt body; the image files themselves are kept only for provenance under `refs/`.
|
||||||
- **Page prompts** now mandate that character descriptions are embedded inline (copied from `characters/characters.md`) — this is the only mechanism left to enforce cross-page character consistency.
|
- **Page prompts** now mandate that character descriptions are embedded inline (copied from `characters/characters.md`) — this is the only mechanism left to enforce cross-page character consistency.
|
||||||
- **Download step** — after every `image_generate` call, the returned URL is fetched to disk (e.g., `curl -fsSL "<url>" -o <target>.png`) and verified before the workflow advances.
|
- **Download step** — after every `image_generate` call, the returned URL is fetched to disk (e.g., `curl -fsSL "<url>" -o <target>.png`) and verified before the workflow advances.
|
||||||
|
|
|
||||||
|
|
@ -47,7 +47,7 @@ references:
|
||||||
traits: "muted earth tones, soft-edged ink wash, low-contrast backgrounds"
|
traits: "muted earth tones, soft-edged ink wash, low-contrast backgrounds"
|
||||||
```
|
```
|
||||||
|
|
||||||
Character consistency is still driven by the **character sheet workflow** (Step 7.1–7.2) below, which relies on detailed text descriptions rather than direct image references.
|
Character consistency is driven by **text descriptions** in `characters/characters.md` (written in Step 3) that get embedded inline in every page prompt (Step 5). The optional PNG character sheet generated in Step 7.1 is a human-facing review artifact, not an input to `image_generate`.
|
||||||
|
|
||||||
## Options
|
## Options
|
||||||
|
|
||||||
|
|
@ -188,14 +188,9 @@ Use Hermes' built-in `image_generate` tool for all image rendering. Its schema a
|
||||||
2. Fetch the image bytes (e.g., `curl -fsSL "<url>" -o <target>.png`)
|
2. Fetch the image bytes (e.g., `curl -fsSL "<url>" -o <target>.png`)
|
||||||
3. Verify the file exists and is non-empty before proceeding to the next page
|
3. Verify the file exists and is non-empty before proceeding to the next page
|
||||||
|
|
||||||
**7.1 Character sheet** — generate it (to `characters/characters.png`, aspect `landscape`) when the comic is multi-page with recurring characters. Skip for simple presets (e.g., four-panel minimalist) or single-page comics. The prompt file at `characters/characters.md` must exist before invoking `image_generate`. After download, the character sheet is consumed **for the agent's own reference** when writing each page's prompt text — Hermes' `image_generate` cannot accept it as a visual input.
|
**7.1 Character sheet** — generate it (to `characters/characters.png`, aspect `landscape`) when the comic is multi-page with recurring characters. Skip for simple presets (e.g., four-panel minimalist) or single-page comics. The prompt file at `characters/characters.md` must exist before invoking `image_generate`. The rendered PNG is a **human-facing review artifact** (so the user can visually verify character design) and a reference for later regenerations or manual prompt edits — it does **not** drive Step 7.2. Page prompts are already written in Step 5 from the **text descriptions** in `characters/characters.md`; `image_generate` cannot accept images as visual input.
|
||||||
|
|
||||||
**7.2 Pages** — each page's prompt MUST already be at `prompts/NN-{cover|page}-[slug].md` before invoking `image_generate`. Because `image_generate` is prompt-only, character consistency is enforced by **embedding character descriptions in every prompt**:
|
**7.2 Pages** — each page's prompt MUST already be at `prompts/NN-{cover|page}-[slug].md` before invoking `image_generate`. Because `image_generate` is prompt-only, character consistency is enforced by **embedding character descriptions (sourced from `characters/characters.md`) inline in every page prompt during Step 5**. The embedding is done uniformly whether or not a PNG sheet is produced in 7.1; the PNG is only a review/regeneration aid.
|
||||||
|
|
||||||
| Character sheet | Strategy |
|
|
||||||
|-----------------|----------|
|
|
||||||
| Exists | Prepend relevant character descriptions (from `characters/characters.md`) to every page prompt |
|
|
||||||
| Skipped | Prompt file already contains all descriptions inline |
|
|
||||||
|
|
||||||
**Backup rule**: existing `prompts/…md` and `…png` files → rename with `-backup-YYYYMMDD-HHMMSS` suffix before regenerating.
|
**Backup rule**: existing `prompts/…md` and `…png` files → rename with `-backup-YYYYMMDD-HHMMSS` suffix before regenerating.
|
||||||
|
|
||||||
|
|
@ -237,5 +232,5 @@ Full step-by-step workflow (analysis, storyboard, review gates, regeneration var
|
||||||
- Use stylized alternatives for sensitive public figures
|
- Use stylized alternatives for sensitive public figures
|
||||||
- **Step 2 confirmation required** - do not skip
|
- **Step 2 confirmation required** - do not skip
|
||||||
- **Steps 4/6 conditional** - only if user requested in Step 2
|
- **Steps 4/6 conditional** - only if user requested in Step 2
|
||||||
- **Step 7.1 character sheet** - recommended for multi-page comics, optional for simple presets. It is an **agent-facing** reference used to write consistent page prompts; `image_generate` does not accept it as a visual input
|
- **Step 7.1 character sheet** - recommended for multi-page comics, optional for simple presets. The PNG is a review/regeneration aid; page prompts (written in Step 5) use the text descriptions in `characters/characters.md`, not the PNG. `image_generate` does not accept images as visual input
|
||||||
- **Strip secrets** — scan source content for API keys, tokens, or credentials before writing any output file
|
- **Strip secrets** — scan source content for API keys, tokens, or credentials before writing any output file
|
||||||
|
|
|
||||||
|
|
@ -340,7 +340,7 @@ Character sheet is recommended for multi-page comics with recurring characters,
|
||||||
3. Call `image_generate` with `landscape` format
|
3. Call `image_generate` with `landscape` format
|
||||||
4. Download the returned URL → save to `characters/characters.png`
|
4. Download the returned URL → save to `characters/characters.png`
|
||||||
|
|
||||||
**Important**: the downloaded sheet is for the **agent's own reference** when writing each page's prompt text below. `image_generate` cannot accept it as a visual input.
|
**Important**: the downloaded sheet is a **human-facing review artifact** (so the user can visually verify character design) and a reference for later regenerations or manual prompt edits. It does **not** drive Step 7.2 — page prompts were already written in Step 5 from the text descriptions in `characters/characters.md`. `image_generate` cannot accept images as visual input, so the text is the sole cross-page consistency mechanism.
|
||||||
|
|
||||||
### 7.2 Generate Comic Pages
|
### 7.2 Generate Comic Pages
|
||||||
|
|
||||||
|
|
@ -348,12 +348,7 @@ Character sheet is recommended for multi-page comics with recurring characters,
|
||||||
1. Confirm each prompt file exists at `prompts/NN-{cover|page}-[slug].md`
|
1. Confirm each prompt file exists at `prompts/NN-{cover|page}-[slug].md`
|
||||||
2. Confirm that each prompt has character descriptions embedded inline (see Step 5). `image_generate` is prompt-only, so the prompt text is the sole consistency mechanism.
|
2. Confirm that each prompt has character descriptions embedded inline (see Step 5). `image_generate` is prompt-only, so the prompt text is the sole consistency mechanism.
|
||||||
|
|
||||||
**Page Generation Strategy** (embed everything in the prompt text):
|
**Page Generation Strategy**: every page prompt must embed character descriptions (sourced from `characters/characters.md`) inline. This is done during Step 5, uniformly whether or not the PNG sheet was produced in 7.1 — the PNG is only a review/regeneration aid, never a generation input.
|
||||||
|
|
||||||
| Character sheet | Strategy |
|
|
||||||
|-----------------|----------|
|
|
||||||
| Exists | Use it as an agent-side reference when composing each prompt; embed the key traits inline in the prompt text |
|
|
||||||
| Skipped | Prompt file already contains all descriptions inline |
|
|
||||||
|
|
||||||
**Example embedded prompt** (`prompts/01-page-xxx.md`):
|
**Example embedded prompt** (`prompts/01-page-xxx.md`):
|
||||||
|
|
||||||
|
|
|
||||||
Loading…
Add table
Add a link
Reference in a new issue