picture
Image Generation
Generate AI images using Google Imagen via the Gemini API.
When to Use
- Presentation images (hero visuals, section dividers)
- Conceptual illustrations (visual metaphors, abstract concepts)
- Photo-realistic images (product mockups, scenarios)
- Visual explanations that benefit from AI generation
When NOT to Use
- Precise diagrams or charts — use
diagrammingskill (editable SVG, exact data) - Screenshots — use
screenshotskill - Simple icons — often faster to find stock or use emoji
Overlap with diagramming: There's fuzzy boundary. A "chart for a presentation" could go either way:
- Need precise data, editability → diagramming
- Need striking visual punch → image-generation
- Use judgement; ask if unclear.
Workflow
1. Understand the Need
Clarify with user:
- Purpose — presentation, concept illustration, visual metaphor?
- Style — photorealistic, illustration, abstract?
- Brand — does it need ITV styling? (if so, read itv-styling skill)
2. Draft with Flash
Use the fast model for initial iterations:
${CLAUDE_SKILL_DIR}/imagen.sh "prompt" --model gemini-2.5-flash-image
3. Review and Refine
Open the image, assess, iterate:
# Edit mode: refine previous output
${CLAUDE_SKILL_DIR}/imagen.sh "make it warmer, add more contrast" --input ./images/previous.png
4. Final Render with Pro
For client-facing or final deliverables:
${CLAUDE_SKILL_DIR}/imagen.sh "prompt" --model gemini-3-pro-image-preview
Command Reference
# Basic generation
imagen.sh "prompt" [--output ./images] [--model MODEL]
# Edit existing image
imagen.sh "refinement prompt" --input previous.png
# Models
--model gemini-2.5-flash-image # Fast, cheap (default)
--model gemini-3-pro-image-preview # Higher quality
--model imagen-4.0-generate-preview-06-06 # Imagen 4
Output: Saves to ./images/ with timestamped filename, prints path.
Prompting Framework
Based on Max Woolf's Nano Banana research.
Structure
[Specific object description with exact requirements in CAPS]
Aspects that MUST be followed EXACTLY:
- [Compositional rule 1]
- [Compositional rule 2]
[Publication/camera details for style elevation]
Do not include [unwanted elements].
Key Techniques
| Technique | Example |
|---|---|
| Structured bullets | Requirements as dashed list, not prose |
| ALL CAPS constraints | "MUST", "EXACTLY" increases adherence |
| Hex colors | #9F2B68 more precise than "magenta" |
| Composition rules | "rule of thirds", "negative space", "depth of field" |
| Style elevators | "Pulitzer Prize-winning cover photo for NYT" |
| Camera specs | "Canon EOS 90D DSLR camera" |
| Publication targets | "Vanity Fair cover profile" |
| Negative constraints | "Do not include text, watermarks, or line overlays" |
Example Prompt
A professional headshot of a confident business executive.
Aspects that MUST be followed EXACTLY:
- Shot from shoulders up, rule of thirds composition
- Neutral background with soft gradient #E8E8E8 to #FFFFFF
- Natural 3PM diffuse lighting from left
- Sharp focus on eyes, slight bokeh on background
Pulitzer Prize-winning portrait, Canon EOS R5, 85mm f/1.4.
Do not include any text, logos, or watermarks.
Composing with Brand Skills
With itv-styling
When creating ITV-branded images:
- Read
itv-stylingfor color palette and principles - Bake brand constraints into prompt:
Corporate presentation image for ITV.
Aspects that MUST be followed EXACTLY:
- Dark background #0F2323 (ITV dark green)
- Accent elements in #E8E557 (ITV yellow) or #4ECDC4 (ITV teal)
- Clean, modern, professional aesthetic
- No busy patterns or off-brand colors
Professional corporate photography style.
With diagramming
For hybrid needs (visual + precise data):
- Generate AI background/illustration with image-generation
- Overlay precise elements with diagramming
- Composite manually if needed
Limitations
| Limitation | Workaround |
|---|---|
| Style transfer fails ("Studio Ghibli style") | Use structural descriptions instead |
| Text generation imperfect | Add text as overlay after generation |
| Exact positioning difficult | Iterate with refinement prompts |
| Rate limits | Use Flash for drafts, Pro only for finals |
Output Location
Images save to ./images/ in the project directory:
- Created on first use
- Timestamped filenames for uniqueness
- Stays with project for easy reference
Anti-Patterns
| Pattern | Problem | Fix |
|---|---|---|
| Skip brand check | Inconsistent styling | Load itv-styling first when brand applies |
| Vague prompts | Poor results | Use specific, concrete descriptions |
| Wrong tool for data | Inaccurate charts | Use diagram skill for precise data |
See Also
references/prompting.md— Extended prompting referencediagrammingskill — For precise diagrams and chartsitv-stylingskill — For brand-constrained outputs
More from spm1001/claude-suite
server-checkup
Systematic Linux server management. Use BEFORE making changes to capture baseline, then AFTER for verification. Triggers on 'check this server', 'audit', 'set up this machine', 'security audit', 'harden this Pi', 'fresh Pi setup', 'provision this server'. (user)
33diagram
Orchestrates diagram creation with iterative render-and-check workflow. Invoke FIRST when asked to 'create a diagram', 'Venn diagram', 'flow chart', 'architecture diagram', 'visualize this' — ensures CRAP-principled self-critique before showing user, preventing amateur layouts and low-contrast text. Composes with brand skills for styling. (user)
32screenshot
>
24filing
Orchestrates file cleanup with mandatory processing — reads content and extracts actions BEFORE moving files. Prevents lost waiting-fors and buried actions. MANDATORY during weekly review filing phase. Triggers on 'where should this go', 'help me tidy', 'clean up downloads', 'triage inbox'. (user)
20setup
>
16ia-presenter
Writes and validates iA Presenter markdown where tab-indentation controls slide visibility. Triggers on 'presentation', 'iA Presenter', 'slides from markdown', '.presenter file', or when working in iCloud~net~ia~presenter folder. Validates tab characters (not spaces) for visible text, --- slide separators, and image metadata syntax. (user)
15