Image Generation Skill

This skill enables AI-powered image generation, editing, and asset creation using Google Gemini and OpenAI GPT-Image.

When to Use

Activate this skill when the user wants to:

Generate images from text descriptions
Edit or modify existing images
Create project assets (icons, favicons, social images)
Generate design inspiration (moodboards)
Create consistent character designs
Compare different AI image providers

Available Commands

Command	Use For
`/imagegen:generate`	Generate images from prompts
`/imagegen:edit`	Edit existing images
`/imagegen:iterate`	Refine images through multiple steps
`/imagegen:compare`	Compare Google vs OpenAI
`/imagegen:assets`	Generate project assets
`/imagegen:moodboard`	Create design inspiration sets
`/imagegen:character`	Create consistent character sheets
`/imagegen:config`	Configure defaults

Delegation

For complex image generation tasks, delegate to the image-generator subagent, which has access to all generation scripts and can handle multi-step workflows.

Quick Reference

Providers

Google Gemini

Models: gemini-2.5-flash-image, gemini-3-pro-image-preview
Best for: Character consistency, multi-turn iteration, style variety
API Key: GEMINI_API_KEY or GOOGLE_API_KEY

OpenAI GPT-Image

Models: gpt-image-1.5, gpt-image-1, gpt-image-1-mini
Best for: Text in images, transparent backgrounds, precise edits
API Key: OPENAI_API_KEY
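
The provider notes above can be turned into a small lookup when deciding which provider to suggest. The following is only a sketch: the strengths and environment variable names come from the lists above, while the function names `available_providers` and `suggest_provider` and the short provider labels `google` and `openai` are hypothetical and not part of the skill's scripts.

```python
import os

# Strengths copied from the "Best for" lines above (lowercased for matching).
PROVIDER_STRENGTHS = {
    "google": {"character consistency", "multi-turn iteration", "style variety"},
    "openai": {"text in images", "transparent backgrounds", "precise edits"},
}

# Environment variables named in the "API Key" lines above.
PROVIDER_KEYS = {
    "google": ("GEMINI_API_KEY", "GOOGLE_API_KEY"),
    "openai": ("OPENAI_API_KEY",),
}


def available_providers(env=None):
    """Return the providers that have at least one non-empty API key set."""
    env = os.environ if env is None else env
    return [
        name
        for name, keys in PROVIDER_KEYS.items()
        if any(env.get(key) for key in keys)
    ]


def suggest_provider(need, env=None):
    """Return a configured provider whose listed strengths include `need`, else None."""
    need = need.strip().lower()
    for name in available_providers(env):
        if need in PROVIDER_STRENGTHS[name]:
            return name
    return None
```

For example, `suggest_provider("text in images")` returns `"openai"` when `OPENAI_API_KEY` is set, and `None` when no configured provider lists that strength.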

Common Sizes/Aspect Ratios

Format	Google	OpenAI
Square	1:1	1024x1024
Landscape	16:9	1536x1024
Portrait	9:16	1024x1536
Wide	21:9	-
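
Because Google takes an aspect ratio and OpenAI takes pixel dimensions, a format name has to be translated per provider. A minimal sketch of that translation, using only the values in the table above, could look like this. The names `SIZES` and `size_for` are hypothetical helpers, and how the resulting value is passed to each provider's API is not shown here.

```python
# Values copied from the table above; None marks a combination the table lists as "-".
SIZES = {
    "square": {"google": "1:1", "openai": "1024x1024"},
    "landscape": {"google": "16:9", "openai": "1536x1024"},
    "portrait": {"google": "9:16", "openai": "1024x1536"},
    "wide": {"google": "21:9", "openai": None},
}


def size_for(fmt, provider):
    """Look up the aspect ratio (Google) or pixel size (OpenAI) for a format name."""
    try:
        value = SIZES[fmt.lower()][provider.lower()]
    except KeyError:
        raise ValueError(f"unknown format or provider: {fmt!r}, {provider!r}") from None
    if value is None:
        raise ValueError(f"{provider} has no {fmt} size in the table")
    return value
```

Raising an error for the missing Wide/OpenAI combination, rather than silently falling back to Landscape, keeps the choice of fallback with the caller.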

Example Interactions

User: "Generate an image of a sunset over mountains"
Action: Use /imagegen:generate --prompt "A sunset over mountains"

User: "Create app icons for my project"
Action: Use /imagegen:assets --type icons --prompt "[ask for description]"

User: "Edit this image to add rain"
Action: Use /imagegen:edit --image [path] --prompt "Add rain falling"

User: "I want to iterate on this design"
Action: Use /imagegen:iterate --image [path] --prompt "[refinement]"

User: "Which provider would be better for logos?"
Action: Explain that Google is better for style variety and OpenAI for text in images, then suggest /imagegen:compare to test both.

Prerequisites Check

Before generating, verify:

Required Python packages: google-genai, openai, Pillow (for resizing)
API keys set in environment
Output directory accessible

# Install packages
pip install google-genai openai Pillow

# Set API keys (user's responsibility)
export GEMINI_API_KEY=your_key
export OPENAI_API_KEY=your_key
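
The three checks above can also be run from Python before any generation is attempted. This is a sketch under stated assumptions: the function names are hypothetical, the import names `google.genai` and `PIL` are the modules that the `google-genai` and `Pillow` packages install, and the writability check creates the output directory if it does not exist yet.

```python
import importlib.util
import os
import tempfile

# pip package name -> importable module name
REQUIRED_MODULES = {"google-genai": "google.genai", "openai": "openai", "Pillow": "PIL"}


def missing_packages():
    """Return the pip names of required packages that cannot be located."""
    missing = []
    for package, module in REQUIRED_MODULES.items():
        try:
            found = importlib.util.find_spec(module) is not None
        except (ImportError, ValueError):
            # find_spec raises if a parent package (e.g. "google") is absent.
            found = False
        if not found:
            missing.append(package)
    return missing


def configured_keys(env=None):
    """Report, per provider, whether one of its API key variables is non-empty."""
    env = os.environ if env is None else env
    return {
        "google": bool(env.get("GEMINI_API_KEY") or env.get("GOOGLE_API_KEY")),
        "openai": bool(env.get("OPENAI_API_KEY")),
    }


def output_dir_writable(path):
    """Create `path` if needed and confirm a file can be written inside it."""
    try:
        os.makedirs(path, exist_ok=True)
        with tempfile.TemporaryFile(dir=path):
            pass
        return True
    except OSError:
        return False
```

Only one provider key is needed to generate with that provider, so a caller might treat `configured_keys()` as a warning unless both values are false.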

Prompt Tips

Help users craft effective prompts:

Be descriptive but concise
Specify style (photorealistic, watercolor, minimalist)
Include lighting (golden hour, dramatic, soft)
Mention composition (close-up, wide shot, centered)
For characters, include distinctive features
For logos, specify simplicity level
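
One way to apply these tips consistently is to assemble the prompt from named parts. The helper below is a hypothetical sketch, not part of the skill: `build_prompt` and its parameter names are assumptions, and the comma-separated phrasing is just one reasonable convention.

```python
def build_prompt(subject, style=None, lighting=None, composition=None, details=()):
    """Join a subject with optional style, lighting, composition, and extra details."""
    parts = [subject.strip()]
    if style:
        parts.append(f"{style} style")
    if lighting:
        parts.append(f"{lighting} lighting")
    if composition:
        parts.append(composition)
    # Extra details cover things like distinctive character features.
    parts.extend(d.strip() for d in details if d.strip())
    return ", ".join(parts)


prompt = build_prompt(
    "A sunset over mountains",
    style="watercolor",
    lighting="golden hour",
    composition="wide shot",
)
print(prompt)
# prints: A sunset over mountains, watercolor style, golden hour lighting, wide shot
```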