AI Image Generator

Generate images using AI APIs (Google Gemini and OpenAI GPT). This skill teaches the prompting patterns and API mechanics for producing professional images directly from Claude Code.

Managed alternative: If you don't want to manage API keys, ImageBot provides a managed image generation service with album templates and brand kit support.

Model Selection

Choose the right model for the job:

Need	Model	Why
Photorealistic scenes / stock photos	Gemini 3.1 Flash Image	Best depth, complexity, environmental context
Final client scenes (higher detail)	Gemini 3 Pro Image	Higher detail, better style consistency
Text on images (posters, OG with copy, infographics)	GPT Image 2	Text rendering actually works — including multi-script
10-variation style exploration	GPT Image 2	Native batch — one prompt, 10 variants sharing composition + palette
Multi-reference compositing (product + lifestyle)	GPT Image 2	Handles lighting, scale, perspective across references
Transparent icons / logos	GPT Image 1.5	Native RGBA alpha — GPT Image 2 cannot do transparency
Quick drafts / iteration	Gemini 2.5 Flash Image	Free tier (~500/day)

ai-image-generator

AI Image Generator

Model Selection