openai-image-gen
SKILL.md
OpenAI Image Gen
Generate a handful of “random but structured” prompts and render them via the OpenAI Images API.
Run
python3 {baseDir}/scripts/gen.py
open ~/Projects/tmp/openai-image-gen-*/index.html # if ~/Projects/tmp exists; else ./tmp/...
Useful flags:
# GPT image models with various options
python3 {baseDir}/scripts/gen.py --count 16 --model gpt-image-1
python3 {baseDir}/scripts/gen.py --prompt "ultra-detailed studio photo of a lobster astronaut" --count 4
python3 {baseDir}/scripts/gen.py --size 1536x1024 --quality high --out-dir ./out/images
python3 {baseDir}/scripts/gen.py --model gpt-image-1.5 --background transparent --output-format webp
# DALL-E 3 (note: count is automatically limited to 1)
python3 {baseDir}/scripts/gen.py --model dall-e-3 --quality hd --size 1792x1024 --style vivid
python3 {baseDir}/scripts/gen.py --model dall-e-3 --style natural --prompt "serene mountain landscape"
# DALL-E 2
python3 {baseDir}/scripts/gen.py --model dall-e-2 --size 512x512 --count 4
Model-Specific Parameters
Different models support different parameter values. The script automatically selects appropriate defaults based on the model.
Size
- GPT image models (
gpt-image-1,gpt-image-1-mini,gpt-image-1.5):1024x1024,1536x1024(landscape),1024x1536(portrait), orauto- Default:
1024x1024
- Default:
- dall-e-3:
1024x1024,1792x1024, or1024x1792- Default:
1024x1024
- Default:
- dall-e-2:
256x256,512x512, or1024x1024- Default:
1024x1024
- Default:
Quality
- GPT image models:
auto,high,medium, orlow- Default:
high
- Default:
- dall-e-3:
hdorstandard- Default:
standard
- Default:
- dall-e-2:
standardonly- Default:
standard
- Default:
Other Notable Differences
- dall-e-3 only supports generating 1 image at a time (
n=1). The script automatically limits count to 1 when using this model. - GPT image models support additional parameters:
--background:transparent,opaque, orauto(default)--output-format:png(default),jpeg, orwebp- Note:
streamandmoderationare available via API but not yet implemented in this script
- dall-e-3 has a
--styleparameter:vivid(hyper-real, dramatic) ornatural(more natural looking)
Output
*.png,*.jpeg, or*.webpimages (output format depends on model +--output-format)prompts.json(prompt → file mapping)index.html(thumbnail gallery)