openai-image-gen
OpenAI Image Gen
Generate a handful of “random but structured” prompts and render them via the OpenAI Images API.
Run
python3 {baseDir}/scripts/gen.py
open ~/Projects/tmp/openai-image-gen-*/index.html # if ~/Projects/tmp exists; else ./tmp/...
Useful flags:
# GPT image models with various options
python3 {baseDir}/scripts/gen.py --count 16 --model gpt-image-1
python3 {baseDir}/scripts/gen.py --prompt "ultra-detailed studio photo of a lobster astronaut" --count 4
python3 {baseDir}/scripts/gen.py --size 1536x1024 --quality high --out-dir ./out/images
python3 {baseDir}/scripts/gen.py --model gpt-image-1.5 --background transparent --output-format webp
# DALL-E 3 (note: count is automatically limited to 1)
python3 {baseDir}/scripts/gen.py --model dall-e-3 --quality hd --size 1792x1024 --style vivid
python3 {baseDir}/scripts/gen.py --model dall-e-3 --style natural --prompt "serene mountain landscape"
# DALL-E 2
python3 {baseDir}/scripts/gen.py --model dall-e-2 --size 512x512 --count 4
Model-Specific Parameters
Different models support different parameter values. The script automatically selects appropriate defaults based on the model.
Size
- GPT image models (gpt-image-1, gpt-image-1-mini, gpt-image-1.5): 1024x1024, 1536x1024 (landscape), 1024x1536 (portrait), or auto
  - Default: 1024x1024
- dall-e-3: 1024x1024, 1792x1024, or 1024x1792
  - Default: 1024x1024
- dall-e-2: 256x256, 512x512, or 1024x1024
  - Default: 1024x1024
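The size rules above can be sketched as a small lookup table. This is only an illustration of the documented values, not the actual code in gen.py; the names ALLOWED_SIZES, model_family, and resolve_size are hypothetical.

```python
# Hypothetical sketch: per-model size validation using the values listed above.
GPT_IMAGE_MODELS = {"gpt-image-1", "gpt-image-1-mini", "gpt-image-1.5"}

ALLOWED_SIZES = {
    "gpt-image": {"1024x1024", "1536x1024", "1024x1536", "auto"},
    "dall-e-3": {"1024x1024", "1792x1024", "1024x1792"},
    "dall-e-2": {"256x256", "512x512", "1024x1024"},
}

# Every model family in the list above defaults to the same size.
DEFAULT_SIZE = "1024x1024"


def model_family(model):
    """Map a model name to the key used in ALLOWED_SIZES."""
    if model in GPT_IMAGE_MODELS:
        return "gpt-image"
    if model in ALLOWED_SIZES:
        return model
    raise ValueError(f"unknown model: {model}")


def resolve_size(model, size=None):
    """Return the requested size if the model supports it, else the default."""
    family = model_family(model)
    if size is None:
        return DEFAULT_SIZE
    if size not in ALLOWED_SIZES[family]:
        allowed = ", ".join(sorted(ALLOWED_SIZES[family]))
        raise ValueError(f"{model} does not support size {size}; use one of: {allowed}")
    return size
```

Failing early on an unsupported size gives a clearer message than waiting for the API to reject the request.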
Quality
- GPT image models: auto, high, medium, or low
  - Default: high
- dall-e-3: hd or standard
  - Default: standard
- dall-e-2: standard only
  - Default: standard
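As a rough sketch of how a model-dependent quality default could be chosen, the snippet below encodes the list above. The names QUALITY_RULES and resolve_quality are hypothetical, and matching GPT image models by the "gpt-image" prefix is an assumption made for brevity; the real script may detect models differently.

```python
# Hypothetical sketch: (allowed values, default) per model family, from the list above.
QUALITY_RULES = {
    "gpt-image": (("auto", "high", "medium", "low"), "high"),
    "dall-e-3": (("hd", "standard"), "standard"),
    "dall-e-2": (("standard",), "standard"),
}


def resolve_quality(model, quality=None):
    """Pick the documented default, or validate an explicit --quality value."""
    # Assumption: any model name starting with "gpt-image" is a GPT image model.
    family = "gpt-image" if model.startswith("gpt-image") else model
    if family not in QUALITY_RULES:
        raise ValueError(f"unknown model: {model}")
    allowed, default = QUALITY_RULES[family]
    if quality is None:
        return default
    if quality not in allowed:
        raise ValueError(f"{model} does not support quality {quality}; use one of: {', '.join(allowed)}")
    return quality
```

For example, resolve_quality("dall-e-3") falls back to standard, while resolve_quality("dall-e-3", "high") is rejected because high is only valid for GPT image models.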
Other Notable Differences
- dall-e-3 only supports generating 1 image at a time (n=1). The script automatically limits count to 1 when using this model.
- GPT image models support additional parameters:
  - --background: transparent, opaque, or auto (default)
  - --output-format: png (default), jpeg, or webp
  - Note: stream and moderation are available via API but not yet implemented in this script
- dall-e-3 has a --style parameter: vivid (hyper-real, dramatic) or natural (more natural looking)
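The differences above amount to filtering the request parameters by model. A minimal sketch follows, assuming the request is assembled as a plain dict before being sent; build_request_params is a hypothetical helper, not a function from gen.py, and the "gpt-image" prefix check is likewise an assumption.

```python
def build_request_params(model, prompt, count=1, background=None,
                         output_format=None, style=None):
    """Assemble request parameters, dropping options the model does not accept."""
    params = {"model": model, "prompt": prompt, "n": count}

    if model == "dall-e-3":
        # dall-e-3 generates one image per request, so the count is clamped.
        params["n"] = 1
        if style is not None:
            params["style"] = style  # vivid or natural
    elif model.startswith("gpt-image"):
        # Only GPT image models accept background and output format.
        if background is not None:
            params["background"] = background  # transparent, opaque, or auto
        if output_format is not None:
            params["output_format"] = output_format  # png, jpeg, or webp

    return params
```

Silently dropping an unsupported option keeps the sketch short; a stricter variant could raise an error instead so that a mistyped flag is not ignored.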
Output
- *.png, *.jpeg, or *.webp images (output format depends on model + --output-format)
- prompts.json (prompt → file mapping)
- index.html (thumbnail gallery)
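To make the output layout concrete, here is one way the two index files could be written. The JSON shape (a list of prompt/file objects) and the HTML markup are assumptions for illustration; the files produced by gen.py may be structured differently, and write_gallery is a hypothetical name.

```python
import html
import json
from pathlib import Path


def write_gallery(out_dir, items):
    """Write prompts.json and index.html for (prompt, filename) pairs."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    # Assumed shape: one {"prompt": ..., "file": ...} object per image.
    mapping = [{"prompt": prompt, "file": name} for prompt, name in items]
    (out / "prompts.json").write_text(json.dumps(mapping, indent=2), encoding="utf-8")

    # One thumbnail per image; prompts are escaped so they cannot break the markup.
    figures = []
    for prompt, name in items:
        src = html.escape(name, quote=True)
        caption = html.escape(prompt)
        figures.append(
            f'<figure><img src="{src}" width="256" alt=""><figcaption>{caption}</figcaption></figure>'
        )
    page = "<!doctype html>\n<meta charset='utf-8'>\n<title>openai-image-gen</title>\n"
    page += "\n".join(figures) + "\n"
    (out / "index.html").write_text(page, encoding="utf-8")
    return mapping
```

Image paths are kept relative to the output directory so the gallery still opens correctly if the folder is moved.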