skills/tuziapi/tuzi-skills/tuzi-image-gen

tuzi-image-gen

SKILL.md

Image Generation (AI SDK)

Multi-provider image generation. Default provider: Tuzi (兔子API, api.tu-zi.com).

Script Directory

Agent Execution:

  1. SKILL_DIR = this SKILL.md file's directory
  2. Script path = ${SKILL_DIR}/scripts/main.ts

Step 0: Load Preferences ⛔ BLOCKING

CRITICAL: This step MUST complete BEFORE any image generation. Do NOT skip or defer.

0.1 Check API Key

Check if the selected provider's API key is available. For Tuzi (default):

# Check env, then .tuzi-skills/.env files
echo "${TUZI_API_KEY:-not_set}"
grep -s TUZI_API_KEY .tuzi-skills/.env "$HOME/.tuzi-skills/.env"
Result Action
Key found Continue to Step 0.2
Key NOT found ⛔ Run API key setup (see references/config/first-time-setup.md → "API Key Setup") → Store key → Then continue

CRITICAL: If API key is missing, MUST guide user to obtain and store it BEFORE any generation. Generation is BLOCKED until key is configured.

0.2 Check EXTEND.md

Check EXTEND.md existence (priority: project → user):

test -f .tuzi-skills/tuzi-image-gen/EXTEND.md && echo "project"
test -f "$HOME/.tuzi-skills/tuzi-image-gen/EXTEND.md" && echo "user"
Result Action
Found Load, parse, apply settings. If default_model.[provider] is null → ask model only (Flow 2)
Not found ⛔ Run first-time setup (references/config/first-time-setup.md) → Save EXTEND.md → Then continue

CRITICAL: If not found, complete the full setup (provider + model + quality + save location) using AskUserQuestion BEFORE generating any images. Generation is BLOCKED until EXTEND.md is created.

Path Location
.tuzi-skills/tuzi-image-gen/EXTEND.md Project directory
$HOME/.tuzi-skills/tuzi-image-gen/EXTEND.md User home

EXTEND.md Supports: Default provider | Default quality | Default aspect ratio | Default image size | Default models

Schema: references/config/preferences-schema.md

Usage

# Basic (uses Tuzi provider by default)
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image cat.png

# With aspect ratio
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A landscape" --image out.png --ar 16:9

# With quality (Tuzi: 1k/2k/4k)
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --quality 2k

# 4K VIP model
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --model gemini-3-pro-image-preview-4k-vip

# With reference images
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "Make it blue" --image out.png --ref source.png

# From prompt files
npx -y bun ${SKILL_DIR}/scripts/main.ts --promptfiles system.md content.md --image out.png

# Async model (auto-polls)
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --model gemini-3-pro-image-preview-2k-async

# Other providers
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider google
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider openai
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "一只可爱的猫" --image out.png --provider dashscope
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider replicate

Options

Option Description
--prompt <text>, -p Prompt text
--promptfiles <files...> Read prompt from files (concatenated)
--image <path> Output image path (required)
--provider tuzi|google|openai|dashscope|replicate Force provider (default: auto-detect, Tuzi first)
--model <id>, -m Model ID (see Tuzi Models section for full list)
--ar <ratio> Aspect ratio (e.g., 16:9, 1:1, 4:3). Tuzi converts to NxN format
--size <WxH> Size override (e.g., 1024x1024, 16x9)
--quality normal|2k Quality preset. Tuzi: maps to 1k/2k. Google: maps to 1K/2K
--imageSize 1K|2K|4K Image size (Tuzi and Google). Overrides --quality
--ref <files...> Reference images. Tuzi: base64 in JSON body. Google: multimodal. OpenAI: edits API
--n <count> Number of images
--json JSON output

Tuzi Models

Tuzi API (api.tu-zi.com) is the default provider. Models differ in quality, speed, and supported parameters.

Recommended

Model ID Alias Quality Notes
gemini-3-pro-image-preview nano-banana-pro --quality 1k/2k/4k Default. High quality, supports quality param
gemini-3.1-flash-image-preview nano-banana-2 --quality 1k/2k/4k Fast, supports extended aspect ratios
gemini-3-pro-image-preview-vip nano-banana-pro-vip 1k built-in High quality, VIP
gemini-3-pro-image-preview-2k-vip nano-banana-pro-2k-vip 2k built-in High quality 2K, VIP
gemini-3-pro-image-preview-4k-vip nano-banana-pro-4k-vip 4k built-in High quality 4K, VIP
gemini-2.5-flash-image-vip nano-banana-vip 1k built-in Fastest, VIP

More Models

Model ID Alias Notes
gemini-3-pro-image-preview nano-banana-pro --quality 1k/2k/4k
gemini-2.5-flash-image nano-banana Fast
gemini-3-pro-image-preview-hd nano-banana-pro-hd HD built-in
gemini-3-pro-image-preview-2k nano-banana-pro-2k 2K built-in
gemini-3-pro-image-preview-4k nano-banana-pro-4k 4K built-in
gpt-image-1.5 Size: 1:1, 3:2, 2:3 only
bfl-flux-2-pro flux-2-pro Flux
bfl-flux-2-max flux-2-max Flux highest quality
flux-kontext-pro kontext-pro Multi-ref editing
flux-kontext-max kontext-max Multi-ref editing (max)
doubao-seedream-4-0-250828 Seedream 4.0 2K/4K
doubao-seedream-4-5-251128 Seedream 4.5 2K/4K
doubao-seedream-5-0-260128 Seedream 5.0 lite 2K/3K

Async Models

Auto-detected. Script submits task and polls until complete (5s interval, max 30min).

Model ID Notes
gemini-3-pro-image-preview-async 1K async
gemini-3-pro-image-preview-2k-async 2K async
gemini-3-pro-image-preview-4k-async 4K async
mj-imagine Midjourney, MJ params in prompt

Model-Specific Parameters

Quality (--quality or --imageSize 1K|2K|4K):

Applies to Values Notes
gemini-3.1-flash-image-preview 1k / 2k / 4k Default model, quality adjustable
gemini-3-pro-image-preview 1k / 2k / 4k Quality adjustable
*-2k-vip, *-4k-vip, *-hd Quality built into model name, param ignored
Other models Param ignored

Aspect ratio (--ar):

Applies to Supported ratios
Gemini models (default) 1:1, 16:9, 9:16, 3:2, 2:3, 4:3, 3:4, 5:4, 4:5, 21:9
gemini-3.1-flash-image-preview Above + 1:4, 4:1, 1:8, 8:1 (extreme ratios)
gpt-image-1.5 1:1, 3:2, 2:3
Omitted Model auto-decides

Reference images (--ref):

  • Sync models: base64 data URL in JSON image field
  • Async models: input_reference in FormData
  • All Tuzi models support reference images

Environment Variables

Variable Description
TUZI_API_KEY Tuzi API key (https://api.tu-zi.com)
TUZI_IMAGE_MODEL Tuzi default model (default: gemini-3-pro-image-preview)
TUZI_BASE_URL Custom Tuzi endpoint (default: https://api.tu-zi.com/v1)
GOOGLE_API_KEY Google API key
OPENAI_API_KEY OpenAI API key
DASHSCOPE_API_KEY DashScope API key (阿里云)
REPLICATE_API_TOKEN Replicate API token
GOOGLE_IMAGE_MODEL Google model override
OPENAI_IMAGE_MODEL OpenAI model override
DASHSCOPE_IMAGE_MODEL DashScope model override
REPLICATE_IMAGE_MODEL Replicate model override
GOOGLE_BASE_URL Custom Google endpoint
OPENAI_BASE_URL Custom OpenAI endpoint
DASHSCOPE_BASE_URL Custom DashScope endpoint
REPLICATE_BASE_URL Custom Replicate endpoint

Load Priority: CLI args > EXTEND.md > env vars > <cwd>/.tuzi-skills/.env > ~/.tuzi-skills/.env

Model Resolution

Priority (highest → lowest), all providers:

  1. CLI: --model <id>
  2. EXTEND.md: default_model.[provider]
  3. Env var: <PROVIDER>_IMAGE_MODEL
  4. Built-in default

Agent MUST display model info before each generation:

  • Show: Using [provider] / [model]
  • Show switch hint: Switch model: --model <id> | EXTEND.md default_model.[provider] | env <PROVIDER>_IMAGE_MODEL

Provider Selection

  1. --provider specified → use it
  2. --ref provided + no --provider → Tuzi > Google > OpenAI > Replicate
  3. Only one API key available → use that provider
  4. Multiple available → Tuzi first

Quality Presets

Preset Tuzi Google OpenAI
normal 1k 1K 1024px
2k (default) 2k 2K 2048px

--imageSize 1K|2K|4K overrides quality for Tuzi and Google.

Generation Mode

Default: Sequential (one at a time).

Parallel: Only when user explicitly requests. Use Task tool with run_in_background=true, recommended 4 subagents (max 8).

Error Handling

  • Missing API key → ⛔ MUST run API key setup from Step 0.1 (guide user to https://api.tu-zi.com/token, store in .tuzi-skills/.env). Do NOT suggest GOOGLE_API_KEY or other provider keys unless user explicitly chose a different provider.
  • Generation failure → auto-retry once
  • Tuzi PROHIBITED_CONTENT → content rejection error
  • Tuzi NO_IMAGE → prompt too vague, suggest more explicit prompt
  • Async timeout → error after 30 minutes
  • Invalid aspect ratio → warning, proceed with default

Replicate Models

Format: owner/name or owner/name:version

npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider replicate
npx -y bun ${SKILL_DIR}/scripts/main.ts --prompt "A cat" --image out.png --provider replicate --model google/nano-banana

Extension Support

Custom configurations via EXTEND.md. See Step 0 for paths and supported options.

Weekly Installs
35
GitHub Stars
19
First Seen
9 days ago
Installed on
opencode34
gemini-cli34
github-copilot34
amp34
cline34
codex34