fal-qwen-image-edit
Required inputs
- A text prompt describing the desired result.
- One or more reference images (--ref), as local files or URLs. Typically this is one “base” image to edit plus optional extra reference/style images.
- A fal.ai API key in workers.jsonc (fal.key).
Optional inputs
- Output sizes / aspect ratios:
  - --image-size <preset> (repeatable) or --sizes a,b,c (comma-separated presets)
  - --size <WxH> (repeatable pixel-size hint; mapped to the closest supported preset)
  - Presets: square_hd, square, portrait_4_3, landscape_4_3, portrait_16_9, landscape_16_9
- --negative-prompt, --seed, --num-images, and other quality controls supported by the script.
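To make the "closest supported preset" idea concrete, here is a minimal sketch of how a WxH hint could be mapped to one of the presets listed above. It is not the bundled script's actual logic, which is not shown here. The function name `closestPreset`, the preset pixel dimensions, and the matching rule (closest aspect ratio, then closest area) are all assumptions for illustration.

```javascript
// Assumed preset dimensions; verify against the fal.ai model reference
// before relying on them. Only the preset names come from this document.
const PRESETS = {
  square_hd: [1024, 1024],
  square: [512, 512],
  portrait_4_3: [768, 1024],
  landscape_4_3: [1024, 768],
  portrait_16_9: [576, 1024],
  landscape_16_9: [1024, 576],
};

// Hypothetical helper: parse "WxH" and return the preset whose aspect ratio
// is closest in log space; ties are broken by the closest pixel area.
function closestPreset(size) {
  const match = /^(\d+)x(\d+)$/i.exec(String(size).trim());
  if (!match) throw new Error(`Expected WxH, got "${size}"`);
  const width = Number(match[1]);
  const height = Number(match[2]);
  if (width === 0 || height === 0) throw new Error("Width and height must be positive");

  const targetRatio = Math.log(width / height);
  const targetArea = width * height;
  const EPSILON = 1e-9; // tolerance for floating-point ratio comparisons

  let best = null;
  for (const [name, [w, h]] of Object.entries(PRESETS)) {
    const ratioDiff = Math.abs(Math.log(w / h) - targetRatio);
    const areaDiff = Math.abs(w * h - targetArea);
    const betterRatio = best !== null && ratioDiff < best.ratioDiff - EPSILON;
    const sameRatio = best !== null && Math.abs(ratioDiff - best.ratioDiff) <= EPSILON;
    if (best === null || betterRatio || (sameRatio && areaDiff < best.areaDiff)) {
      best = { name, ratioDiff, areaDiff };
    }
  }
  return best.name;
}
```

Under these assumptions, `closestPreset("1920x1080")` resolves to landscape_16_9 and `closestPreset("1024x1024")` to square_hd. Check the preset the script actually picked in the run's request log rather than trusting this sketch.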
Workflow
- Prepare inputs
  - Collect the base image and any extra references (PNG/JPEG recommended).
  - Write a prompt that is specific and testable:
    - Subject: what should be preserved/changed from the base image.
    - Composition: angle, crop, framing, background.
    - Lighting + color: time of day, soft/hard light, mood.
    - Style: photoreal, studio product shot, cinematic, etc.
  - If you need to avoid certain elements (logos, text, hands, extra objects), use --negative-prompt.
- Run the generator
  - Use the bundled script to call fal.ai.
  - Final deliverables go to artifacts/; request/response logs and intermediate files stay in logs/.
  - Basic:
    node skills/car-ads-designer/fal-qwen-image-edit/scripts/generate.mjs --prompt "..." --ref path/to/base.jpg
  - Multiple sizes (one call, multiple requests):
    node skills/car-ads-designer/fal-qwen-image-edit/scripts/generate.mjs --prompt "..." --ref base.jpg --image-size square_hd --image-size landscape_16_9
  - Pixel-size hints (mapped to presets):
    node skills/car-ads-designer/fal-qwen-image-edit/scripts/generate.mjs --prompt "..." --ref base.jpg --size 1024x1024 --size 1920x1080
- Stop and request review (required)
  - After generation completes, take no further action (no auto-retouching, no upscaling, no additional variants) until a human or designated reviewer confirms the outputs are acceptable.
- Review outputs
  - Inspect the final images under artifacts/... first.
  - If something looks off, use the working folder in logs/... to debug (it includes request/response payloads).
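The prompt guidance in the "Prepare inputs" step (subject, composition, lighting, style, plus elements to avoid) can be sketched as a small helper that assembles the argument list for generate.mjs. This is only an illustration. `buildArgs` and its field names are hypothetical, and it assumes that --ref can be repeated and that --negative-prompt accepts a comma-separated phrase list. Neither is confirmed by this document.

```javascript
// Hypothetical helper: join the prompt facets into one prompt string and
// build an argument array using only the flags named in this document.
function buildArgs({ subject, composition, lighting, style, avoid = [], refs = [], imageSizes = [] }) {
  if (refs.length === 0) throw new Error("At least one --ref image is required");

  const parts = [subject, composition, lighting, style]
    .filter((part) => typeof part === "string" && part.trim() !== "")
    // Trim whitespace and trailing periods so the facets join cleanly.
    .map((part) => part.trim().replace(/[.\s]+$/, ""));
  if (parts.length === 0) throw new Error("The prompt needs at least one facet");

  const args = ["--prompt", parts.join(". ") + "."];
  for (const ref of refs) args.push("--ref", ref); // assumption: --ref is repeatable
  if (avoid.length > 0) {
    // Assumption: a comma-separated list is an acceptable negative prompt.
    args.push("--negative-prompt", avoid.join(", "));
  }
  for (const preset of imageSizes) args.push("--image-size", preset);
  return args;
}
```

Passing the resulting array to a process-spawning call, rather than building one shell string, avoids quoting problems with multi-word prompts. After the run, the stop-and-review step still applies before anything else is generated.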
Outputs
- Working folder under logs/fal-qwen-image-edit/<run-id>/ containing inputs and full request/response logs.
- Final folder under artifacts/fal-qwen-image-edit/<run-id>/ containing:
  - README.md (what was generated + where the working logs are)
  - One or more final images (and per-size subfolders when multiple sizes are requested)
Definition of done
- The script completes without errors and writes at least one image file under artifacts/.
- The output matches the prompt intent and aligns with the provided reference image(s).
- A human (or designated agent reviewer) has reviewed the outputs and explicitly approved any next steps.
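Only the first criterion above can be checked mechanically; the other two need a reviewer. As a rough sketch, the mechanical part could look like the following. The helper names and the list of image extensions are assumptions, and the caller is expected to supply the script's exit code and the run's file paths.

```javascript
// Assumed set of output formats; adjust to whatever the script really writes.
const IMAGE_EXTENSIONS = [".png", ".jpg", ".jpeg", ".webp"];

// Hypothetical helper: from a run's relative file paths, keep the image
// files that live under artifacts/.
function finalImages(paths) {
  return paths.filter((filePath) => {
    const normalized = filePath.replace(/\\/g, "/").toLowerCase();
    return (
      normalized.startsWith("artifacts/") &&
      IMAGE_EXTENSIONS.some((ext) => normalized.endsWith(ext))
    );
  });
}

// Covers only the automatable criterion: a clean exit and at least one
// final image. Prompt fidelity and approval remain human decisions.
function automatedChecksPass({ exitCode, paths }) {
  return exitCode === 0 && finalImages(paths).length > 0;
}
```

A passing result here does not mean the run is done; it only means the outputs are ready to hand to the reviewer.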
Safety / quality checklist
- Do not commit API keys; keep them in local-only workers.jsonc and avoid printing them.
- Avoid generating copyrighted logos/marks you don’t have rights to use.
- Treat rendered text as unreliable; avoid high-stakes text (prices, legal claims) unless you will manually verify/correct.
- Don’t include sensitive personal data in prompts or images unless you have permission and a clear need.