nano-banana-pro
Nano Banana Pro (Gemini 3 Pro Image)
Generate or edit images using Gemini 3 Pro Image via AI Gateway or direct API.
Quick Start
Generate an image:
python3 {baseDir}/scripts/generate_image.py \
--prompt "your image description" \
--filename "output.png" \
--resolution 1K
Usage Examples
Generate Image
python3 {baseDir}/scripts/generate_image.py \
--prompt "a cute cat sitting on a wooden floor" \
--filename "cat.png"
Edit Image (requires GEMINI_API_KEY)
Note: Image editing currently requires direct Gemini API access.
export GEMINI_API_KEY="your-gemini-key"
python3 {baseDir}/scripts/generate_image.py \
--prompt "make it more colorful" \
--filename "cat-colorful.png" \
-i "cat.png"
Multi-image Composition (up to 14 images, requires GEMINI_API_KEY)
python3 {baseDir}/scripts/generate_image.py \
--prompt "combine these into one scene" \
--filename "combined.png" \
-i img1.png -i img2.png -i img3.png
API Key Configuration
The skill automatically detects API keys in this priority order:
AI_GATEWAY_API_KEY(recommended) - Uses AI GatewayGEMINI_API_KEY- Direct Gemini API access--api-keyargument - Manual key override
Recommended: AI Gateway (default)
export AI_GATEWAY_API_KEY="your-gateway-key" # Usually pre-configured
✅ No additional dependencies required ✅ Cost-efficient and unified API management ❌ Image editing not currently supported
Alternative: Direct Gemini API
export GEMINI_API_KEY="your-gemini-key"
✅ Supports image editing and multi-image composition
❌ Requires google-genai package: pip install google-genai
Options
--prompt, -p: Image description (required)--filename, -f: Output filename (required)--resolution, -r: Resolution (1K,2K,4K, default:1K)--input-image, -i: Input image(s) for editing (up to 14, requires GEMINI_API_KEY)--api-key, -k: Manual API key override
Notes
- Resolutions:
1K(default),2K,4K - Use descriptive filenames with timestamps:
2024-12-31-cat.png - The script outputs a
MEDIA:line for OpenClaw integration - AI Gateway mode works with standard Python 3, no extra tools needed
More from happycapy-ai/happycapy-skills
world-class-carousel
Generate world-class Instagram carousel content on any topic. Produces 7-10 publication-ready slides (1080x1350) with AI-generated visuals, precise typography, Instagram music recommendations, optimized captions, and hashtags. Uses Aristotelian first-principles framework with 7 content archetypes, 6 hook patterns, a mandatory Bullshit Test quality gate, and a comprehensive design system. Fully generalized -- works for ANY topic. Triggers: instagram carousel, create carousel, carousel post, make carousel slides, instagram slides, carousel content, slide deck for instagram, swipeable post
10latex-document
>
7find-skills
Helps users discover and install agent skills when they ask questions like "how do I do X", "find a skill for X", "is there a skill that can...", or express interest in extending capabilities. This skill should be used when the user is looking for functionality that might exist as an installable skill.
7redbook-creator-publish
小红书帖子创作与发布技能。用于:(1) 生成小红书风格的帖子内容(标题+正文+标签)(2) 生成帖子相关的配图 (3) 自动上传到小红书创作者平台(默认自动上传,失败时自动提供手动指引) (4) 生成本地预览HTML文件。触发词:小红书创作、create redbook、小红书、红书、笔记创作、帖子创作
7ai-video-generation
|
63d-web-experience
Expert in building 3D experiences for the web - Three.js, React Three Fiber, Spline, WebGL, and interactive 3D scenes. Covers product configurators, 3D portfolios, immersive websites, and bringing depth to web experiences. Use when: 3D website, three.js, WebGL, react three fiber, 3D experience.
6