image-generation
Image Generation: Create images with Gemini
Overview
Generate high-quality images using Google Gemini's image generation models. Supports text-to-image generation and can optionally use existing images or videos as reference.
How to use
bash ${CLAUDE_PLUGIN_ROOT}/scripts/gemini.sh --model=gemini-3-pro-image-preview "YOUR IMAGE DESCRIPTION"
Arguments:
--model- Required for image generation: Usegemini-3-pro-image-preview--file- Optional: Reference image or video for style/content guidance- When relevant, explicitly ask the model to include or preserve the original image/composition in the output.
Examples:
# Generate an image from a text description
npx -y superconductor-gemini-skills --model=gemini-3-pro-image-preview "A serene mountain landscape at sunset with golden light"
# Generate an image inspired by an existing image
npx -y superconductor-gemini-skills --model=gemini-3-pro-image-preview --file=reference.jpg "Transform this into a watercolor painting"
# Keep the original composition while applying a new style
npx -y superconductor-gemini-skills --model=gemini-3-pro-image-preview --file=reference.jpg "Keep the original image composition and main subjects, but restyle it as a polished dark-mode mobile app mockup"
# Generate an image from a video
npx -y superconductor-gemini-skills --model=gemini-3-pro-image-preview --file=video.mp4 "Generate an image that captures the essence of this video"
# Create specific styles
npx -y superconductor-gemini-skills --model=gemini-3-pro-image-preview "A cyberpunk cityscape in neon colors, digital art style"
Output
Generated images are saved to the current directory as gemini-image-{timestamp}.png.
API Key
The GEMINI_API_KEY environment variable must be set. Get your key at: https://ai.google.dev/gemini-api/docs/api-key
Models
| Model ID | Context Window | Pricing |
|---|---|---|
gemini-3-pro-image-preview |
65k / 32k | $2 (Text) / $0.134 (Image) |
More from superconductor/superconductor-plugin-marketplace
video-understanding
Analyze and understand video content using Google Gemini. Supports local video files (mp4, mov, webm, avi, mkv) and YouTube links up to an hour long. Use this skill when you need to understand, summarize, or extract information from video content.
9video-generation
Generate videos using Google Veo models. Create 4-8 second videos from text prompts, with optional image-to-video and video extension capabilities. Veo 3.1 supports native audio generation including dialogue and sound effects.
5audio-understanding
Transcribe and analyze audio content using Google Gemini. Supports local audio files (mp3, wav, m4a, ogg, flac) and YouTube links up to 9.5 hours long. Use this skill when you need to transcribe, summarize, or extract information from audio content.
5x-api
Interact with the X (Twitter) API v2 using curl commands. Use this skill to look up user profiles, search recent posts, retrieve tweets, get user timelines, and explore other public X API v2 endpoints.
4text-to-speech
Convert text to natural-sounding speech using Google Gemini TTS models. Supports 30 different voices and 24 languages. Use this skill when you need to generate audio narration, voiceovers, or spoken content from text.
4gemini-consultation
Ask questions to Google Gemini AI models. Use this skill when you need a second opinion from another frontier AI model, want to analyze documents (PDFs, images), or need Gemini's perspective on any topic.
4