video-understanding
Video Understanding: Video analysis with Gemini
File support
This skill supports video analysis using Google Gemini models. Supported formats:
| Category | Extensions |
|---|---|
| Video | .mp4, .mov, .webm, .avi, .mkv |
- Local video files up to an hour long
- YouTube links (youtube.com/watch, youtu.be, youtube.com/embed)
Reference: https://ai.google.dev/gemini-api/docs/video-understanding
How to use
bash ${CLAUDE_PLUGIN_ROOT}/scripts/gemini.sh --file=VIDEO_PATH "YOUR QUESTION ABOUT THE VIDEO"
Arguments:
--file- Required: Local video file path or YouTube URL--model- Optional: Model to use (defaults togemini-3-flash-preview)
Examples:
# Analyze a local video
npx -y superconductor-gemini-skills --file=video.mp4 "Summarize this video"
npx -y superconductor-gemini-skills --file=presentation.mov "What are the key points discussed?"
# Analyze a YouTube video
npx -y superconductor-gemini-skills --file="https://www.youtube.com/watch?v=F0I5M4Pb85k" "Summarize the video"
npx -y superconductor-gemini-skills --file="https://youtu.be/F0I5M4Pb85k" "What happens at the 2 minute mark?"
# Find specific moments
npx -y superconductor-gemini-skills --file=video.mp4 "What is the best timestamp to represent this video's content?"
API Key
The GEMINI_API_KEY environment variable must be set. Get your key at: https://ai.google.dev/gemini-api/docs/api-key
Models
| Model ID | Context Window | Pricing (Input / Output) |
|---|---|---|
gemini-3-pro-preview |
1M / 64k | $2 / $12 (<200k), $4 / $18 (>200k) |
gemini-3-flash-preview |
1M / 64k | $0.50 / $3 |
gemini-2.5-pro |
1M / 65k | $1.25 / $10 (<200k), $2.50 / $15 (>200k) |
gemini-2.5-flash |
1M / 65k | $0.30 / $2.50 |
More from superconductor/superconductor-plugin-marketplace
video-generation
Generate videos using Google Veo models. Create 4-8 second videos from text prompts, with optional image-to-video and video extension capabilities. Veo 3.1 supports native audio generation including dialogue and sound effects.
5audio-understanding
Transcribe and analyze audio content using Google Gemini. Supports local audio files (mp3, wav, m4a, ogg, flac) and YouTube links up to 9.5 hours long. Use this skill when you need to transcribe, summarize, or extract information from audio content.
5x-api
Interact with the X (Twitter) API v2 using curl commands. Use this skill to look up user profiles, search recent posts, retrieve tweets, get user timelines, and explore other public X API v2 endpoints.
4image-generation
Generate high-quality images using Google Gemini's Nano Banana Pro image model. Use this skill when you need to create images from text descriptions or transform existing images/videos into new artwork.
4text-to-speech
Convert text to natural-sounding speech using Google Gemini TTS models. Supports 30 different voices and 24 languages. Use this skill when you need to generate audio narration, voiceovers, or spoken content from text.
4gemini-consultation
Ask questions to Google Gemini AI models. Use this skill when you need a second opinion from another frontier AI model, want to analyze documents (PDFs, images), or need Gemini's perspective on any topic.
4