text-to-speech
Text-to-Speech: Generate audio from text with Gemini
Overview
Convert text to natural-sounding speech using Google Gemini's TTS models. Supports:
- 30 prebuilt voices with distinct characteristics
- 24 languages with automatic detection
- Single-speaker and multi-speaker audio
- Natural intonation and expression
Reference: https://ai.google.dev/gemini-api/docs/speech-generation
How to use
bash ${CLAUDE_PLUGIN_ROOT}/scripts/gemini.sh --model=gemini-2.5-flash-preview-tts "TEXT TO SPEAK"
More from superconductor/superconductor-plugin-marketplace
video-understanding
Analyze and understand video content using Google Gemini. Supports local video files (mp4, mov, webm, avi, mkv) and YouTube links up to an hour long. Use this skill when you need to understand, summarize, or extract information from video content.
9video-generation
Generate videos using Google Veo models. Create 4-8 second videos from text prompts, with optional image-to-video and video extension capabilities. Veo 3.1 supports native audio generation including dialogue and sound effects.
5audio-understanding
Transcribe and analyze audio content using Google Gemini. Supports local audio files (mp3, wav, m4a, ogg, flac) and YouTube links up to 9.5 hours long. Use this skill when you need to transcribe, summarize, or extract information from audio content.
5x-api
Interact with the X (Twitter) API v2 using curl commands. Use this skill to look up user profiles, search recent posts, retrieve tweets, get user timelines, and explore other public X API v2 endpoints.
4image-generation
Generate high-quality images using Google Gemini's Nano Banana Pro image model. Use this skill when you need to create images from text descriptions or transform existing images/videos into new artwork.
4gemini-consultation
Ask questions to Google Gemini AI models. Use this skill when you need a second opinion from another frontier AI model, want to analyze documents (PDFs, images), or need Gemini's perspective on any topic.
4