voice-generation
Installation
SKILL.md
Voice Generation Skill
Generate realistic speech using AI (Google Gemini TTS, ElevenLabs, OpenAI TTS).
Prerequisites
At least one API key is required:
GOOGLE_API_KEY- For Google Gemini TTS (same key as video/image/music) ✅ELEVENLABS_API_KEY- For ElevenLabs high-quality voice synthesisOPENAI_API_KEY- For OpenAI TTS voices
Available APIs
Google Gemini TTS (Recommended - Same API Key)
- Best for: Podcasts, dialogues, audiobooks with style control
- Voices: 30 voices with natural language style control
- Multi-speaker: Up to 2 speakers for dialogues ✅
- Languages: 24 languages (auto-detected)
Related skills