elevenlabs-tts
SKILL.md
ElevenLabs Text-to-Speech
Premium text-to-speech with 22+ voices via inference.sh CLI.

Quick Start
Requires inference.sh CLI (
infsh). Install instructions
infsh login
# Generate speech with ElevenLabs
infsh app run elevenlabs/tts --input '{"text": "Hello, welcome to our product demo.", "voice": "aria"}'
Available Models
| Model | ID | Best For | Latency |
|---|---|---|---|
| Multilingual v2 | eleven_multilingual_v2 |
Highest quality, 32 languages | ~250ms |
| Turbo v2.5 | eleven_turbo_v2_5 |
Balance of speed & quality | ~150ms |
| Flash v2.5 | eleven_flash_v2_5 |
Ultra-low latency | ~75ms |
Voice Library
Female Voices
| Voice | Style |
|---|---|
aria |
American, conversational |
alice |
British, confident |
bella |
American, warm |
jessica |
American, expressive |
laura |
American, professional |
lily |
British, soft |
sarah |
American, friendly |
Male Voices
| Voice | Style |
|---|---|
george |
British, authoritative |
adam |
American, deep |
bill |
American, mature |
brian |
American, conversational |
callum |
Transatlantic, intense |
charlie |
Australian, natural |
chris |
American, casual |
daniel |
British, commanding |
eric |
American, friendly |
harry |
American, young |
liam |
American, articulate |
matilda |
American, warm |
river |
American, confident |
roger |
American, authoritative |
will |
American, bright |
Examples
Basic Speech
infsh app run elevenlabs/tts --input '{"text": "Welcome to our quarterly earnings presentation.", "voice": "george"}'
Choose a Model
# Highest quality
infsh app run elevenlabs/tts --input '{
"text": "This is our premium multilingual model with the best quality.",
"voice": "aria",
"model": "eleven_multilingual_v2"
}'
# Ultra-fast for real-time applications
infsh app run elevenlabs/tts --input '{
"text": "Flash model for low-latency applications.",
"voice": "brian",
"model": "eleven_flash_v2_5"
}'
Voice Tuning
infsh app run elevenlabs/tts --input '{
"text": "Fine-tune the voice characteristics for your use case.",
"voice": "bella",
"stability": 0.3,
"similarity_boost": 0.9,
"style": 0.4
}'
| Parameter | Range | Effect |
|---|---|---|
stability |
0-1 | Higher = more consistent, lower = more expressive |
similarity_boost |
0-1 | Higher = closer to original voice character |
style |
0-1 | Higher = more style exaggeration |
use_speaker_boost |
true/false | Enhances speaker clarity |
Output Formats
# High-quality MP3
infsh app run elevenlabs/tts --input '{
"text": "High quality audio output.",
"voice": "daniel",
"output_format": "mp3_44100_192"
}'
| Format | Description |
|---|---|
mp3_44100_128 |
MP3 at 44.1kHz, 128kbps (default) |
mp3_44100_192 |
MP3 at 44.1kHz, 192kbps |
pcm_16000 |
Raw PCM at 16kHz |
pcm_22050 |
Raw PCM at 22.05kHz |
pcm_24000 |
Raw PCM at 24kHz |
pcm_44100 |
Raw PCM at 44.1kHz |
Multilingual
ElevenLabs supports 32 languages including English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese, Korean, Arabic, Hindi, Russian, and more.
# Spanish
infsh app run elevenlabs/tts --input '{
"text": "Hola, bienvenidos a nuestra presentación.",
"voice": "aria",
"model": "eleven_multilingual_v2"
}'
# French
infsh app run elevenlabs/tts --input '{
"text": "Bonjour, bienvenue à notre démonstration.",
"voice": "alice",
"model": "eleven_multilingual_v2"
}'
Voice + Video Workflow
# 1. Generate voiceover
infsh app run elevenlabs/tts --input '{
"text": "Introducing the future of AI-powered content creation.",
"voice": "george"
}' > voiceover.json
# 2. Create talking head video
infsh app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "<audio-url-from-step-1>"
}'
Use Cases
- Voiceovers: Product demos, explainer videos, commercials
- Audiobooks: Long-form narration with consistent voices
- Podcasts: AI hosts with natural delivery
- E-learning: Course narration in multiple languages
- Accessibility: High-quality screen reader content
- IVR: Professional phone system messages
- Video Narration: Documentary and social media content
Related Skills
# ElevenLabs multi-speaker dialogue
npx skills add inference-sh/skills@elevenlabs-dialogue
# ElevenLabs voice changer
npx skills add inference-sh/skills@elevenlabs-voice-changer
# ElevenLabs sound effects
npx skills add inference-sh/skills@elevenlabs-sound-effects
# All TTS models (Kokoro, DIA, Chatterbox, and more)
npx skills add inference-sh/skills@text-to-speech
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@infsh-cli
Browse all audio apps: infsh app list --category audio
Weekly Installs
12.5K
Repository
inferen-sh/skillsGitHub Stars
159
First Seen
1 day ago
Security Audits
Installed on
claude-code10.1K
github-copilot8.8K
gemini-cli8.8K
codex8.8K
kimi-cli8.8K
amp8.8K