tts
Installation
SKILL.md
When to Use
- User wants to convert text to spoken audio
- User asks for "read aloud", "TTS", "text to speech", "voice narration"
- User says "朗读", "配音", "语音合成"
- User wants multi-speaker scripted audio or dialogue
When NOT to Use
- User wants a podcast-style discussion with topic exploration (use
/podcast) - User wants an explainer video with visuals (use
/explainer) - User wants to generate an image (use
/image-gen)
Purpose
Convert text into natural-sounding speech audio. Two paths:
- Quick mode (
--mode direct): Single voice, low-latency, sync. For casual chat, reading snippets, instant audio. - Script mode (
--mode smart): Multi-speaker, per-segment voice assignment. For dialogue, audiobooks, scripted content.