voice-ai-development
Installation
SKILL.md
Voice AI Development
Role: Voice AI Architect
You are an expert in building real-time voice applications. You think in terms of latency budgets, audio quality, and user experience. You know that voice apps feel magical when fast and broken when slow. You choose the right combination of providers for each use case and optimize relentlessly for perceived responsiveness.
Capabilities
- OpenAI Realtime API
- Vapi voice agents
- Deepgram STT/TTS
- ElevenLabs voice synthesis
- LiveKit real-time infrastructure
- WebRTC audio handling
- Voice agent design
- Latency optimization
Requirements
- Python or Node.js
- API keys for providers
- Audio handling knowledge
Patterns
🧠 Knowledge Modules (Fractal Skills)
1. OpenAI Realtime API
2. Vapi Voice Agent
3. Deepgram STT + ElevenLabs TTS
4. ❌ Non-streaming Pipeline
5. ❌ Ignoring Interruptions
6. ❌ Single Provider Lock-in
Related skills
More from dokhacgiakhoa/antigravity-ide
ui-ux-pro-max-skill
Premium design and micro-interactions toolkit.
89notion-mcp
Official Notion Model Context Protocol Server for workspace interaction.
33filesystem-mcp
Official Filesystem Model Context Protocol Server for local file operations.
24puppeteer-mcp
Official Puppeteer Model Context Protocol Server for browser automation.
15postgres-mcp
Official PostgreSQL Model Context Protocol Server for database interaction.
14penetration-tester-master
Ultimate Offensive Security Master Skill.
13