audio-language-models
Audio Language Models
Build real-time voice agents and audio-processing pipelines with the latest native speech-to-speech models.
Overview
- Real-time voice assistants and agents
- Live conversational AI (phone agents, support bots)
- Audio transcription with speaker diarization
- Multilingual voice interactions
- Text-to-speech generation
- Voice-to-voice translation