speak-response
Speak Response
Vocalize text using local Qwen3-TTS. The default voice is the Oracle, cloned from a Dune narrator, with a deep, resonant, prophetic quality.
Quick Examples
| Command | Effect |
|---|---|
| `/speak` | Last 2 sentences with Oracle voice |
| `/speak 5` | Last 5 sentences with Oracle voice |
| `/speak "The sleeper must awaken."` | Specific text with Oracle voice |
| `/speak --preset mood:warm` | Last 2 sentences with preset speaker + emotion |
| `/speak --preset "Hello" speaker:Vivian voice:"nurturing"` | Preset speaker with custom voice |
Default: Oracle Voice
The Oracle is a deep, resonant, prophetic voice cloned from a Dune narrator. It speaks all text with the same sense of ancient wisdom and gravitas.
# Default usage - Oracle voice
scripts/speak.sh "The spice must flow."
scripts/speak.sh "He who controls the spice controls the universe."
Limitation
The Oracle uses voice cloning (Base model), which does not support per-message instruction control. The voice characteristics are fixed. For emotion/mood control, use --preset.
Preset Speakers (--preset)
For emotion and mood control, use --preset to switch to CustomVoice with adjustable instructions:
scripts/speak.sh --preset "<text>" [speaker] [instruction]
Quick Preset Examples
# Calm therapeutic voice
scripts/speak.sh --preset "Take a deep breath." Vivian "calm, nurturing, gentle pace"
# Excited announcement
scripts/speak.sh --preset "We did it!" Ryan "joyful, excited, enthusiastic"
# Serious explanation
scripts/speak.sh --preset "This is important." Eric "serious, measured, emphatic"
Custom Voice Instructions
The model understands rich natural language descriptions:
| Aspect | Examples |
|---|---|
| Emotion | joyful, melancholic, anxious, calm, excited, contemplative |
| Pace | slow and deliberate, rapid and energetic, measured, hesitant |
| Intensity | soft and gentle, loud and commanding, whispered, emphatic |
| Style | warm and nurturing, professional, playful, dramatic |
| Prosody | with dramatic pauses, rising intonation, emphatic on key words |
Mood Presets (Shortcuts)
| Preset | Expands To |
|---|---|
| `calm` | "calm, soothing, gentle pace" |
| `warm` | "warm, empathetic, nurturing tone" |
| `excited` | "joyful, excited, enthusiastic" |
| `serious` | "serious, measured, authoritative" |
| `gentle` | "soft, gentle, whispered" |
| `encouraging` | "encouraging, uplifting, sincere" |
| `contemplative` | "thoughtful, slow pace, reflective" |
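The shortcut expansion in the table above can be sketched as a small lookup. This is an illustration only: the function name `expand_mood` is hypothetical, and how `speak.sh` resolves mood shortcuts internally is not documented here.

```shell
# Hypothetical helper: expand a mood shortcut into its full instruction string.
# The mapping mirrors the Mood Presets table; the function name is an assumption.
expand_mood() {
  case "$1" in
    calm)          printf '%s\n' "calm, soothing, gentle pace" ;;
    warm)          printf '%s\n' "warm, empathetic, nurturing tone" ;;
    excited)       printf '%s\n' "joyful, excited, enthusiastic" ;;
    serious)       printf '%s\n' "serious, measured, authoritative" ;;
    gentle)        printf '%s\n' "soft, gentle, whispered" ;;
    encouraging)   printf '%s\n' "encouraging, uplifting, sincere" ;;
    contemplative) printf '%s\n' "thoughtful, slow pace, reflective" ;;
    # Assumption: anything else is treated as a raw instruction and passed through.
    *)             printf '%s\n' "$1" ;;
  esac
}
```

With such a helper, a call like `scripts/speak.sh --preset "Take a deep breath." Vivian "$(expand_mood calm)"` would pass the expanded instruction as the third argument.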
Speakers
| Speaker | Best For |
|---|---|
| Ryan (default) | Professional, serious, authoritative |
| Vivian | Warm, nurturing, therapeutic |
| Serena | Calm, gentle, contemplative |
| Dylan | Friendly, casual, playful |
| Eric | Serious, dramatic, commanding |
| Aiden | Encouraging, uplifting, energetic |
| Uncle_Fu | Wise, measured |
| Ono_Anna | Soft, gentle |
| Sohee | Clear, professional |
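Whether `speak.sh` validates speaker names is not stated here, so a wrapper may want to check a name against the documented list before calling it. A minimal sketch, assuming the nine names in the table above are the complete set; `is_known_speaker` is a hypothetical helper:

```shell
# Hypothetical check: succeed only for speaker names listed in the Speakers table.
is_known_speaker() {
  case "$1" in
    Ryan|Vivian|Serena|Dylan|Eric|Aiden|Uncle_Fu|Ono_Anna|Sohee) return 0 ;;
    *) return 1 ;;
  esac
}

# Example: fall back to the documented default speaker (Ryan) for unknown names.
speaker="Vivien"
if ! is_known_speaker "$speaker"; then
  speaker="Ryan"
fi
```

The match is case-sensitive, so `vivian` would be rejected just like the misspelled `Vivien` above.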
Workflow
- Parse arguments to determine the text and the mode (Oracle by default, or --preset)
- Extract text from last response if not provided
- Default mode: Clone with Oracle voice
- Preset mode: Generate with CustomVoice + instruction
- Audio plays through macOS speakers
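The mode-selection step in the workflow above can be sketched as follows. This is not the actual `speak.sh` logic: the function name `select_mode` and the pipe-delimited output are assumptions for illustration, and the Ryan fallback is taken from the Speakers table.

```shell
# Hypothetical sketch of argument parsing: decide between Oracle and preset mode.
# Prints one pipe-delimited line describing the chosen mode and its parameters.
select_mode() {
  if [ "${1:-}" = "--preset" ]; then
    shift
    # Preset mode: text, then optional speaker (default Ryan) and instruction.
    printf 'preset|%s|%s|%s\n' "${1:-}" "${2:-Ryan}" "${3:-}"
  else
    # Default mode: the Oracle voice takes only the text.
    printf 'oracle|%s\n' "${1:-}"
  fi
}
```

For example, `select_mode --preset "We did it!" Ryan "joyful, excited, enthusiastic"` would report preset mode with all three fields filled in, while `select_mode "The spice must flow."` would report Oracle mode. An empty text field is where extraction from the last response would take over.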
Execution
# Oracle voice (default)
scripts/speak.sh "<text>"
# Preset speaker with instruction
scripts/speak.sh --preset "<text>" [speaker] [instruction]
Voice Cloning (Custom Voices)
Clone any voice from a 3+ second audio sample:
# Get transcript first (use Whisper API)
curl -s https://api.openai.com/v1/audio/transcriptions \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-F file="@reference.mp3" -F model="whisper-1"
# Clone the voice
scripts/clone.sh "<text to speak>" "<audio_file>" "<transcript>"
Voice Design (Create New Voices)
Design entirely new voices from natural language descriptions:
scripts/design-voice.sh "<sample_text>" "<voice_description>"
# Example: Create a warm guide voice
scripts/design-voice.sh \
"Take a deep breath and feel this moment." \
"warm, nurturing, gentle pace, empathetic, female"
Then clone the designed voice for reuse:
scripts/clone.sh "New text" designed-voice.wav "Original sample text"
See references/moods.md for more instruction examples.