SKILLS LAUNCH PARTY
skills/inference-sh/skills/ai-podcast-creation

ai-podcast-creation

SKILL.md

AI Podcast Creation

Create AI-powered podcasts and audio content via inference.sh CLI.

Quick Start

curl -fsSL https://cli.inference.sh | sh && infsh login

# Generate podcast segment
infsh app run infsh/kokoro-tts --input '{
  "text": "Welcome to the AI Frontiers podcast. Today we explore the latest developments in generative AI.",
  "voice": "am_michael"
}'

Available Voices

Kokoro TTS

Voice ID Description Best For
af_sarah American female, warm Host, narrator
af_nicole American female, professional News, business
am_michael American male, authoritative Documentary, tech
am_adam American male, conversational Casual podcast
bf_emma British female, refined Audiobooks
bm_george British male, classic Formal content

DIA TTS (Conversational)

Voice ID Description Best For
dia-conversational Natural conversation Dialogue, interviews

Chatterbox

Voice ID Description Best For
chatterbox-default Expressive Casual, entertainment

Podcast Workflows

Simple Narration

# Single voice podcast segment
infsh app run infsh/kokoro-tts --input '{
  "text": "Your podcast script here. Make it conversational and engaging. Add natural pauses with punctuation.",
  "voice": "am_michael"
}'

Multi-Voice Conversation

# Host introduction
infsh app run infsh/kokoro-tts --input '{
  "text": "Welcome back to Tech Talk. Today I have a special guest to discuss AI developments.",
  "voice": "am_michael"
}' > host_intro.json

# Guest response
infsh app run infsh/kokoro-tts --input '{
  "text": "Thanks for having me. I am excited to share what we have been working on.",
  "voice": "af_sarah"
}' > guest_response.json

# Merge into conversation
infsh app run infsh/media-merger --input '{
  "audio_files": ["<host-url>", "<guest-url>"],
  "crossfade_ms": 500
}'

Full Episode Pipeline

# 1. Generate script with Claude
infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Write a 5-minute podcast script about the impact of AI on creative work. Format as a two-person dialogue between HOST and GUEST. Include natural conversation, questions, and insights."
}' > script.json

# 2. Generate intro music
infsh app run infsh/ai-music --input '{
  "prompt": "Podcast intro music, upbeat, modern, tech feel, 15 seconds"
}' > intro_music.json

# 3. Generate host segments
infsh app run infsh/kokoro-tts --input '{
  "text": "<host-lines>",
  "voice": "am_michael"
}' > host.json

# 4. Generate guest segments
infsh app run infsh/kokoro-tts --input '{
  "text": "<guest-lines>",
  "voice": "af_sarah"
}' > guest.json

# 5. Generate outro music
infsh app run infsh/ai-music --input '{
  "prompt": "Podcast outro music, matching intro style, fade out, 10 seconds"
}' > outro_music.json

# 6. Merge everything
infsh app run infsh/media-merger --input '{
  "audio_files": [
    "<intro-music>",
    "<host>",
    "<guest>",
    "<outro-music>"
  ],
  "crossfade_ms": 1000
}'

NotebookLM-Style Content

Generate podcast-style discussions from documents.

# 1. Extract key points
infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Read this document and create a podcast script where two hosts discuss the key points in an engaging, conversational way. Include questions, insights, and natural dialogue.\n\nDocument:\n<your-document-content>"
}' > discussion_script.json

# 2. Generate Host A
infsh app run infsh/kokoro-tts --input '{
  "text": "<host-a-lines>",
  "voice": "am_michael"
}' > host_a.json

# 3. Generate Host B
infsh app run infsh/kokoro-tts --input '{
  "text": "<host-b-lines>",
  "voice": "af_sarah"
}' > host_b.json

# 4. Interleave and merge
infsh app run infsh/media-merger --input '{
  "audio_files": ["<host-a-1>", "<host-b-1>", "<host-a-2>", "<host-b-2>"],
  "crossfade_ms": 300
}'

Audiobook Chapter

# Long-form narration
infsh app run infsh/kokoro-tts --input '{
  "text": "Chapter One. It was a dark and stormy night when the first AI achieved consciousness...",
  "voice": "bf_emma",
  "speed": 0.9
}'

Audio Enhancement

Add Background Music

# 1. Generate podcast audio
infsh app run infsh/kokoro-tts --input '{
  "text": "<podcast-script>",
  "voice": "am_michael"
}' > podcast.json

# 2. Generate ambient music
infsh app run infsh/ai-music --input '{
  "prompt": "Soft ambient background music for podcast, subtle, non-distracting, loopable"
}' > background.json

# 3. Mix with lower background volume
infsh app run infsh/media-merger --input '{
  "audio_files": ["<podcast-url>"],
  "background_audio": "<background-url>",
  "background_volume": 0.15
}'

Add Sound Effects

# Transition sounds between segments
infsh app run infsh/ai-music --input '{
  "prompt": "Short podcast transition sound, whoosh, 2 seconds"
}' > transition.json

Script Writing Tips

Prompt for Claude

infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Write a podcast script with these requirements:
  - Topic: [YOUR TOPIC]
  - Duration: 5 minutes (about 750 words)
  - Format: Two hosts (HOST_A and HOST_B)
  - Tone: Conversational, informative, engaging
  - Include: Hook intro, 3 main points, call to action
  - Mark speaker changes clearly

  Make it sound natural, not scripted. Add verbal fillers like \"you know\" and \"I mean\" occasionally."
}'

Podcast Templates

Interview Format

HOST: Introduction and welcome
GUEST: Thank you, happy to be here
HOST: First question about background
GUEST: Response with story
HOST: Follow-up question
GUEST: Deeper insight
... continue pattern ...
HOST: Closing question
GUEST: Final thoughts
HOST: Thank you and outro

Solo Episode

Introduction with hook
Topic overview
Point 1 with examples
Point 2 with examples
Point 3 with examples
Summary and takeaways
Call to action
Outro

News Roundup

Intro music
Welcome and date
Story 1: headline + details
Story 2: headline + details
Story 3: headline + details
Analysis/opinion segment
Outro

Best Practices

  1. Natural punctuation - Use commas and periods for pacing
  2. Short sentences - Easier to speak and listen
  3. Varied voices - Different speakers prevent monotony
  4. Background music - Subtle, at 10-15% volume
  5. Crossfades - Smooth transitions between segments
  6. Edit scripts - Remove filler before generating

Related Skills

# Text-to-speech models
npx skills add inference-sh/agent-skills@text-to-speech

# AI music generation
npx skills add inference-sh/agent-skills@ai-music-generation

# LLM for scripts
npx skills add inference-sh/agent-skills@llm-models

# Content pipelines
npx skills add inference-sh/agent-skills@ai-content-pipeline

# Full platform skill
npx skills add inference-sh/agent-skills@inference-sh

Browse all apps: infsh app list --category audio

Weekly Installs
214
First Seen
2 days ago
Installed on
claude-code94
opencode92
antigravity75
codex75
gemini-cli74
cursor74