Podcast Generation Skill

Overview

This skill generates podcast audio from text content. The workflow has two stages: writing a structured JSON script of conversational dialogue, then synthesizing audio from that script with text-to-speech.

Core Capabilities

Convert any text content (articles, reports, documentation) into podcast scripts
Generate natural two-host conversational dialogue (male and female hosts)
Synthesize speech audio using text-to-speech
Mix audio chunks into a final podcast MP3 file
Support both English and Chinese content
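
The capabilities above imply a small data contract between the script stage and the audio stage. The following is a minimal sketch of what that contract might look like, not the skill's actual schema: the field names `locale`, `lines`, `speaker`, and `text`, and the helpers `validate_script` and `mix_chunks`, are all hypothetical and chosen only for illustration.

```python
import json

# Assumed values, taken from the capabilities list: two hosts, two languages.
ALLOWED_SPEAKERS = {"male", "female"}
ALLOWED_LOCALES = {"en", "zh"}


def validate_script(script: dict) -> list[str]:
    """Return a list of problems; an empty list means the script looks usable."""
    problems = []
    if script.get("locale") not in ALLOWED_LOCALES:
        problems.append(f"unsupported locale: {script.get('locale')!r}")
    lines = script.get("lines")
    if not isinstance(lines, list) or not lines:
        problems.append("script has no dialogue lines")
        return problems
    for i, line in enumerate(lines):
        if line.get("speaker") not in ALLOWED_SPEAKERS:
            problems.append(f"line {i}: unknown speaker {line.get('speaker')!r}")
        if not str(line.get("text", "")).strip():
            problems.append(f"line {i}: empty text")
    return problems


def mix_chunks(chunks: list[bytes]) -> bytes:
    """Join per-line audio chunks in dialogue order.

    This is naive byte concatenation, used here as a stand-in; a real mixer
    may need an audio library to handle headers, gaps, and loudness.
    """
    return b"".join(chunks)


# Hypothetical two-host script in the assumed format.
script = {
    "locale": "en",
    "lines": [
        {"speaker": "female", "text": "Welcome back to the show."},
        {"speaker": "male", "text": "Today we are walking through a new report."},
    ],
}

print(json.dumps(script, indent=2))
print(validate_script(script))
```

Validating the script before synthesis keeps a malformed line from failing partway through audio generation, when some chunks have already been produced.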

Workflow

Step 1: Understand Requirements

When a user requests podcast generation, identify:

Repository

bytedance/deer-flow

First Seen

Feb 17, 2026