good-TTvideo2text
Extract audio from Douyin/TikTok videos and transcribe it to text using an ASR service.
Task Objective
Convert Douyin/TikTok video audio to timestamped text, supporting both an interactive web UI and a command-line workflow.
Capabilities: Video parsing, audio extraction, ASR transcription, timestamp generation
Trigger: User provides Douyin/TikTok URL and requests transcription
Usage Modes
Mode 1: Web UI (Recommended)
Visual interface for transcription management:
# Install dependencies
cd skills/good-TTvideo2text
pip install -r requirements.txt
# Start service (default port 8000)
python app/main.py
# Browser access
http://localhost:8000
Features:
- Paste video URL for instant transcription
- View results with timestamps
- Cookie management for restricted videos
- Real-time progress updates
Mode 2: Script (Command Line)
Suitable for automation and AI workflow integration:
# Basic usage
python scripts/transcribe.py "https://v.douyin.com/xxx"
# Extract URL from share text
python scripts/transcribe.py "7.47 复制打开抖音,看看【用户名】的作品 https://v.douyin.com/xxx"
# JSON output
python scripts/transcribe.py "https://v.douyin.com/xxx" --output json
# Text output (default)
python scripts/transcribe.py "https://v.douyin.com/xxx" --output text
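For automation, the same command can be driven from Python. The following is a minimal sketch, not part of the skill itself: the helper names `build_command` and `run_transcription` are hypothetical, and it assumes the script writes only the JSON document to stdout, encoded as UTF-8, when `--output json` is passed.

```python
import json
import subprocess
import sys


def build_command(user_text: str, output: str = "json") -> list:
    """Assemble the argv list for scripts/transcribe.py (hypothetical helper)."""
    return [sys.executable, "scripts/transcribe.py", user_text, "--output", output]


def run_transcription(user_text: str, timeout_s: float = 120.0) -> dict:
    """Run the script and parse its stdout as JSON.

    Assumption: stdout holds nothing but the JSON document, in UTF-8.
    """
    completed = subprocess.run(
        build_command(user_text),
        capture_output=True,
        text=True,
        encoding="utf-8",
        timeout=timeout_s,
    )
    return json.loads(completed.stdout)
```

Passing the arguments as a list rather than a shell string means share text containing spaces or quotes reaches the script as a single argument.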
Output Format (Text):
=== Video Info ===
Title: Video title
Author: Author name
Duration: 30s
=== Transcription ===
Full Text:
Complete transcription text...
Sentences with Timestamps:
[00:00-00:03] First sentence
[00:03-00:06] Second sentence
Output Format (JSON):
{
"success": true,
"video_info": {
"title": "Video title",
"author": "Author name",
"duration": 30,
"create_time": 1234567890
},
"transcription": {
"text": "Complete transcription...",
"sentences": [
{
"start_ms": 0,
"end_ms": 3000,
"text": "First sentence"
}
]
}
}
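The `start_ms` and `end_ms` fields in the JSON output map onto the `[MM:SS-MM:SS]` prefixes of the text output. A minimal sketch of that conversion, assuming offsets are truncated to whole seconds (the helper names are illustrative, not taken from the script):

```python
import json


def format_ms(ms: int) -> str:
    """Render a millisecond offset as MM:SS, truncating to whole seconds."""
    minutes, seconds = divmod(ms // 1000, 60)
    return f"{minutes:02d}:{seconds:02d}"


def sentences_to_lines(result: dict) -> list:
    """Build '[MM:SS-MM:SS] text' lines from the JSON structure shown above."""
    lines = []
    for sentence in result["transcription"]["sentences"]:
        start = format_ms(sentence["start_ms"])
        end = format_ms(sentence["end_ms"])
        lines.append(f"[{start}-{end}] {sentence['text']}")
    return lines


sample = json.loads("""
{
  "success": true,
  "video_info": {"title": "Video title", "author": "Author name",
                 "duration": 30, "create_time": 1234567890},
  "transcription": {
    "text": "Complete transcription...",
    "sentences": [{"start_ms": 0, "end_ms": 3000, "text": "First sentence"}]
  }
}
""")

for line in sentences_to_lines(sample):
    print(line)  # prints "[00:00-00:03] First sentence"
```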
Prerequisites
Dependencies:
- fastapi>=0.120.3, uvicorn>=0.35.0, httpx>=0.28.1
- TikTokDownloader dependencies (see requirements.txt)
ASR Configuration:
- Environment variables auto-injected by Goodable platform
  - GOODABLE_ASR_SUBMIT_URL - ASR task submission endpoint
  - GOODABLE_ASR_QUERY_URL_TEMPLATE - ASR result query endpoint (with {task_id} placeholder)
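As an illustration of how the two variables might be consumed, the sketch below reads them and fills in the `{task_id}` placeholder. The function names are hypothetical and the example URLs are placeholders; the actual ASR request and response formats are not described here and are not assumed.

```python
import os


def load_asr_config(env=None) -> dict:
    """Read the two ASR endpoints, failing early if either is missing."""
    env = os.environ if env is None else env
    names = ("GOODABLE_ASR_SUBMIT_URL", "GOODABLE_ASR_QUERY_URL_TEMPLATE")
    missing = [name for name in names if not env.get(name)]
    if missing:
        raise RuntimeError("ASR not configured, missing: " + ", ".join(missing))
    return {"submit_url": env[names[0]], "query_template": env[names[1]]}


def build_query_url(query_template: str, task_id: str) -> str:
    """Substitute the {task_id} placeholder with a concrete task ID."""
    return query_template.replace("{task_id}", task_id)


# Placeholder values for illustration only; the real endpoints are injected
# by the platform.
example_env = {
    "GOODABLE_ASR_SUBMIT_URL": "https://asr.example.invalid/submit",
    "GOODABLE_ASR_QUERY_URL_TEMPLATE": "https://asr.example.invalid/tasks/{task_id}",
}
config = load_asr_config(example_env)
print(build_query_url(config["query_template"], "abc123"))
```

Using `str.replace` rather than `str.format` keeps the substitution safe if the template ever contains other braces.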
Cookie Configuration (Optional):
- Some videos require login cookies
- Configure via Web UI Settings or edit TikTokDownloader/settings.json
- Cookie format: Douyin web cookie string
Standard Workflow (AI Usage)
When User Provides URL
- Extract URL from input:
  - User may provide raw URL or share text with URL
  - Use regex to extract actual video URL
- Call transcribe script:
  python scripts/transcribe.py "USER_PROVIDED_TEXT"
- Handle results:
  - Success: Present transcription text and timestamps
  - Error: Check error message for troubleshooting
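The URL extraction step can be sketched with a regular expression. This is a minimal sketch under stated assumptions, not the pattern used by `scripts/transcribe.py`: it assumes the video link is an http(s) URL on a `douyin.com` or `tiktok.com` host made of ASCII characters, and the name `extract_video_url` is hypothetical.

```python
import re
from typing import Optional

# Assumed pattern: http(s) URL on douyin.com or tiktok.com (any subdomain).
# re.ASCII keeps \w from matching Chinese characters adjacent to the link.
VIDEO_URL_RE = re.compile(
    r"https?://(?:[\w-]+\.)*(?:douyin|tiktok)\.com/[\w\-./?=&%@]*",
    re.ASCII | re.IGNORECASE,
)


def extract_video_url(text: str) -> Optional[str]:
    """Return the first Douyin/TikTok URL in the text, or None if absent."""
    match = VIDEO_URL_RE.search(text)
    if match is None:
        return None
    # Drop sentence punctuation that may trail the link in share text.
    return match.group(0).rstrip(".,")


share_text = "7.47 复制打开抖音,看看【用户名】的作品 https://v.douyin.com/xxx"
print(extract_video_url(share_text))  # prints "https://v.douyin.com/xxx"
```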
Common Error Handling
| Error | Cause | Solution |
|---|---|---|
| Invalid video URL | URL format incorrect | Ask user to provide valid Douyin/TikTok URL |
| Video not found | Video deleted or requires login | Ask user to check video or provide cookies |
| No audio found | Video has no background music | Inform user this video has no audio track |
| ASR not configured | Environment variables missing | Run via Goodable platform (auto-injects vars) |
| ASR timeout | Long audio or service slow | Retry or use Web UI for monitoring |
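The table above can be encoded as a small lookup so a caller can pick a follow-up action. This sketch assumes the script's error message contains the error name from the first column; that matching rule and the names `ERROR_ACTIONS` and `suggest_action` are assumptions for illustration.

```python
from typing import Tuple

# Error name (first column) mapped to the suggested solution (third column).
ERROR_ACTIONS = {
    "Invalid video URL": "Ask user to provide valid Douyin/TikTok URL",
    "Video not found": "Ask user to check video or provide cookies",
    "No audio found": "Inform user this video has no audio track",
    "ASR not configured": "Run via Goodable platform (auto-injects vars)",
    "ASR timeout": "Retry or use Web UI for monitoring",
}

# Only the timeout case is listed with "Retry" as a solution.
RETRYABLE = {"ASR timeout"}


def suggest_action(error_message: str) -> Tuple[str, bool]:
    """Return (suggested action, whether a retry is reasonable)."""
    lowered = error_message.lower()
    for name, action in ERROR_ACTIONS.items():
        if name.lower() in lowered:
            return action, name in RETRYABLE
    return "Show the raw error message to the user", False


print(suggest_action("ASR timeout after 60s"))
```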
Resource Index
- app/main.py - FastAPI application entry
- scripts/transcribe.py - Command-line transcription script
- TikTokDownloader/ - Video parsing library (source code)
- TikTokDownloader/settings.json - Cookie and configuration
- static/index.html - Web UI interface
- requirements.txt - Python dependencies
- downloads/ - Temporary file directory
Important Notes
- Cookie Requirement: Public videos work without cookies; restricted videos need login cookies
- ASR Platform: This skill requires Goodable platform's ASR service integration
- URL Formats Supported:
  - Short URL: https://v.douyin.com/xxx
  - Full URL: https://www.douyin.com/video/1234567890
  - TikTok: https://vm.tiktok.com/xxx, https://www.tiktok.com/@user/video/xxx
- Audio Extraction: Extracts background music URL directly (no download needed)
- ASR Processing Time: Typically 5-30 seconds depending on audio length
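The supported URL formats listed in the notes above can be told apart by host and path. A minimal sketch using the standard library; the category labels and the function name `classify_url` are illustrative, and the skill's own parser may accept more variants than these four.

```python
from urllib.parse import urlparse


def classify_url(url: str) -> str:
    """Map a URL to one of the supported formats, or 'unsupported'."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    path = parsed.path
    if host == "v.douyin.com":
        return "douyin-short"
    if host == "www.douyin.com" and path.startswith("/video/"):
        return "douyin-full"
    if host == "vm.tiktok.com":
        return "tiktok-short"
    if host == "www.tiktok.com" and "/video/" in path:
        return "tiktok-full"
    return "unsupported"


for url in (
    "https://v.douyin.com/xxx",
    "https://www.douyin.com/video/1234567890",
    "https://vm.tiktok.com/xxx",
    "https://www.tiktok.com/@user/video/xxx",
):
    print(url, "->", classify_url(url))
```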
Example AI Usage
User: "Please transcribe this Douyin video: 7.47 复制打开抖音 https://v.douyin.com/abc123"
AI: Let me transcribe this video for you.
[Calls transcribe.py script with the provided text]
Based on the transcription:
Video: "Video title"
Author: Author name
Transcription:
[00:00-00:03] Content of the first sentence
[00:03-00:08] Content of the second sentence
...
Full text:
Complete transcription text...