video-processing

SKILL.md

What I do

  • Download Videos - Fetch videos from YouTube, Facebook, and other platforms
  • Extract Audio - Pull audio tracks from video files in various formats (MP3, WAV, AAC, etc.)
  • Transcribe Audio - Convert speech to text with word-level timestamps using faster-whisper (supports Bengali and English)
  • Analyze Videos - Detect scenes, extract keyframes, generate thumbnails and subtitles

Commands

Download a video

video download <url> [--output <path>] [--format <mp4|webm>] [--quality <quality>]

Extract audio from video

video extract-audio <video> [--output <path>] [--format <mp3|wav|aac>] [--bitrate <kbps>]

Transcribe video to text

video transcribe <video> [--output <path>] [--format <json|srt|vtt>] [--language <code>]

Analyze video comprehensively

video analyze <video> [--output <path>] [--threshold <0.0-1.0>] [--interval <seconds>]

Requirements

  • Node.js 18+
  • FFmpeg (system installation required)
  • faster-whisper (CTranslate2-based Whisper implementation for faster local transcription)

Usage examples

# Download YouTube video
video download "https://youtube.com/watch?v=..." --output ./videos

# Extract MP3 from video
video extract-audio ./video.mp4 --format mp3 --bitrate 320k

# Generate subtitles
video transcribe ./video.mp4 --format srt --language en

# Analyze video with scene detection
video analyze ./video.mp4 --threshold 0.3

When to use me

Use this skill when you need to:

  • Save online videos for offline viewing
  • Convert video content to audio (podcasts, music)
  • Create subtitles or transcripts from video content
  • Analyze video structure for editing or highlights
Weekly Installs
13
First Seen
Feb 26, 2026
Installed on
github-copilot13
codex13
kimi-cli13
amp13
gemini-cli13
cursor13