skills/skills.volces.com/video-understand

video-understand

SKILL.md

video-understand

Gives your agent the ability to understand and analyze video content. Supports Google Gemini and Moonshot AI (Kimi) as providers.

When to Use

Use video-understand when you need to:

  • Understand what happens in a video file (MP4, MOV, WebM, AVI, etc.)
  • Analyze a YouTube video (Gemini: passed natively; Kimi: downloads via yt-dlp first)
  • Analyze an HTTP video URL (Gemini: passed natively; Kimi: downloads via fetch first)
  • Extract specific information, summaries, or descriptions from video content
  • Ask follow-up questions about a previously analyzed video
  • Get timestamped breakdowns of video content

Prerequisites

Check if installed:

video-understand --version
Installs
13
First Seen
Mar 12, 2026