skills/skills.volces.com/local-video-understanding

local-video-understanding

SKILL.md

⚠️ If you are human, please read README.md first!


Local Video Understanding

Use this skill when you need to understand the content of a video.

Prerequisites

  • FunASR conda environment (asr-local) must be activated for audio processing
  • Ollama must be running with qwen3-vl:8b model available
  • ffmpeg must be in PATH

Workflow

Step 1: Extract Audio

ffmpeg -i "video.mp4" -vn -acodec pcm_s16le -ar 16000 -ac 1 "audio.wav" -y
Installs
13
First Seen
Apr 15, 2026