watch-cli-video-agent
watch-cli Video Agent Skill
Skill by ara.so — Devtools Skills collection.
What It Does
watch-cli gives AI agents the ability to "watch" social videos by decomposing them into frames (JPGs) + audio transcript, avoiding expensive multimodal video APIs. Works with YouTube, X/Twitter, LinkedIn, TikTok, Reddit, Vimeo, and Facebook. ~50× cheaper and ~5× faster than calling a multimodal LLM on full video.
The insight: Videos are just frames + audio. Extract both, read them separately with vision + text LLMs, and you have full video understanding without burning a video model on every frame.
Installation
# Quick install (symlinks to ~/.local/bin)
curl -fsSL https://raw.githubusercontent.com/sonpiaz/watch-cli/main/install.sh | bash
More from aradotso/devtools-skills
cli-anything-agent-native-software
Generate and use command-line interfaces that make any software controllable by AI agents with structured JSON output
75opencli-universal-cli-hub
Convert websites, Electron apps, and local tools into standardized CLIs for AI agents and humans using OpenCLI's browser automation and adapter framework.
74op-auto-clicker
Automate mouse clicks with customizable intervals, hotkeys, and macros using OP Auto Clicker for Windows
73chrome-devtools-mcp-automation
Use Chrome DevTools MCP to control Chrome, debug web apps, analyze performance, and automate browser tasks via MCP tools
72wx-cli-wechat-local-data
Query and export local WeChat data (messages, contacts, moments, favorites) from the command line with daemon architecture
71officecli-office-automation
Use OfficeCLI to create, read, and edit Word, Excel, and PowerPoint files from the command line with AI-friendly commands.
70