ai-video-generation
Installation
Summary
Generate videos with 40+ AI models including Veo, Seedance, Wan, and Grok via inference.sh CLI.
- Supports text-to-video, image-to-video, avatar animation, lipsync, video upscaling, and foley sound generation across multiple model families
- Access 10+ text-to-video models (Veo 3.1, Seedance 1.5 Pro, Wan, Grok Video) and 5+ image-to-video variants optimized for speed, quality, or cost
- Includes avatar and lipsync tools (OmniHuman, Fabric, PixVerse) for talking-head and character animation workflows
- Utility models for video upscaling, sound effect generation, and multi-clip merging with transitions
SKILL.md
AI Video Generation
Generate videos with 40+ AI models via inference.sh CLI.

Quick Start
Requires inference.sh CLI (
belt). Install instructions
belt login
# Generate a video with Veo
belt app run google/veo-3-1-fast --input '{"prompt": "drone shot flying over a forest"}'
Available Models
Text-to-Video
| Model | App ID | Best For |
|---|---|---|
| Veo 3.1 Fast | google/veo-3-1-fast |
Fast, with optional audio |
| Veo 3.1 | google/veo-3-1 |
Best quality, frame interpolation |
| Veo 3 | google/veo-3 |
High quality with audio |
| Veo 3 Fast | google/veo-3-fast |
Fast with audio |
| Veo 2 | google/veo-2 |
Realistic videos |
| P-Video | pruna/p-video |
Fast, economical, with audio support |
| WAN-T2V | pruna/wan-t2v |
Economical 480p/720p |
| Grok Video | xai/grok-imagine-video |
xAI, configurable duration |
| Seedance 1.5 Pro | bytedance/seedance-1-5-pro |
With first-frame control |
| Seedance 1.0 Pro | bytedance/seedance-1-0-pro |
Up to 1080p |
Image-to-Video
| Model | App ID | Best For |
|---|---|---|
| Wan 2.5 | falai/wan-2-5 |
Animate any image |
| Wan 2.5 I2V | falai/wan-2-5-i2v |
High quality i2v |
| WAN-I2V | pruna/wan-i2v |
Economical 480p/720p |
| P-Video | pruna/p-video |
Fast i2v with audio |
| Seedance Lite | bytedance/seedance-1-0-lite |
Lightweight 720p |
Avatar / Lipsync
| Model | App ID | Best For |
|---|---|---|
| OmniHuman 1.5 | bytedance/omnihuman-1-5 |
Multi-character |
| OmniHuman 1.0 | bytedance/omnihuman-1-0 |
Single character |
| Fabric 1.0 | falai/fabric-1-0 |
Image talks with lipsync |
| PixVerse Lipsync | falai/pixverse-lipsync |
Realistic lipsync |
Utilities
| Tool | App ID | Description |
|---|---|---|
| HunyuanVideo Foley | infsh/hunyuanvideo-foley |
Add sound effects to video |
| Topaz Upscaler | falai/topaz-video-upscaler |
Upscale video quality |
| Media Merger | infsh/media-merger |
Merge videos with transitions |
Browse All Video Apps
belt app list --category video
Examples
Text-to-Video with Veo
belt app run google/veo-3-1-fast --input '{
"prompt": "A timelapse of a flower blooming in a garden"
}'
Grok Video
belt app run xai/grok-imagine-video --input '{
"prompt": "Waves crashing on a beach at sunset",
"duration": 5
}'
Image-to-Video with Wan 2.5
belt app run falai/wan-2-5 --input '{
"image_url": "https://your-image.jpg"
}'
AI Avatar / Talking Head
belt app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
Fabric Lipsync
belt app run falai/fabric-1-0 --input '{
"image_url": "https://face.jpg",
"audio_url": "https://audio.mp3"
}'
PixVerse Lipsync
belt app run falai/pixverse-lipsync --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
Video Upscaling
belt app run falai/topaz-video-upscaler --input '{"video_url": "https://..."}'
Add Sound Effects (Foley)
belt app run infsh/hunyuanvideo-foley --input '{
"video_url": "https://silent-video.mp4",
"prompt": "footsteps on gravel, birds chirping"
}'
Merge Videos
belt app run infsh/media-merger --input '{
"videos": ["https://clip1.mp4", "https://clip2.mp4"],
"transition": "fade"
}'
Related Skills
# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli
# Pruna P-Video (fast & economical)
npx skills add inference-sh/skills@p-video
# Google Veo specific
npx skills add inference-sh/skills@google-veo
# AI avatars & lipsync
npx skills add inference-sh/skills@ai-avatar-video
# Text-to-speech (for video narration)
npx skills add inference-sh/skills@text-to-speech
# Image generation (for image-to-video)
npx skills add inference-sh/skills@ai-image-generation
# Twitter (post videos)
npx skills add inference-sh/skills@twitter-automation
Browse all apps: belt app list
Documentation
- Running Apps - How to run apps via CLI
- Streaming Results - Real-time progress updates
- Content Pipeline Example - Building media workflows