AI Video Generation
Generate videos with 40+ AI models via the inference.sh CLI.
Quick Start
# Install CLI
curl -fsSL https://cli.inference.sh | sh && infsh login
# Generate a video with Veo
infsh app run google/veo-3-1-fast --input '{"prompt": "drone shot flying over a forest"}'
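Before running any of the commands in this guide from a script, it can help to confirm the CLI is actually on the `PATH`. The following is a minimal sketch, not part of the CLI itself: `require_cli` is a hypothetical helper name, and it relies only on the standard `command -v` shell builtin.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the named command is on the PATH.
require_cli() {
  command -v "$1" >/dev/null 2>&1
}

if require_cli infsh; then
  echo "infsh found"
else
  echo "infsh not found; run the install command from Quick Start first"
fi
```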
Available Models
Text-to-Video
| Model | App ID | Best For |
|---|---|---|
| Veo 3.1 Fast | google/veo-3-1-fast | Fast, with optional audio |
| Veo 3.1 | google/veo-3-1 | Best quality, frame interpolation |
| Veo 3 | google/veo-3 | High quality with audio |
| Veo 3 Fast | google/veo-3-fast | Fast with audio |
| Veo 2 | google/veo-2 | Realistic videos |
| Grok Video | xai/grok-imagine-video | xAI, configurable duration |
| Seedance 1.5 Pro | bytedance/seedance-1-5-pro | With first-frame control |
| Seedance 1.0 Pro | bytedance/seedance-1-0-pro | Up to 1080p |
Image-to-Video
| Model | App ID | Best For |
|---|---|---|
| Wan 2.5 | falai/wan-2-5 | Animate any image |
| Wan 2.5 I2V | falai/wan-2-5-i2v | High quality i2v |
| Seedance Lite | bytedance/seedance-1-0-lite | Lightweight 720p |
Avatar / Lipsync
| Model | App ID | Best For |
|---|---|---|
| OmniHuman 1.5 | bytedance/omnihuman-1-5 | Multi-character |
| OmniHuman 1.0 | bytedance/omnihuman-1-0 | Single character |
| Fabric 1.0 | falai/fabric-1-0 | Image talks with lipsync |
| PixVerse Lipsync | falai/pixverse-lipsync | Realistic lipsync |
Utilities
| Tool | App ID | Description |
|---|---|---|
| HunyuanVideo Foley | infsh/hunyuanvideo-foley | Add sound effects to video |
| Topaz Upscaler | falai/topaz-video-upscaler | Upscale video quality |
| Media Merger | infsh/media-merger | Merge videos with transitions |
Browse All Video Apps
infsh app list --category video
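When scripting against the tables above, one option is to keep a small lookup from task to app ID in one place. This is only a sketch: `pick_app` and the task names are hypothetical, and the choice of a default model per task is an assumption made for illustration. The app IDs themselves are copied from the tables.

```shell
#!/bin/sh
# Hypothetical lookup: map a task name to one app ID from the tables above.
# The default picked for each task is an illustrative assumption.
pick_app() {
  case "$1" in
    text-to-video)  echo "google/veo-3-1-fast" ;;
    image-to-video) echo "falai/wan-2-5" ;;
    lipsync)        echo "bytedance/omnihuman-1-5" ;;
    upscale)        echo "falai/topaz-video-upscaler" ;;
    foley)          echo "infsh/hunyuanvideo-foley" ;;
    merge)          echo "infsh/media-merger" ;;
    *)
      echo "unknown task: $1" >&2
      return 1
      ;;
  esac
}

# Print the app ID that a text-to-video job would use.
pick_app text-to-video
```

A script could then call `infsh app run "$(pick_app text-to-video)" --input ...` so that swapping models means editing one line.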
Examples
Text-to-Video with Veo
infsh app run google/veo-3-1-fast --input '{
"prompt": "A timelapse of a flower blooming in a garden"
}'
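A prompt that contains double quotes or backslashes will break a hand-written JSON string. The sketch below builds the `--input` payload from a shell variable instead. The helper names `json_escape` and `build_prompt_input` are hypothetical, and the escaping covers only backslashes and double quotes, not newlines or other control characters.

```shell
#!/bin/sh
# Hypothetical helper: escape backslashes and double quotes for use
# inside a JSON string. Newlines and control characters are NOT handled.
json_escape() {
  printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

# Hypothetical helper: wrap a prompt in the {"prompt": "..."} shape
# used by the examples in this guide.
build_prompt_input() {
  printf '{"prompt": "%s"}' "$(json_escape "$1")"
}

# Show the payload that would be sent.
build_prompt_input 'A neon sign that says "open" at night'
echo

# With the CLI installed, the payload could be passed like this:
# infsh app run google/veo-3-1-fast --input "$(build_prompt_input "$PROMPT")"
```

For anything beyond simple prompts, a dedicated JSON tool is a safer choice than hand-rolled escaping.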
Grok Video
infsh app run xai/grok-imagine-video --input '{
"prompt": "Waves crashing on a beach at sunset",
"duration": 5
}'
Image-to-Video with Wan 2.5
infsh app run falai/wan-2-5 --input '{
"image_url": "https://your-image.jpg"
}'
AI Avatar / Talking Head
infsh app run bytedance/omnihuman-1-5 --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
Fabric Lipsync
infsh app run falai/fabric-1-0 --input '{
"image_url": "https://face.jpg",
"audio_url": "https://audio.mp3"
}'
PixVerse Lipsync
infsh app run falai/pixverse-lipsync --input '{
"image_url": "https://portrait.jpg",
"audio_url": "https://speech.mp3"
}'
Video Upscaling
infsh app run falai/topaz-video-upscaler --input '{"video_url": "https://..."}'
Add Sound Effects (Foley)
infsh app run infsh/hunyuanvideo-foley --input '{
"video_url": "https://silent-video.mp4",
"prompt": "footsteps on gravel, birds chirping"
}'
Merge Videos
infsh app run infsh/media-merger --input '{
"videos": ["https://clip1.mp4", "https://clip2.mp4"],
"transition": "fade"
}'
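When the number of clips varies, the `videos` array can be assembled from the script's arguments. This is a minimal sketch under stated assumptions: `merge_input` is a hypothetical helper, the `videos` and `transition` field names are taken from the example above, and the URLs are assumed to contain no double quotes or backslashes, since no escaping is done.

```shell
#!/bin/sh
# Hypothetical helper: first argument is the transition, the rest are clip URLs.
# URLs are inserted as-is, so they must not contain quotes or backslashes.
merge_input() {
  transition=$1
  shift
  list=""
  for url in "$@"; do
    # Separate entries with a comma after the first one.
    if [ -n "$list" ]; then
      list="$list, "
    fi
    list="$list\"$url\""
  done
  printf '{"videos": [%s], "transition": "%s"}' "$list" "$transition"
}

# Show the payload for two clips joined with a fade.
merge_input fade "https://clip1.mp4" "https://clip2.mp4"
echo

# With the CLI installed, the payload could be passed like this:
# infsh app run infsh/media-merger --input "$(merge_input fade "$@")"
```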
Related Skills
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@inference-sh
# Google Veo specific
npx skills add inference-sh/skills@google-veo
# AI avatars & lipsync
npx skills add inference-sh/skills@ai-avatar-video
# Text-to-speech (for video narration)
npx skills add inference-sh/skills@text-to-speech
# Image generation (for image-to-video)
npx skills add inference-sh/skills@ai-image-generation
# Twitter (post videos)
npx skills add inference-sh/skills@twitter-automation
Browse all apps: infsh app list
Documentation
- Running Apps - How to run apps via CLI
- Streaming Results - Real-time progress updates
- Content Pipeline Example - Building media workflows