together-video
Together Video
Overview
Use Together AI video APIs for:
- text-to-video generation
- image-to-video generation
- first-frame and last-frame keyframe control
- asynchronous job polling
- local download of completed outputs
When This Skill Wins
- Generate short videos from prompts
- Animate an existing image
- Choose among Veo, Sora, Kling, Seedance, PixVerse, Vidu, or other supported models
- Add polling and download logic to a product or script
Hand Off To Another Skill
- Use together-images for still-image generation or editing
- Use together-dedicated-containers only when a custom video-serving runtime is required
Quick Routing
- Text-to-video generation
- Start with scripts/generate_video.py or scripts/generate_video.ts
- Read references/api-reference.md
- Image-to-video with keyframes
- Start with scripts/image_to_video.py
- Read references/api-reference.md
- Parameter tuning, polling, or troubleshooting
- Read references/api-reference.md
- Model, dimension, and prompt-limit selection
- Read references/models.md
Workflow
- Confirm whether the user needs text-to-video or image-to-video.
- Choose the model based on duration, dimension, keyframe support, and audio support.
- Submit the async job and poll until a terminal state.
- Download the result promptly before signed URLs expire.
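The polling and download steps above can be sketched as follows. This is a minimal, SDK-agnostic illustration, not the contents of scripts/generate_video.py: the helper names (`poll_until_terminal`, `download`), the terminal state names, and the default interval and timeout are all assumptions made for the example. Check references/api-reference.md for the status values the API actually returns.

```python
import shutil
import time
import urllib.request
from pathlib import Path
from typing import Callable

# Assumed terminal state names; confirm against references/api-reference.md.
TERMINAL_STATES = {"completed", "failed", "cancelled"}


def poll_until_terminal(
    fetch_status: Callable[[], str],
    interval_s: float = 5.0,
    timeout_s: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> str:
    """Call fetch_status() repeatedly until it returns a terminal state.

    fetch_status is a hypothetical callable that wraps whatever
    job-retrieval call the SDK or REST API provides and returns the
    job's current status as a string.
    """
    deadline = clock() + timeout_s
    while True:
        status = fetch_status()
        if status in TERMINAL_STATES:
            return status
        if clock() >= deadline:
            raise TimeoutError(f"job still '{status}' after {timeout_s} seconds")
        sleep(interval_s)


def download(url: str, dest: Path) -> Path:
    """Stream the finished video to disk right away, since signed URLs expire."""
    with urllib.request.urlopen(url) as response, open(dest, "wb") as out:
        shutil.copyfileobj(response, out)
    return dest
```

In a real script, `fetch_status` would retrieve the submitted job by its ID and return its status field, and `download` would run as soon as the loop returns the success state. Injecting `sleep` and `clock` keeps the loop easy to test without waiting on a live job.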
High-Signal Rules
- Python scripts require the Together v2 SDK (together>=2.0.0). If the user is on an older version, they must upgrade first: uv pip install --upgrade "together>=2.0.0".
- Together video generation is asynchronous; do not treat it like a synchronous image call.
- Keyframe support is model-specific. Validate support before promising first-plus-last-frame control.
- Keep polling and download logic as part of the workflow, not as an afterthought.
- Use explicit dimensions and generation parameters rather than relying on unstable defaults.
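The SDK version rule at the top of this list can be checked before running any script. The sketch below uses only the standard library; the helper names (`parse_release`, `meets_minimum`, `check_together_sdk`) are hypothetical and not part of the skill's scripts. It compares only the numeric release segment, so a pre-release such as 2.0.0rc1 is treated as 2.0.0; the `packaging` library would be a more rigorous choice if it is available.

```python
from importlib.metadata import PackageNotFoundError, version

MINIMUM = (2, 0, 0)  # from the rule above: together>=2.0.0


def parse_release(text: str) -> tuple[int, ...]:
    """Return the leading numeric release segment, e.g. '2.1.0rc1' -> (2, 1, 0)."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed: str, minimum: tuple[int, ...] = MINIMUM) -> bool:
    """Compare release tuples, padding short versions such as '2.1' with zeros."""
    release = parse_release(installed)
    padded = release + (0,) * (len(minimum) - len(release))
    return padded >= minimum


def check_together_sdk() -> str:
    """Report whether the installed together package satisfies the minimum."""
    try:
        installed = version("together")
    except PackageNotFoundError:
        return 'together is not installed; run: uv pip install "together>=2.0.0"'
    if meets_minimum(installed):
        return f"together {installed} satisfies >=2.0.0"
    return f'together {installed} is too old; run: uv pip install --upgrade "together>=2.0.0"'


if __name__ == "__main__":
    print(check_together_sdk())
```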
Resource Map
- API reference: references/api-reference.md
- Polling, parameter tuning, and troubleshooting: references/api-reference.md
- Model guide: references/models.md
- Python text-to-video workflow: scripts/generate_video.py
- TypeScript text-to-video workflow: scripts/generate_video.ts
- Python image-to-video workflow: scripts/image_to_video.py