alicloud-ai-video-retalk
Installation
SKILL.md
Category: provider
Model Studio VideoRetalk
Validation
mkdir -p output/alicloud-ai-video-retalk
python -m py_compile skills/ai/video/alicloud-ai-video-retalk/scripts/prepare_retalk_request.py && echo "py_compile_ok" > output/alicloud-ai-video-retalk/validate.txt
Pass criteria: command exits 0 and output/alicloud-ai-video-retalk/validate.txt is generated.
Output And Evidence
- Save normalized request payloads, target face selection settings, and task polling snapshots under
output/alicloud-ai-video-retalk/. - Record the exact video/audio input URLs and whether
video_extensionwas enabled.
Use VideoRetalk when the input is already a person video and the job is to replace lip sync with a new speech track.
Critical model names
Use this exact model string:
videoretalk
Prerequisites
- This model currently only supports China mainland (Beijing).
- API is HTTP async only; there is no online console experience.
- Set
DASHSCOPE_API_KEYin your environment, or adddashscope_api_keyto~/.alibabacloud/credentials.
Normalized interface (video.retalk)
Request
model(string, optional): defaultvideoretalkvideo_url(string, required)audio_url(string, required)ref_image_url(string, optional): target face when input video contains multiple facesvideo_extension(bool, optional): extend video to match longer audioquery_face_threshold(int, optional):120to200
Response
task_id(string)task_status(string)video_url(string, when finished)usage(object, optional)
Endpoint and execution model
- Submit task:
POST https://dashscope.aliyuncs.com/api/v1/services/aigc/image2video/video-synthesis/ - Poll task:
GET https://dashscope.aliyuncs.com/api/v1/tasks/{task_id} - HTTP calls are async only and must set header
X-DashScope-Async: enable.
Quick start
python skills/ai/video/alicloud-ai-video-retalk/scripts/prepare_retalk_request.py \
--video-url "https://example.com/talking-head.mp4" \
--audio-url "https://example.com/new-voice.wav" \
--video-extension
Operational guidance
- Keep input videos front-facing and close enough for stable face tracking.
- If the video contains multiple faces, provide
ref_image_urlto anchor the intended target. - If the new audio is longer than the input video, decide explicitly whether to extend the picture track or truncate the audio.
- URLs must be public HTTP/HTTPS links; local file paths are not accepted by the API.
Output location
- Default output:
output/alicloud-ai-video-retalk/request.json - Override base dir with
OUTPUT_DIR.
References
references/sources.md
Weekly Installs
7
Repository
cinience/alicloud-skillsGitHub Stars
383
First Seen
Mar 28, 2026
Security Audits