alicloud-ai-multimodal-qwen-omni

Category: provider

Model Studio Qwen Omni

Validation

mkdir -p output/alicloud-ai-multimodal-qwen-omni
python -m py_compile skills/ai/multimodal/alicloud-ai-multimodal-qwen-omni/scripts/prepare_omni_request.py && echo "py_compile_ok" > output/alicloud-ai-multimodal-qwen-omni/validate.txt

Pass criteria: command exits 0 and output/alicloud-ai-multimodal-qwen-omni/validate.txt is generated.
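
The same check can be sketched in Python for environments where chaining shell commands is inconvenient. This is a minimal sketch, not part of the skill: validate_script is a hypothetical helper name, and it mirrors the shell command above by writing the marker file only when byte-compilation succeeds.

```python
import py_compile
from pathlib import Path


def validate_script(script: str, out_dir: str) -> Path:
    """Byte-compile `script`; on success, write validate.txt into `out_dir`.

    Hypothetical helper that mirrors the shell validation command.
    """
    # doraise=True makes a syntax error raise py_compile.PyCompileError
    # instead of only printing a message, so the marker is never written
    # for a script that fails to compile.
    py_compile.compile(script, doraise=True)

    marker = Path(out_dir) / "validate.txt"
    marker.parent.mkdir(parents=True, exist_ok=True)  # same effect as mkdir -p
    marker.write_text("py_compile_ok\n")
    return marker
```

Calling validate_script with the script path and output/alicloud-ai-multimodal-qwen-omni as arguments should leave the same validate.txt that the pass criteria look for.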

Critical model names

Use one of these exact model strings:

  • qwen3-omni-flash
  • qwen3-omni-flash-realtime
  • qwen-omni-turbo
  • qwen-omni-turbo-realtime
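
Because the strings must match exactly, a small guard can catch typos and casing mistakes before a request is sent. The sketch below is illustrative only: check_model and is_realtime are hypothetical helper names, and treating the "-realtime" suffix as the marker of a realtime variant is an assumption drawn from the names in the list.

```python
# The four exact model strings from the list above.
SUPPORTED_MODELS = (
    "qwen3-omni-flash",
    "qwen3-omni-flash-realtime",
    "qwen-omni-turbo",
    "qwen-omni-turbo-realtime",
)


def check_model(name: str) -> str:
    """Return `name` unchanged if it is an exact match, else raise ValueError."""
    if name not in SUPPORTED_MODELS:
        raise ValueError(
            f"unsupported model {name!r}; expected one of {SUPPORTED_MODELS}"
        )
    return name


def is_realtime(name: str) -> bool:
    """Assumption: realtime variants are the ones ending in '-realtime'."""
    return check_model(name).endswith("-realtime")
```

The comparison is deliberately case-sensitive and does no normalization, so "Qwen3-Omni-Flash" is rejected rather than silently corrected.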

Typical use

  • Image + audio + text assistant
  • Realtime multimodal agents
  • Spoken responses grounded in visual input

Normalized interface (omni.chat)

Request

  • model (string, optional): default qwen3-omni-flash
  • text (string, optional)
  • image (string, optional)
  • audio (string, optional)
  • response_modalities (array, optional): e.g. ["text"], ["text","audio"]
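
As an illustration of the request shape, the fields above can be assembled into a plain dictionary. This is a sketch under stated assumptions: build_omni_request is a hypothetical helper, not the skill's own script, and omitting unset optional fields instead of sending null values is a design choice made here. The source does not say whether image and audio hold URLs, file paths, or encoded data, so the values in the usage note are placeholders.

```python
from typing import Any, Dict, Optional, Sequence

DEFAULT_MODEL = "qwen3-omni-flash"  # default stated in the request fields above


def build_omni_request(
    text: Optional[str] = None,
    image: Optional[str] = None,
    audio: Optional[str] = None,
    model: str = DEFAULT_MODEL,
    response_modalities: Optional[Sequence[str]] = None,
) -> Dict[str, Any]:
    """Build a normalized omni.chat request dict (hypothetical helper)."""
    request: Dict[str, Any] = {"model": model}

    # Assumption: optional fields that were not supplied are left out
    # entirely rather than being sent as null.
    for key, value in (("text", text), ("image", image), ("audio", audio)):
        if value is not None:
            request[key] = value

    if response_modalities is not None:
        request["response_modalities"] = list(response_modalities)
    return request
```

For example, build_omni_request(text="Describe this picture", image="<image reference>", response_modalities=["text", "audio"]) yields a dictionary with the default model and only the three supplied fields, which can then be serialized with json.dumps.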

Response

  • text (string, optional)
  • audio_url or audio_chunk (optional)
  • usage (object, optional)
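
Since every response field is optional, calling code should not assume any of them is present. The following sketch shows one way to read the response defensively. OmniResponse and parse_omni_response are hypothetical names, and the assumption that the response arrives as a flat mapping keyed by the field names above is not confirmed by the source.

```python
from dataclasses import dataclass
from typing import Any, Mapping, Optional


@dataclass
class OmniResponse:
    """Typed view of the normalized response; every field may be absent."""
    text: Optional[str] = None
    audio_url: Optional[str] = None
    audio_chunk: Optional[Any] = None  # type not specified in the source
    usage: Optional[Mapping[str, Any]] = None

    @property
    def has_audio(self) -> bool:
        # Audio may arrive as either a URL or a chunk, per the field list.
        return self.audio_url is not None or self.audio_chunk is not None


def parse_omni_response(payload: Mapping[str, Any]) -> OmniResponse:
    """Map a response payload onto OmniResponse; missing keys become None."""
    return OmniResponse(
        text=payload.get("text"),
        audio_url=payload.get("audio_url"),
        audio_chunk=payload.get("audio_chunk"),
        usage=payload.get("usage"),
    )
```

A text-only reply such as {"text": "A cat on a sofa"} parses to an object whose has_audio is False, so callers can branch on the modality actually returned instead of the one requested.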

Quick start

python skills/ai/multimodal/alicloud-ai-multimodal-qwen-omni/scripts/prepare_omni_request.py \
  --output output/alicloud-ai-multimodal-qwen-omni/request.json

References

  • references/sources.md