skills/cinience/alicloud-skills/alicloud-ai-multimodal-qvq

alicloud-ai-multimodal-qvq

SKILL.md

Category: provider

Model Studio QVQ Visual Reasoning

Validation

mkdir -p output/alicloud-ai-multimodal-qvq
python -m py_compile skills/ai/multimodal/alicloud-ai-multimodal-qvq/scripts/prepare_qvq_request.py && echo "py_compile_ok" > output/alicloud-ai-multimodal-qvq/validate.txt

Pass criteria: command exits 0 and output/alicloud-ai-multimodal-qwen-vqv/validate.txt is generated.

Critical model names

Use one of these exact model strings:

  • qvq-plus
  • qvq-max

Typical use

  • Mathematical reasoning from screenshots
  • Diagram and chart reasoning
  • Visually grounded multi-step problem solving

Quick start

python skills/ai/multimodal/alicloud-ai-multimodal-qvq/scripts/prepare_qvq_request.py \
  --output output/alicloud-ai-multimodal-qvq/request.json

Notes

  • Use skills/ai/multimodal/alicloud-ai-multimodal-qwen-vl/ for standard image understanding.
  • Use QVQ when the task explicitly needs stronger reasoning over visual evidence.

References

  • references/sources.md
Weekly Installs
26
GitHub Stars
354
First Seen
4 days ago
Installed on
gemini-cli25
github-copilot25
codex25
kimi-cli25
amp25
cline25