alicloud-ai-multimodal-qvq
SKILL.md
Category: provider
Model Studio QVQ Visual Reasoning
Validation
mkdir -p output/alicloud-ai-multimodal-qvq
python -m py_compile skills/ai/multimodal/alicloud-ai-multimodal-qvq/scripts/prepare_qvq_request.py && echo "py_compile_ok" > output/alicloud-ai-multimodal-qvq/validate.txt
Pass criteria: the command exits 0 and output/alicloud-ai-multimodal-qvq/validate.txt is generated.
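The same check can be run from Python when a shell is not convenient. The following is a minimal sketch, not part of the skill itself: the function name `validate_script` is hypothetical, and it assumes the pass criteria are exactly those stated above (the script compiles and the marker file is written).

```python
import py_compile
from pathlib import Path


def validate_script(script: Path, out_dir: Path) -> bool:
    """Byte-compile `script`; on success write validate.txt into `out_dir`.

    Hypothetical helper mirroring the shell validation step above.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    try:
        # doraise=True turns syntax errors into PyCompileError instead of
        # printing them, so failure can be detected programmatically.
        py_compile.compile(str(script), doraise=True)
    except (py_compile.PyCompileError, OSError):
        # OSError covers a missing or unreadable script file.
        return False
    # Same marker text the shell command writes with `echo`.
    (out_dir / "validate.txt").write_text("py_compile_ok\n", encoding="utf-8")
    return True
```

Called with the script path and output directory from the command above, it returns True only when the script compiles, and it leaves validate.txt unwritten on failure.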
Critical model names
Use one of these exact model strings:
- qvq-plus
- qvq-max
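Because the model string must match exactly, a small guard can catch typos before a request is built. This is an illustrative sketch only: `ALLOWED_QVQ_MODELS` and `check_model_name` are hypothetical names, and the allowed set is simply the two model strings this section names (qvq-plus and qvq-max).

```python
# The two exact model strings named in this section.
ALLOWED_QVQ_MODELS = ("qvq-plus", "qvq-max")


def check_model_name(name: str) -> str:
    """Return `name` unchanged if it is an allowed QVQ model string.

    The comparison is exact and case-sensitive; anything else raises
    ValueError. Hypothetical helper, not part of the skill's scripts.
    """
    if name not in ALLOWED_QVQ_MODELS:
        allowed = ", ".join(ALLOWED_QVQ_MODELS)
        raise ValueError(f"unknown QVQ model {name!r}; expected one of: {allowed}")
    return name
```

Rejecting near-misses such as different casing is deliberate here, since the section asks for exact strings.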
Typical use
- Mathematical reasoning from screenshots
- Diagram and chart reasoning
- Visually grounded multi-step problem solving
Quick start
python skills/ai/multimodal/alicloud-ai-multimodal-qvq/scripts/prepare_qvq_request.py \
--output output/alicloud-ai-multimodal-qvq/request.json
Notes
- Use skills/ai/multimodal/alicloud-ai-multimodal-qwen-vl/ for standard image understanding.
- Use QVQ when the task explicitly needs stronger reasoning over visual evidence.
References
references/sources.md
Repository
cinience/alicloud-skills