skills/skills.volces.com/vision-recognition-ocr

vision-recognition-ocr

SKILL.md

Vision Recognition + OCR

Cross-platform Python: on Windows prefer py -3.11; on Linux/macOS prefer python3; if plain python already points to Python 3, it also works.

Recognize vehicles, animals, and plants, or extract text from screenshots, photos, invoices, and tables via Baidu vision APIs. This skill combines lightweight classification and OCR workflows in one place.

Why install this

Use this skill when you want to:

  • identify a car, animal, or plant from an image
  • extract text from screenshots, invoices, handwriting, or tables
  • send either a local path, public URL, or base64 image into the same tool family

Common use cases

  • 识别车型 / 看图识别动物或植物
  • 提取截图、票据、表格中的文字
  • 对同一张图在“识别类别”和“OCR 提取”之间切换
Installs
8
First Seen
Apr 9, 2026