image-to-text
Image to Text
Extract all readable text from an image using OCR (Tesseract). Returns the full text content along with word-level bounding boxes and confidence scores.
When to Use
- Reading text content from a screenshot or design mockup
- Extracting UI copy (labels, buttons, headings) so you don't have to retype it
- Getting text positions and bounding boxes from a design image
How It Works
- The image is passed to Tesseract.js for optical character recognition
- Tesseract segments the image into lines and words
- The skill returns the full text plus word-level details (bounding box and confidence score)
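As a rough illustration of working with the word-level details described above, the sketch below filters out low-confidence words and converts corner coordinates into x/y/width/height boxes. The `OcrWord` shape (`text`, `confidence`, `bbox` with `x0`/`y0`/`x1`/`y1`) is an assumption modelled on the word objects Tesseract.js reports; the `toWordBoxes` helper, the `WordBox` type, and the default threshold of 60 are hypothetical and not part of this skill.

```typescript
// Assumed input shape, modelled on Tesseract.js word objects.
// Confidence is taken to be on a 0-100 scale.
interface Bbox { x0: number; y0: number; x1: number; y1: number; }
interface OcrWord { text: string; confidence: number; bbox: Bbox; }

// Hypothetical output shape that is convenient for layout work.
interface WordBox {
  text: string;
  x: number;
  y: number;
  width: number;
  height: number;
  confidence: number;
}

// Keep non-empty words at or above the confidence threshold and
// convert corner coordinates to origin plus size.
function toWordBoxes(words: OcrWord[], minConfidence = 60): WordBox[] {
  return words
    .filter((w) => w.confidence >= minConfidence && w.text.trim() !== "")
    .map((w) => ({
      text: w.text,
      x: w.bbox.x0,
      y: w.bbox.y0,
      width: w.bbox.x1 - w.bbox.x0,
      height: w.bbox.y1 - w.bbox.y0,
      confidence: w.confidence,
    }));
}

// Made-up sample data standing in for real OCR output.
const sample: OcrWord[] = [
  { text: "Sign", confidence: 96, bbox: { x0: 10, y0: 20, x1: 50, y1: 40 } },
  { text: "in", confidence: 93, bbox: { x0: 56, y0: 20, x1: 70, y1: 40 } },
  { text: "~", confidence: 21, bbox: { x0: 80, y0: 22, x1: 84, y1: 26 } },
];

const boxes = toWordBoxes(sample);
console.log(boxes.map((b) => b.text).join(" ")); // prints "Sign in"
```

A threshold like this is a judgment call: lowering it keeps more noise from icons and decorative elements, while raising it can drop small or stylized UI text.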