fal-vision

SKILL.md

fal-vision

Analyze and understand images using fal.ai vision models — segmentation, detection, OCR, captioning, and visual QA.

Scripts

Script Purpose
analyze.sh Analyze an image (segment, detect, OCR, describe, QA)

Usage

Segment Objects

./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation segment --query "the red car"

Detect Objects

./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation detect

Extract Text (OCR)

./scripts/analyze.sh --image-url "https://example.com/document.jpg" --operation ocr

Describe Image

./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation describe

Visual QA

./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation qa --query "How many people are in this image?"

Arguments

Argument Description Required
--image-url URL of image to analyze Yes
--operation segment, detect, ocr, describe, qa Yes
--query / -q Text prompt for segment/qa operations For segment/qa
--model / -m Override model endpoint No

Finding Models

To discover the best and latest vision/analysis models, use the search API:

# Search for segmentation models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "segmentation"

# Search for object detection models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "object detection"

# Search for OCR models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "ocr"

# Search for image captioning / visual QA models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "caption"
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "visual question"

Or use the search_models MCP tool with keywords like "segmentation", "detection", "ocr", "caption", "vision".

Weekly Installs
28
GitHub Stars
35
First Seen
11 days ago
Installed on
github-copilot28
opencode28
cursor27
gemini-cli27
amp27
cline27