fal-vision
SKILL.md
fal-vision
Analyze and understand images using fal.ai vision models — segmentation, detection, OCR, captioning, and visual QA.
Scripts
| Script | Purpose |
|---|---|
analyze.sh |
Analyze an image (segment, detect, OCR, describe, QA) |
Usage
Segment Objects
./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation segment --query "the red car"
Detect Objects
./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation detect
Extract Text (OCR)
./scripts/analyze.sh --image-url "https://example.com/document.jpg" --operation ocr
Describe Image
./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation describe
Visual QA
./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation qa --query "How many people are in this image?"
Arguments
| Argument | Description | Required |
|---|---|---|
--image-url |
URL of image to analyze | Yes |
--operation |
segment, detect, ocr, describe, qa | Yes |
--query / -q |
Text prompt for segment/qa operations | For segment/qa |
--model / -m |
Override model endpoint | No |
Finding Models
To discover the best and latest vision/analysis models, use the search API:
# Search for segmentation models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "segmentation"
# Search for object detection models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "object detection"
# Search for OCR models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "ocr"
# Search for image captioning / visual QA models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "caption"
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "visual question"
Or use the search_models MCP tool with keywords like "segmentation", "detection", "ocr", "caption", "vision".
Weekly Installs
28
Repository
fal-ai-community/skillsGitHub Stars
35
First Seen
11 days ago
Security Audits
Installed on
github-copilot28
opencode28
cursor27
gemini-cli27
amp27
cline27