segment-image
SKILL.md
Segment Image
Segment and identify objects in images using Gemini's vision capabilities.
When to Use
Use this skill when the user asks to:
- Identify objects in an image
- Generate masks for specific objects
- Segment an image into regions
- Extract objects from an image
Usage
bun run ${CLAUDE_PLUGIN_ROOT}/skills/segment-image/scripts/segment.ts <input-image> [options]
Options
--prompt <text>- Custom segmentation prompt--output <dir>- Output directory for mask files
Examples
# Segment all objects
bun run ${CLAUDE_PLUGIN_ROOT}/skills/segment-image/scripts/segment.ts photo.jpg
# Segment with custom prompt
bun run ${CLAUDE_PLUGIN_ROOT}/skills/segment-image/scripts/segment.ts photo.jpg --prompt "identify all people and vehicles"
# Save masks to directory
bun run ${CLAUDE_PLUGIN_ROOT}/skills/segment-image/scripts/segment.ts photo.jpg --output ./masks
Context Discipline
Do not read generated mask images back into context. The script outputs file paths. Ask the user to visually inspect the masks. To inspect programmatically, optimize the images first (via the optimize-images skill).
Model
Uses gemini-3-flash-preview (Gemini 3 Flash) for image segmentation.
Last verified: February 2026. If a newer generation exists, STOP and suggest a PR to
b-open-io/gemskills. See the ask-gemini skill'sreferences/gemini-api.mdfor current models and Google's officialgemini-api-devskill for the canonical source.
Weekly Installs
9
Repository
b-open-io/gemskillsGitHub Stars
2
First Seen
6 days ago
Security Audits
Installed on
opencode9
gemini-cli9
antigravity9
claude-code9
github-copilot9
codex9