vision-bench

Vision Bench — LLM Image Evaluation

Compare images by scoring them with one or more vision LLM judges against structured rubric criteria.
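As a rough illustration of what scoring with several judges against rubric criteria could look like, the sketch below averages per-criterion scores across judges and ranks the images. The judge names, criterion names, the 1-10 scale, and the `aggregate` and `rank` helpers are all hypothetical; this is not the skill's actual aggregation logic, which this page does not describe.

```python
from statistics import mean

# Hypothetical data: judge -> image -> criterion -> score (a 1-10 scale is assumed).
scores = {
    "judge_a": {
        "img_a.png": {"composition": 8, "realism": 6},
        "img_b.png": {"composition": 7, "realism": 9},
    },
    "judge_b": {
        "img_a.png": {"composition": 6, "realism": 6},
        "img_b.png": {"composition": 9, "realism": 7},
    },
}


def aggregate(scores):
    """Average each criterion across judges, then average the criteria per image."""
    per_image = {}
    for judge_scores in scores.values():
        for image, criteria in judge_scores.items():
            for criterion, value in criteria.items():
                per_image.setdefault(image, {}).setdefault(criterion, []).append(value)
    return {
        image: {
            "criteria": {name: mean(values) for name, values in crit.items()},
            "overall": mean(mean(values) for values in crit.values()),
        }
        for image, crit in per_image.items()
    }


def rank(summary):
    """Return image names ordered from highest to lowest overall score."""
    return sorted(summary, key=lambda image: summary[image]["overall"], reverse=True)


summary = aggregate(scores)
for image in rank(summary):
    print(image, summary[image]["overall"])
```

A plain mean treats every judge and criterion equally; a weighted rubric would need per-criterion weights, which are left out here.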

Quick Start

# Install dependencies
pip install pyyaml openai anthropic mistralai

# Score a single image
python bench.py image.png --criteria photorealism --judge gemini-2.5-flash

# Compare two AI-generated images
python bench.py img_a.png img_b.png \
  --criteria text_to_image \
  --prompt "a fox in a snowy forest" \
  --judge gpt-4o
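
To repeat the comparison with more than one judge, one option is a small wrapper that builds one command per judge. The sketch below uses only the flags shown above (`--criteria`, `--prompt`, `--judge`); the `build_command` helper is hypothetical, and it assumes one judge per invocation since this page does not show how `bench.py` accepts several judges at once. It only prints the commands; running them (for example with `subprocess.run`) is left to the reader.

```python
import shlex


def build_command(images, criteria, judge, prompt=None):
    """Build an argument list for bench.py using only the flags from the Quick Start."""
    cmd = ["python", "bench.py", *images, "--criteria", criteria]
    if prompt is not None:
        cmd += ["--prompt", prompt]
    cmd += ["--judge", judge]
    return cmd


# Judge names taken from the Quick Start examples.
judges = ["gpt-4o", "gemini-2.5-flash"]

commands = [
    build_command(
        ["img_a.png", "img_b.png"],
        "text_to_image",
        judge,
        prompt="a fox in a snowy forest",
    )
    for judge in judges
]

for cmd in commands:
    # shlex.join quotes the prompt so the line can be pasted into a shell.
    print(shlex.join(cmd))
```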

Installs

141

Repository

glebis/claude-skills

GitHub Stars

304

First Seen

Apr 24, 2026

Security Audits

Gen Agent Trust Hub: Pass

Socket: Pass

Snyk: Pass
