evaluation-harness

Evaluation Harness

Build systematic evaluation frameworks for LLM applications.

Golden Dataset Format
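
The page does not show the dataset schema itself, so the following is only a minimal sketch of what a golden dataset and a scoring loop might look like. It assumes a JSONL layout with one example per line; the field names (`id`, `input`, `expected`, `tags`), the `GoldenExample` class, the helper functions, and the stub model are all hypothetical and are not taken from the skill.

```python
import json
from dataclasses import dataclass

# Hypothetical record shape; the real skill may use different fields.
@dataclass(frozen=True)
class GoldenExample:
    id: str
    input: str
    expected: str
    tags: tuple = ()

# Assumed JSONL layout: one JSON object per line.
GOLDEN_JSONL = """\
{"id": "math-001", "input": "What is 2 + 2?", "expected": "4", "tags": ["math"]}
{"id": "geo-001", "input": "Capital of France?", "expected": "Paris", "tags": ["geography"]}
"""

REQUIRED_FIELDS = {"id", "input", "expected"}

def load_golden(text):
    """Parse JSONL text into GoldenExample records, rejecting incomplete rows."""
    examples = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            continue  # skip blank lines
        record = json.loads(line)
        missing = REQUIRED_FIELDS - record.keys()
        if missing:
            raise ValueError(f"line {line_no}: missing fields {sorted(missing)}")
        examples.append(
            GoldenExample(
                id=record["id"],
                input=record["input"],
                expected=record["expected"],
                tags=tuple(record.get("tags", [])),
            )
        )
    return examples

def evaluate(examples, model):
    """Score a callable `model(prompt) -> str` by case-insensitive exact match."""
    results = {
        ex.id: model(ex.input).strip().lower() == ex.expected.strip().lower()
        for ex in examples
    }
    accuracy = sum(results.values()) / len(results) if results else 0.0
    return {"accuracy": accuracy, "results": results}

def stub_model(prompt):
    """Stand-in for a real LLM call; answers one question correctly."""
    return "4" if "2 + 2" in prompt else "London"

report = evaluate(load_golden(GOLDEN_JSONL), stub_model)
print(report)
```

Exact match is the simplest possible scorer and is used here only to keep the sketch short; a real harness would likely swap in task-specific metrics or a rubric-based judge while keeping the same dataset-in, report-out shape.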

More from patricio0312rev/skills

Installs: 114
GitHub Stars: 38
First Seen: Jan 24, 2026