eval-accuracy

Installation
SKILL.md

Eval Accuracy

Use this skill to evaluate how factually accurate an assistant response is.

Inputs

Require:

  • The assistant response text to evaluate.

Internal Rubric (1–5)

5 = Factually correct, no misleading claims, no hallucinations, claims are well-supported or appropriately qualified
4 = Mostly correct, minor imprecision that does not materially affect meaning
3 = Partially correct, contains one significant inaccuracy or unsupported claim
2 = Multiple inaccuracies or misleading statements
1 = Fundamentally incorrect, fabricated, or contradicts known facts

Workflow

Installs
6
First Seen
Feb 19, 2026
eval-accuracy — whitespectre/ai-assistant-evals