eval-accuracy
Eval Accuracy
Use this skill to evaluate how factually accurate an assistant response is.
Inputs
Required:
- The assistant response text to evaluate.
Internal Rubric (1–5)
5 = Factually correct, no misleading claims, no hallucinations, claims are well-supported or appropriately qualified
4 = Mostly correct, minor imprecision that does not materially affect meaning
3 = Partially correct, contains one significant inaccuracy or unsupported claim
2 = Multiple inaccuracies or misleading statements
1 = Fundamentally incorrect, fabricated, or contradicts known facts
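As a rough illustration of how the rubric above could be carried in code, the sketch below stores the five levels in a lookup table and rejects any score outside 1 to 5. The names `ACCURACY_RUBRIC`, `AccuracyResult`, and `describe_score` are hypothetical and not part of the skill itself; only the level descriptions come from the rubric.

```python
from dataclasses import dataclass

# Level descriptions taken from the rubric; the table name is an assumption.
ACCURACY_RUBRIC = {
    5: "Factually correct, no misleading claims, no hallucinations, "
       "claims are well-supported or appropriately qualified",
    4: "Mostly correct, minor imprecision that does not materially affect meaning",
    3: "Partially correct, contains one significant inaccuracy or unsupported claim",
    2: "Multiple inaccuracies or misleading statements",
    1: "Fundamentally incorrect, fabricated, or contradicts known facts",
}


@dataclass(frozen=True)
class AccuracyResult:
    """Hypothetical container pairing a score with its rubric description."""
    score: int
    description: str


def describe_score(score: int) -> AccuracyResult:
    """Return the rubric description for a score, or raise if it is not 1-5."""
    # bool is a subclass of int in Python, so exclude it explicitly.
    if isinstance(score, bool) or not isinstance(score, int):
        raise ValueError(f"score must be an integer from 1 to 5, got {score!r}")
    if score not in ACCURACY_RUBRIC:
        raise ValueError(f"score must be an integer from 1 to 5, got {score!r}")
    return AccuracyResult(score=score, description=ACCURACY_RUBRIC[score])


if __name__ == "__main__":
    result = describe_score(4)
    print(f"{result.score}: {result.description}")
```

Keeping the descriptions in one table means the score and its justification text cannot drift apart, and an out-of-range value fails loudly instead of being silently reported.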