eval-tracking

SKILL.md

Eval Tracking

Skill for Supabase-backed evaluation result tracking.

Overview

Track evaluations with:

  • eval_runs - Evaluation run metadata
  • eval_cases - Individual test cases
  • eval_scores - Metric scores per case

Use When

This skill is automatically invoked when:

  • Storing evaluation results
  • Building eval dashboards
  • Tracking regression over time
  • Comparing run results

Available Scripts

Script Description
scripts/setup-tracking.sh Run Supabase migration

Available Templates

Template Description
templates/schema.sql Supabase tables and RLS
templates/queries.sql Dashboard queries
Weekly Installs
3
GitHub Stars
3
First Seen
Feb 11, 2026
Installed on
opencode3
gemini-cli3
claude-code3
github-copilot3
codex3
amp3