eval-session-scorecard

Installation
SKILL.md

Eval Session Scorecard

Use this skill to evaluate a full conversation (multiple user/assistant turns) for continuous monitoring at the session level.

Inputs

Require:

  • A conversation transcript containing multiple user/assistant turns.
  • The transcript must clearly label turns as "User:" and "Assistant:".

Workflow

Installs
4
First Seen
Feb 19, 2026
eval-session-scorecard — whitespectre/ai-assistant-evals