eval-session-scorecard
Eval Session Scorecard
Use this skill to evaluate a full conversation (multiple user/assistant turns) for continuous monitoring at the session level.
Inputs
Require:
- A conversation transcript containing multiple user/assistant turns.
- Turn labels in the transcript: every turn must begin with "User:" or "Assistant:".
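The input requirements above can be checked before scoring begins. The following is a minimal sketch, not part of the skill itself: the helper names `parse_turns` and `meets_input_requirements` are hypothetical, and it assumes that "multiple turns" means at least one "User:" turn and one "Assistant:" turn, and that unlabeled lines continue the previous turn.

```python
import re

# Matches a line that opens a new turn, e.g. "User: hello".
TURN_LABEL = re.compile(r"^(User|Assistant):\s*(.*)$")


def parse_turns(transcript: str) -> list[tuple[str, str]]:
    """Split a labeled transcript into (speaker, text) pairs.

    Assumption: a non-empty line without a label belongs to the turn before it.
    """
    turns: list[tuple[str, str]] = []
    for raw_line in transcript.splitlines():
        line = raw_line.strip()
        match = TURN_LABEL.match(line)
        if match:
            turns.append((match.group(1), match.group(2)))
        elif turns and line:
            speaker, text = turns[-1]
            turns[-1] = (speaker, f"{text}\n{line}".strip())
    return turns


def meets_input_requirements(transcript: str) -> bool:
    """Return True if both speakers appear and there are at least two turns.

    Assumption: this threshold is one reading of "multiple user/assistant turns".
    """
    turns = parse_turns(transcript)
    speakers = {speaker for speaker, _ in turns}
    return len(turns) >= 2 and speakers == {"User", "Assistant"}
```

A transcript that fails this check is better rejected with a request for labeled turns than scored on a guess about who said what.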