adk-evals
Installation
SKILL.md
ADK Evals Skill
What are Evals?
Evals are automated conversation tests for ADK agents. Each eval defines a scenario — a sequence of user messages or events — and asserts on what the bot should do: what it says, which tools it calls, how state changes, which workflows run, and more.
Evals run against a live dev bot (adk dev), so they test the full stack — not mocks.
When to Use This Skill
Use this skill when the developer asks about:
- Writing evals — file format, assertions, turn types, setup
- Running evals — CLI commands, filtering, output interpretation
- Testing specific primitives — how to test actions, tools, workflows, conversations, state
- The testing loop — write → run → inspect traces → iterate
- CI integration — exit codes,
--format jsonflag, tagging strategies - Eval configuration — idleTimeout, judgePassThreshold, judgeModel