ADK Evals Skill

What are Evals?

Evals are automated conversation tests for ADK agents. Each eval defines a scenario — a sequence of user messages or events — and asserts on what the bot should do: what it says, which tools it calls, how state changes, which workflows run, and more.

Evals run against a live dev bot (adk dev), so they test the full stack — not mocks.

When to Use This Skill

Use this skill when the developer asks about:

Writing evals — file format, assertions, turn types, setup
Running evals — CLI commands, filtering, output interpretation
Testing specific primitives — how to test actions, tools, workflows, conversations, state
The testing loop — write → run → inspect traces → iterate
CI integration — exit codes, --format json flag, tagging strategies
Eval configuration — idleTimeout, judgePassThreshold, judgeModel

adk-evals

ADK Evals Skill

What are Evals?

When to Use This Skill