gcp-agent-golden-dataset-builder
gcp-agent-golden-dataset-builder
This skill helps you build the foundation for data-driven agent development: the Golden Dataset. Grounded in evaluation_blog.md, it focuses on verifying not just the final answer, but the "Thinking Process" (Reasoning Trace).
Usage
Ask Antigravity to:
- "Build a golden dataset with tool trajectories"
- "Structure my evaluation data for tool call validation"
- "Create a template for my Course Creator agent evaluation"
Dataset Pattern
A production-ready dataset uses the .jsonl format and includes:
prompt: The user input.reference: The ground truth answer (for semantic ResponseMatch).reference_trajectory: A list of expected tool calls. This allows the evaluator to check if the agent used the right tools in the right order.
Example Structure
More from googlecloudplatform/devrel-demos
go-backend-dev
Specialist in implementing robust HTTP services and APIs in Go. Activates for "endpoint", "handler", "API", "server".
41go-reviewer
Expert code reviewer focusing on idiomatic Go, concurrency safety, and clean code principles. Activates for "review", "idiomatic", "refactor".
41go-architect
Expert in Go project scaffolding, standard layout compliance, and dependency management. Activates for "new project", "structure", "layout".
36go-test-expert
Expert in Go testing patterns, table-driven tests, httptest, benchmarking, and fuzzing. Activates for "test", "fail", "benchmark", "debug", "fuzz".
35latest-software-version
>
34go-project-setup
>
26