gcp-agent-golden-dataset-builder

Installation
SKILL.md

gcp-agent-golden-dataset-builder

This skill helps you build the foundation for data-driven agent development: the Golden Dataset. Grounded in evaluation_blog.md, it focuses on verifying not just the final answer, but the "Thinking Process" (Reasoning Trace).

Usage

Ask Antigravity to:

  • "Build a golden dataset with tool trajectories"
  • "Structure my evaluation data for tool call validation"
  • "Create a template for my Course Creator agent evaluation"

Dataset Pattern

A production-ready dataset uses the .jsonl format and includes:

  1. prompt: The user input.
  2. reference: The ground truth answer (for semantic ResponseMatch).
  3. reference_trajectory: A list of expected tool calls. This allows the evaluator to check if the agent used the right tools in the right order.

Example Structure

Related skills
Installs
3
GitHub Stars
281
First Seen
Mar 11, 2026