get-results
Get Simulation Results
Retrieve results for $ARGUMENTS.
Workflow
If Run ID Provided
List all simulations for the run:
coval simulations list --run-id <run_id> --format json
If Simulation ID Provided
Get detailed simulation data:
coval simulations get <simulation_id> --format json
This returns:
- Status (COMPLETED, FAILED)
- Test case ID
- Transcript (conversation history)
- Timestamps
- Error message (if failed)
Step 2: Analyze Results
For each simulation, extract:
| Field | Description |
|---|---|
status |
COMPLETED or FAILED |
test_case_id |
Which test case was run |
transcript |
Full conversation |
has_audio |
Whether audio is available |
error_message |
Failure reason (if any) |
Step 3: Present Summary
## Results for Run <run_id>
| Simulation | Status | Test Case | Audio |
|------------|--------|-----------|-------|
| sim_abc123 | COMPLETED | tc_xyz | Yes |
| sim_def456 | FAILED | tc_uvw | No |
### Failed Simulations
- sim_def456: "Connection timeout"
### View Details
`coval simulations get <sim_id>`
### Download Audio
`coval simulations audio <sim_id> -o output.wav`
Filtering
List simulations with filters:
coval simulations list --run-id <run_id> --filter "status=FAILED"
coval simulations list --run-id <run_id> --filter "has_audio=true"
More from coval-ai/coval-external-skills
onboard
>
14launch-run
Launch a Coval evaluation run against an AI agent. Use when user wants to start an evaluation, test an agent, or run simulations.
12coval-resources
Comprehensive overview of ALL Coval platform resources, their hierarchy, relationships, API endpoints, and ID formats. Use when user asks about Coval resources, data model, how things relate, what endpoints exist, or needs context about the platform structure before making API calls.
12quick-eval
Full evaluation workflow - launch a run, watch progress, and summarize results. Use for end-to-end agent testing.
12download-audio
Download audio recordings from Coval voice simulations. Use when user wants to listen to or analyze call recordings.
12watch-run
Monitor a Coval run's progress with live updates. Use when user wants to check run status or wait for completion.
12