tdd-spec-review
[IMPORTANT] Use
TaskCreateto break ALL work into small tasks BEFORE starting — including tasks for each file read. This prevents context loss from long files. For simple tasks, AI MUST ask user whether to skip.
Evidence Gate: MANDATORY IMPORTANT MUST — every claim, finding, and recommendation requires
file:lineproof or traced evidence with confidence percentage (>80% to act, <80% must verify first).
OOP & DRY Enforcement: MANDATORY IMPORTANT MUST — flag duplicated patterns that should be extracted to a base class, generic, or helper. Classes in the same group or suffix (ex *Entity, *Dto, *Service, etc...) MUST inherit a common base (even if empty now — enables future shared logic and child overrides). Verify project has code linting/analyzer configured for the stack.
External Memory: For complex or lengthy work (research, analysis, scan, review), write intermediate findings and final results to a report file in
plans/reports/— prevents context loss and serves as deliverable.
-
.claude/skills/shared/double-round-trip-review-protocol.md— Mandatory two-round review enforcement -
.claude/skills/shared/graph-impact-analysis-protocol.md— Graph impact analysis: blast-radius + trace to find potentially stale/affected files -
docs/test-specs/— Test specifications by module (cross-reference during review to verify TC completeness and avoid duplicates)
Quick Summary
Goal: Auto-review test specifications for coverage completeness, TC format correctness, and no missing test cases before implementation proceeds.
Key distinction: AI self-review (automatic), NOT user interview.
Be skeptical. Apply critical thinking, sequential thinking. Every claim needs traced proof, confidence percentages (Idea should be more than 80%).
Workflow
- Locate test specs — Find TCs in feature doc Section 17 or
docs/test-specs/ - Load source — Read stories/PBI/acceptance criteria that TCs should cover
- Evaluate checklist — Score each check
- Calculate coverage — % of stories/AC with corresponding TCs
- Classify — PASS/WARN/FAIL
- Output verdict
Checklist
Required (all must pass)
- TC ID format — All TCs follow
TC-{FEATURE}-{NNN}format - Story coverage — Every user story has at least one corresponding TC
- AC coverage — Every acceptance criterion has a test case
- Happy path — Each story has at least one happy path TC
- Error path — Each story has at least one error/failure TC
- No duplicates — No duplicate TCs testing the same scenario
- Testable assertions — Each TC has clear expected result (not vague "should work")
- Authorization TCs — At least 1 TC per story verifying unauthorized access is rejected (ref:
.claude/skills/shared/cross-cutting-quality-concerns-protocol.md§1)
Recommended (>=50% should pass)
- Edge cases — Boundary values, empty inputs, max limits tested
- Integration points — Cross-service scenarios covered
- Performance TCs — Response time or throughput expectations where relevant; production-like data volume TCs if >1000 records expected (ref: protocol §4)
- Security TCs — Auth, authorization, input validation tested
- Seed data TCs — If feature needs reference data, TCs verify data exists and seeder runs correctly (ref: protocol §2)
- Data migration TCs — If schema changes exist, TCs verify data transforms correctly, rollback works, no data loss (ref: protocol §5)
Output
## Test Spec Review Result
**Status:** PASS | WARN | FAIL
**TCs reviewed:** {count}
**Coverage:** {X}% of stories, {Y}% of acceptance criteria
### Coverage Matrix
| Story/AC | TC IDs | Happy | Error | Edge |
| -------- | ------ | ----- | ----- | ---- |
### Required ({X}/{Y})
- ✅/❌ Check description
### Recommended ({X}/{Y})
- ✅/⚠️ Check description
### Missing Coverage
- {Stories/AC without TCs}
### Verdict
{PROCEED | REVISE_FIRST}
Round 2: Focused Re-Review (MANDATORY)
Protocol:
.claude/skills/shared/double-round-trip-review-protocol.md
After completing Round 1 checklist evaluation, execute a second full review round:
- Re-read the Round 1 verdict and checklist results
- Re-evaluate ALL checklist items — do NOT rely on Round 1 memory
- Challenge Round 1 PASS items: "Is this really PASS? Did I verify with evidence?"
- Focus on what Round 1 typically misses:
- Implicit assumptions that weren't validated
- Missing acceptance criteria coverage
- Edge cases not addressed in the artifact
- Cross-references that weren't verified
- Update verdict if Round 2 found new issues
- Final verdict must incorporate findings from BOTH rounds
Key Rules
- FAIL blocks workflow — If FAIL, do NOT proceed to implementation.
- Coverage >= 100% required — Every story and AC must have at least one TC.
- No guessing — Reference specific TC IDs and story references.
- Quality over quantity — Flag duplicate TCs, prefer fewer meaningful tests.
Next Steps
MANDATORY IMPORTANT MUST after completing this skill, use AskUserQuestion to recommend:
- "/plan (Recommended)" — Create implementation plan with validated test specs
- "/tdd-spec" — Re-generate specs if FAIL verdict
- "/integration-test" — Generate integration test code from specs
- "Skip, continue manually" — user decides
Closing Reminders
MANDATORY IMPORTANT MUST break work into small todo tasks using TaskCreate BEFORE starting.
MANDATORY IMPORTANT MUST validate decisions with user via AskUserQuestion — never auto-decide.
MANDATORY IMPORTANT MUST add a final review todo task to verify work quality.