spec-dd
Specification-Driven Development
Orchestrate a spec-first development workflow: behavioral specification, test scenario derivation, test implementation planning, test implementation, feature implementation, and cross-artifact alignment review. The skill guides writing behavioral specifications, derives test scenarios, plans test implementation approaches, and verifies alignment across all artifacts and code. Quality gates between phases are advisory — the skill flags issues and recommends addressing them, but the user can override and proceed.
Commands
| Command | Phase | Reference |
|---|---|---|
| `/spec-dd` | Auto-detect phase, assess state, recommend next step | All references |
| `/spec-dd:spec` | Behavioral Specification | `references/specification.md` |
| `/spec-dd:test` | Test Specification | `references/test-specification.md` |
| `/spec-dd:test-impl` | Test Implementation Specification | `references/test-implementation-specification.md` |
| `/spec-dd:verify` | Implementation Verification | `references/verify.md` |
| `/spec-dd:review` | Alignment Review | `references/review.md` |
All commands accept an optional feature name argument (e.g., `/spec-dd:spec user-auth`).
`/spec-dd:verify` also accepts a spec file path as its first argument
(e.g., `/spec-dd:verify specifications.md chat-ui`). This allows verification against
any spec file, including informal ones not created through the spec-dd workflow.
If no feature name is provided and multiple features exist, list available features and
ask the user to choose.
Artifacts
All artifacts live in docs/specs/ with one set of files per feature:
| Artifact | Filename |
|---|---|
| Behavioral Specification | docs/specs/<feature>-specification.md |
| Test Specification | docs/specs/<feature>-test-specification.md |
| Test Implementation Specification | docs/specs/<feature>-test-implementation-specification.md |
| Implementation Verification | docs/specs/<feature>-verification.md |
| Implementation Review | docs/specs/<feature>-implementation-review.md |
First Steps
When any command is invoked:
- Read the relevant reference file from `references/` BEFORE doing anything else:
  - `/spec-dd:spec` -> read `references/specification.md`
  - `/spec-dd:test` -> read `references/test-specification.md`
  - `/spec-dd:test-impl` -> read `references/test-implementation-specification.md`
  - `/spec-dd:verify` -> read `references/verify.md`
  - `/spec-dd:review` -> read `references/review.md`
  - `/spec-dd` -> read whichever reference applies to the recommended phase
- Auto-detect project language by scanning for manifest files:
  - `package.json` (JavaScript/TypeScript)
  - `requirements.txt`, `pyproject.toml` (Python)
  - `go.mod` (Go)
  - `Cargo.toml` (Rust)
  - `pom.xml`, `build.gradle` (Java/Kotlin)

  Use the detected language/ecosystem to inform testing frameworks, patterns, idioms, and handoff prompts throughout the workflow.
- For `/spec-dd` (router): Follow the auto-detect router logic below.
- For `/spec-dd:<phase>`: Check whether prior-phase artifacts exist. If gaps are found, advise the user which earlier phase to complete first, but do not block — proceed if the user chooses to continue.
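The manifest-scanning step above can be sketched as a small function. This is an illustrative sketch only (the skill prescribes no implementation language; Python is used here for clarity), and it checks the project root rather than walking the whole tree:

```python
from pathlib import Path

# Manifest files mapped to the ecosystem they indicate. The mapping mirrors
# the detection table above; files are checked at the project root only.
MANIFEST_LANGUAGES = {
    "package.json": "JavaScript/TypeScript",
    "requirements.txt": "Python",
    "pyproject.toml": "Python",
    "go.mod": "Go",
    "Cargo.toml": "Rust",
    "pom.xml": "Java/Kotlin",
    "build.gradle": "Java/Kotlin",
}

def detect_languages(project_root: str) -> list[str]:
    """Return the ecosystems whose manifest files exist under project_root."""
    root = Path(project_root)
    found = [lang for name, lang in MANIFEST_LANGUAGES.items()
             if (root / name).is_file()]
    # Preserve detection order but drop duplicates
    # (e.g. both requirements.txt and pyproject.toml present).
    return list(dict.fromkeys(found))
```

A polyglot repository may return more than one ecosystem, in which case the skill should ask the user which one the feature targets.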
Phase 1: Behavioral Specification (/spec-dd:spec)
Reference: Read references/specification.md before starting.
Purpose: Define what the system does — behavioral contracts, not implementation details.
Workflow:
- Guide the user through creating or reviewing the behavioral specification for the selected feature.
- Use selectable options plus a free-text escape for every question. Ask 1-3 questions per turn, grouped thematically. Summarize captured answers before moving on.
- Review for ambiguity:
- Vague language ("fast", "secure", "easy", "simple", "efficient")
- Missing edge cases and boundary conditions
- Undefined terms and implicit assumptions
- Unmeasurable acceptance criteria
- Mark any unresolved items with `[NEEDS CLARIFICATION]`.
- Produce or update `docs/specs/<feature>-specification.md`.
Advisory gate: No unresolved [NEEDS CLARIFICATION] markers before proceeding
to Phase 2.
Artifact template:
# <Feature> - Behavioral Specification
## Objective
What this feature does and why it exists.
## User Stories & Acceptance Criteria
Numbered user stories, each with measurable acceptance criteria.
## Constraints
Technical, business, or regulatory constraints.
## Edge Cases
Boundary conditions, error states, unusual inputs.
## Non-Goals
What this feature explicitly does NOT do.
## Open Questions
Items marked [NEEDS CLARIFICATION] that must be resolved.
Phase 2: Test Specification (/spec-dd:test)
Reference: Read references/test-specification.md before starting.
Purpose: Derive test scenarios from the behavioral spec only — no implementation knowledge.
Pre-check: Verify that <feature>-specification.md exists. If it does not,
advise the user to complete Phase 1 first. If the user chooses to proceed anyway,
note the risk and continue.
Workflow:
- Derive Given/When/Then scenarios from each acceptance criterion in the behavioral specification.
- Enforce one behavior per scenario, one action per step.
- Build a coverage matrix mapping every spec requirement to test scenarios.
- Language-aware review: ensure scenarios are realistic and testable for the project's detected testing ecosystem.
- Check traceability: every acceptance criterion must have at least one test scenario.
- Produce or update `docs/specs/<feature>-test-specification.md`.
Advisory gate: Full traceability between spec requirements and test scenarios before proceeding to Phase 3.
Artifact template:
# <Feature> - Test Specification
## Coverage Matrix
Table mapping each acceptance criterion to its test scenarios.
## Test Scenarios
Given/When/Then format. Grouped by user story or functional area.
## Edge Case Scenarios
Boundary conditions derived from the specification's edge cases section.
## Traceability
Summary confirming every acceptance criterion is covered.
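The traceability check behind the advisory gate reduces to a set comparison: every acceptance criterion must appear in at least one scenario's coverage list. A minimal sketch, assuming criteria and scenarios are identified by simple IDs (the `AC-*` naming is hypothetical; the real artifacts are markdown tables):

```python
def coverage_gaps(criteria_ids: list[str],
                  scenarios: dict[str, list[str]]) -> list[str]:
    """Return acceptance-criterion IDs with no covering test scenario.

    `scenarios` maps a scenario name to the criterion IDs it covers.
    An empty result means full traceability.
    """
    covered = {cid for ids in scenarios.values() for cid in ids}
    return [cid for cid in criteria_ids if cid not in covered]
```

For example, `coverage_gaps(["AC-1", "AC-2"], {"login succeeds": ["AC-1"]})` reports `AC-2` as uncovered, which the skill would flag before recommending Phase 3.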
Phase 3: Test Implementation Specification (/spec-dd:test-impl)
Reference: Read references/test-implementation-specification.md before starting.
Purpose: Map every test scenario to a technical approach for implementing it as actual test code.
Pre-check: Verify that <feature>-test-specification.md exists. If it does
not, advise the user to complete Phase 2 first. Proceed if the user overrides.
Workflow:
- Guide the user through describing the technical approach for implementing each test scenario as actual test code.
- Use language-aware patterns: test framework selection, fixture strategies, mock/stub approaches, test file organization.
- Map every test scenario from the test specification to specific test functions, test classes, or test modules.
- Describe shared fixtures, test data factories, and setup/teardown strategies.
- Verify completeness: every Given/When/Then scenario has a corresponding test implementation approach.
- Produce or update `docs/specs/<feature>-test-implementation-specification.md`.
Advisory gate: Every test scenario mapped to a test implementation approach before proceeding to Phase 4.
Artifact template:
# <Feature> - Test Implementation Specification
## Test Framework & Conventions
Detected stack, test framework, test runner, conventions.
## Test Structure
How tests are organized: files, classes/modules, naming conventions.
## Test Scenario Mapping
Map each test scenario to a test function/method with setup and assertion strategy.
## Fixtures & Test Data
Shared fixtures, factories, test data approach, setup/teardown.
## Alignment Check
Confirmation that every test scenario has a test implementation approach.
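As an illustration of the scenario-to-test mapping this phase produces, here is how one hypothetical Given/When/Then pair might map onto pytest conventions in a Python project. All names (`user_store`, `create_user`) are invented for the example; the stand-in feature code exists only so the sketch is self-contained:

```python
import pytest

@pytest.fixture
def user_store():
    """Given: an empty in-memory user store (shared fixture)."""
    return {}

def create_user(store, email):
    """Stand-in for the feature under test (illustrative only)."""
    if email in store:
        raise ValueError("duplicate email")
    store[email] = {"email": email}
    return store[email]

def test_registering_new_email_creates_user(user_store):
    # When: a user registers with an unused email
    user = create_user(user_store, "a@example.com")
    # Then: the user exists in the store
    assert user_store["a@example.com"] is user

def test_registering_duplicate_email_is_rejected(user_store):
    # Given: the email is already registered
    create_user(user_store, "a@example.com")
    # When/Then: a second registration with the same email fails
    with pytest.raises(ValueError):
        create_user(user_store, "a@example.com")
```

Note the one-behavior-per-test rule carries over from Phase 2: each scenario becomes exactly one test function, with the Given step expressed as a fixture where it is shared.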
Phase 4: Test Implementation (Handoff)
Purpose: Actual test code is written by a coding agent following the test implementation specification. Tests must initially FAIL because the feature code does not exist yet. This skill does NOT write test code.
Workflow:
- Announce readiness for test implementation.
- Summarize what the test implementation specification contains and where it
lives (
docs/specs/<feature>-test-implementation-specification.md). - Offer to propose a prompt for the coding agent. The prompt should:
- Reference both the test specification and the test implementation specification
- Instruct the agent to implement tests using the project's test framework
- Instruct the agent to verify that all tests FAIL (since the feature is not yet implemented)
- Specify the detected language and ecosystem
- Wait for test code to be written and verified as failing before proceeding.
Phase 5: Feature Implementation (Handoff)
Purpose: Production code is written to make all tests pass. This skill does NOT write implementation code.
Workflow:
- Announce readiness for feature implementation.
- Summarize that tests exist, are failing, and define the expected behavior.
- Offer to propose a prompt for the coding agent. The prompt should:
- Reference the behavioral specification and the test files
- Instruct the agent to write code that makes all tests pass without modifying any test files
- Specify the detected language and ecosystem
- Wait for implementation code to be written before proceeding.
Phase 6: Implementation Verification (/spec-dd:verify)
Reference: Read references/verify.md before starting.
Purpose: Check whether the implementation satisfies the specification at the requirement level. Unlike the review phase (which checks artifact alignment), verification answers: "Did we build what the spec says?"
Key difference from review: Verification works with any spec file — including informal documents not created through spec-dd. It produces a requirement-level PASS/FAIL checklist, not a severity-classified alignment report.
Pre-check: Implementation code must exist. If no code has been written yet, advise the user to complete Phase 5 first.
Workflow:
- Identify the spec file. Use the argument if provided (e.g., `/spec-dd:verify specifications.md`). Otherwise, look for `docs/specs/<feature>-specification.md`. If neither exists, ask the user to provide the spec file path.
- Extract requirements. Read the spec file and identify every testable requirement using the heuristics in `references/verify.md`: numbered items, acceptance criteria, RFC-style keywords (MUST/SHALL), and behavioral descriptions. Assign each an ID (R01, R02, ...).
- Verify by reading code. For each requirement, search the codebase for implementing code. Verify that the code logic matches the spec's behavioral description. Mark as PASS, FAIL, or PARTIAL with a one-line evidence note citing file path and line number.
- Run targeted tests only when needed. Execute tests only when a requirement describes runtime behavior that cannot be verified by reading, when the user explicitly asks, or when a read-based check is inconclusive. Never re-run slow or non-deterministic tests without asking.
- Handle non-deterministic tests. If a requirement is tested by a non-deterministic test (LLM-dependent, network-dependent), verify the test exists and exercises the correct code path. Mark the requirement based on code reading, not test pass/fail.
- Produce the verification report at `docs/specs/<feature>-verification.md` (or `docs/specs/verify-<spec-filename>-<date>.md` if no feature name).
- If gaps are found, list specific next steps: which requirements need implementation, which need fixes (with file paths), and whether to revisit the spec (if requirements are ambiguous).
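The requirement-extraction heuristics above can be sketched in a few lines. This is a simplification of what `references/verify.md` describes (real extraction is judgment-driven, not purely pattern-based), shown here to make the ID-assignment step concrete:

```python
import re

# Heuristic patterns for testable requirements: numbered items and
# RFC-style keywords. Acceptance-criteria sections and plain behavioral
# descriptions need human/LLM judgment and are not captured by regex.
REQUIREMENT_PATTERNS = [
    re.compile(r"^\s*\d+[.)]\s+.+"),                     # "1. The system ..."
    re.compile(r"^.*\b(MUST NOT|MUST|SHALL NOT|SHALL)\b"),  # RFC keywords
]

def extract_requirements(spec_text: str) -> dict[str, str]:
    """Assign IDs (R01, R02, ...) to lines matching a requirement heuristic."""
    requirements = {}
    for line in spec_text.splitlines():
        if any(p.match(line) for p in REQUIREMENT_PATTERNS):
            rid = f"R{len(requirements) + 1:02d}"
            requirements[rid] = line.strip()
    return requirements
```

Each extracted ID then becomes a row in the verification report's Requirements table.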
Advisory gate: All requirements PASS or PARTIAL (no FAIL) before proceeding to Phase 7 (Review).
Artifact template:
# Verification: <spec-file>
| Field | Value |
|-------|-------|
| Spec file | `<path to spec>` |
| Feature | <feature name or "N/A"> |
| Date | <today's date> |
## Requirements
| # | Requirement | Status | Evidence |
|---|-------------|--------|----------|
| R01 | <requirement> | PASS | `file:line` — description |
| R02 | <requirement> | FAIL | Expected X, found nothing |
| R03 | <requirement> | PARTIAL | `file:line` — works but missing Y |
## Summary
X/Y PASS, Z FAIL, W PARTIAL
## Gaps Requiring Action
| # | Requirement | Issue | Suggested Fix |
|---|-------------|-------|---------------|
| R02 | <requirement> | Not implemented | <suggested approach> |
## Notes
Observations about non-deterministic tests, environment issues, or spec ambiguities.
Phase 7: Review (/spec-dd:review)
Reference: Read references/review.md before starting.
Purpose: Verify alignment across all artifacts and actual code.
Pre-check: Check that all three specification documents exist for the feature:
<feature>-specification.md, <feature>-test-specification.md, and
<feature>-test-implementation-specification.md. Flag any that are missing but
proceed with what is available.
Workflow:
- Document alignment: Cross-check the three specification documents against each other. Identify inconsistencies, coverage gaps, and unresolved ambiguities.
- Code alignment: Scan actual test files and implementation source code in the
project. Compare against the specification documents. Identify:
- Test scenarios in the test spec without corresponding test code
- Test implementation approaches in the test impl spec without corresponding test code
- Undocumented behavior (code not covered by any specification)
- Test execution: Detect the project's test runner and execute all available
tests locally:
- Check for: Makefile targets, justfile recipes, package.json scripts (test, test:unit, test:integration), pytest, go test, cargo test, mvn test, gradle test, and similar
- Run the detected test command
- Report pass/fail results with failure details
- Produce the review report at `docs/specs/<feature>-implementation-review.md`.
- If issues are found, recommend which phase to revisit and why.
Artifact template:
# <Feature> - Implementation Review
## Specification Alignment
Cross-check between the three spec documents.
## Code Alignment
Actual test code vs test specification. Actual implementation vs implementation specification.
## Test Execution
Test runner detected, command used, pass/fail results, failure details if any.
## Coverage Report
Gaps, misalignments, unresolved items.
## Status
Pass/fail summary per check.
## Recommendations
Next steps if issues are found.
Auto-Detect Router (/spec-dd)
When /spec-dd is invoked (without a specific phase subcommand):
- Scan `docs/specs/` for `*-specification.md` files to discover features. If the directory does not exist or is empty, ask the user for a feature name and start at Phase 1.
- Select feature: Use the argument if provided. Otherwise, list discovered features and ask the user to choose, or let them name a new feature.
- Assess phase status for the selected feature:
  - Does `<feature>-specification.md` exist? Any `[NEEDS CLARIFICATION]` markers?
  - Does `<feature>-test-specification.md` exist? Does it cover all acceptance criteria from the behavioral spec?
  - Do actual test files exist in the project? Do they align with the test spec?
  - Does `<feature>-test-implementation-specification.md` exist? Does it address all test scenarios?
  - Do actual test files exist? Do they fail (feature not yet implemented)?
  - Does actual implementation code exist? Do the tests pass?
  - Does `<feature>-verification.md` exist? Are all requirements PASS or PARTIAL?
  - Does `<feature>-implementation-review.md` exist? Is it current?
- Report current state: Which phases are complete, which have gaps, which have not been started.
- Recommend next action: Either fix gaps in an earlier phase or proceed to the next incomplete phase.
- If at a handoff phase (Phase 4 or Phase 5): Offer to propose a prompt for the coding agent.
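The router's coarse phase detection can be sketched as an artifact-existence check in phase order. This simplification skips the deeper checks listed above (markers, coverage, test pass/fail state) and the handoff phases (4 and 5), which produce code rather than a `docs/specs/` artifact:

```python
from pathlib import Path

# Artifact filenames checked by the router, in phase order. The first
# missing artifact marks the next phase to recommend; artifacts that do
# exist still need deeper checks (markers, coverage, currency).
PHASE_ARTIFACTS = [
    ("Phase 1: Behavioral Specification", "{f}-specification.md"),
    ("Phase 2: Test Specification", "{f}-test-specification.md"),
    ("Phase 3: Test Implementation Specification",
     "{f}-test-implementation-specification.md"),
    ("Phase 6: Implementation Verification", "{f}-verification.md"),
    ("Phase 7: Review", "{f}-implementation-review.md"),
]

def next_phase(specs_dir: str, feature: str) -> str:
    root = Path(specs_dir)
    for phase, template in PHASE_ARTIFACTS:
        if not (root / template.format(f=feature)).is_file():
            return phase
    return "All artifact phases complete"
```

In practice the recommendation also depends on artifact contents, so this existence check is only the first pass of the assessment.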
Quality Gates
All gates are advisory — the skill flags issues and recommends addressing them, but the user can override and proceed.
| Transition | Gate Check |
|---|---|
| Spec -> Test Spec | No unresolved [NEEDS CLARIFICATION] markers |
| Test Spec -> Test Impl Spec | Full traceability: every acceptance criterion mapped to test scenarios |
| Test Impl Spec -> Test Impl | Every test scenario mapped to a test implementation approach |
| Test Impl -> Feature Impl | Test files exist and all tests FAIL (feature not yet implemented) |
| Feature Impl -> Verify | Implementation code exists |
| Verify -> Review | All requirements PASS or PARTIAL (no FAIL) |
| Review -> Done | All checks pass in the review report, including test execution |
Iterative Workflow Support
The workflow supports non-linear progression:
- The auto-detect router can recommend going back to an earlier phase when it finds gaps or inconsistencies.
- Any phase can be re-entered at any time via explicit subcommands (`/spec-dd:spec`, `/spec-dd:test`, etc.).
- Artifacts are updated in place — the skill works with whatever state exists.
- Starting rough and iterating is explicitly supported: write a quick behavioral spec, rough out test scenarios, refine both, then proceed.
- Review findings may reveal issues that require revisiting specification, test specification, or test implementation specification phases.
Pragmatism Guidelines
These are guidelines, not laws. Apply judgment:
- Scale to project size. A small utility feature needs a lighter touch than a complex distributed system feature. Do not demand exhaustive specs for a 20-line function.
- Gates are advisory. Flag issues and recommend addressing them. If the user acknowledges a gap and wants to proceed, respect the override.
- Guide, do not block. The skill's role is to surface gaps, ambiguities, and misalignments. It is not a gatekeeper — it is a navigator.
- Acknowledge trade-offs. When the user makes a conscious decision to skip or simplify a phase, note it and move on. Do not repeatedly insist.
- Language idioms matter. Adapt patterns, terminology, and recommendations to the project's detected language and ecosystem. A Go project and a Python project have different testing conventions, architectural patterns, and file structures.