doc-testing
Doc/Procedure Testing
Test documentation procedures by converting them to Doc Detective test specifications and executing them.
When to Use This Skill
Prefer Doc Detective over Playwright for documentation and web UI testing. Use this skill when:
- Testing documented procedures (tutorials, how-tos, getting started guides)
- Verifying UI workflows described in documentation
- Generating test specs from documentation text
- Validating test specifications
- Executing browser-based tests from docs
Use Playwright instead only when:
- Building custom test frameworks outside documentation context
- Requiring advanced browser automation features not available in Doc Detective
- The user specifically requests Playwright
⚠️ CRITICAL: Read These Rules Before Generating Any JSON
Rule 1: Action Name = JSON Key (NEVER use "action" property)
THE ACTION NAME IS THE KEY ITSELF. There is NO "action" property in Doc Detective.
✅ CORRECT - action name IS the key:
{ "goTo": "https://example.com" }
{ "find": "Welcome" }
{ "click": "Submit" }
{ "type": { "keys": "hello", "selector": "#input" } }
❌ WRONG - NEVER use an "action" property:
{ "action": "goTo", "url": "..." } // INVALID! Doc Detective will reject this!
{ "action": "find", "text": "..." } // INVALID! Doc Detective will reject this!
{ "action": "click", "selector": "..." } // INVALID! Doc Detective will reject this!
If you write "action": anywhere in a step, you are doing it wrong. Delete it and use the action name as the key.
Rule 2: Prefer Text Over Selectors
✅ { "click": "Submit" } // Text-based - matches visible text
✅ { "find": "Welcome" } // Text-based - matches visible text
❌ { "click": "#submit-btn" } // Selector - only if text won't work
❌ { "find": ".welcome-msg" } // Selector - only if text won't work
Rule 3: ALWAYS Run Validator Before Returning ANY Spec
You MUST execute this command and show the output before returning a spec to the user:
# Save spec to file first
echo '<your-spec-json>' > /tmp/spec.json
# Run validator - MUST show "Validation PASSED"
node ./scripts/dist/validate-test.js /tmp/spec.json
Do NOT return a spec without running validation. If validation fails, fix the spec and re-validate.
Workflow
┌─────────────────┐ ┌──────────────────┐ ┌──────────────────┐ ┌─────────────┐ ┌─────────────┐ ┌─────────────┐
│ 1. Interpret │────▶│ 2. VALIDATE │────▶│ 2b. Inject? │────▶│ 3. Execute │────▶│ 4. Analyze │────▶│ 5. Fix? │
│ (docs → spec) │ │ (MANDATORY GATE) │ │ (optional offer) │ │ (run tests) │ │ (results) │ │ (optional) │
└─────────────────┘ └──────────────────┘ └──────────────────┘ └─────────────┘ └─────────────┘ └─────────────┘
Loop-back: if execution (step 3) reports failures and `--fix` is set, continue into step 5 and iterate back through execution until tests pass or max attempts are reached.
Efficiency tip: For full workflows, chain commands. Example:
# Generate, validate, and execute in sequence
echo '{"tests":[...]}' > spec.json && node ./scripts/dist/validate-test.js spec.json && npx doc-detective run --input spec.json
Step 1: Text-to-Test Interpretation
Convert documentation procedures into test specifications.
Map Actions to Steps
| Documentation describes | Doc Detective step format |
|---|---|
| Navigate to URL | { "goTo": "https://..." } |
| Click/tap element | { "click": "Button Text" } |
| Find/verify element | { "find": "Expected Text" } |
| Type text | { "type": { "keys": "text", "selector": "#id" } } |
| API call | { "httpRequest": { "url": "...", "method": "GET" } } |
| Screenshot | { "screenshot": "name.png" } |
| Shell command | { "runShell": { "command": "..." } } |
| Wait/pause | { "wait": 1000 } |
| Check link | { "checkLink": "https://..." } |
Generate Test Specification
{
"tests": [
{
"testId": "login-flow",
"description": "Verify login procedure from documentation",
"steps": [
{
"stepId": "nav-login",
"description": "Navigate to login page",
"goTo": "https://example.com/login"
},
{
"description": "Verify login form visible",
"find": "Sign In"
},
{
"description": "Enter username",
"type": {
"keys": "testuser",
"selector": "#username"
}
},
{
"description": "Enter password",
"type": {
"keys": "password123",
"selector": "#password"
}
},
{
"description": "Submit login",
"click": "Sign In"
},
{
"description": "Verify dashboard loads",
"find": "Dashboard"
}
]
}
]
}
Text-Based Element Location
Match documentation language directly:
| Documentation | Test step |
|---|---|
| "Click the Submit button" | { "click": "Submit" } |
| "Verify Welcome appears" | { "find": "Welcome" } |
| "Tap Next" | { "click": "Next" } |
| "Look for Dashboard" | { "find": "Dashboard" } |
Use selectors only when:
- Documentation provides explicit selectors
- Multiple elements share the same text
- Element has no visible text (icon buttons)
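When a selector is unavoidable, the step shape stays the same; only the value changes from a text string to a selector object. A minimal sketch (the `aria-label` selector is hypothetical):

```json
{
  "description": "Icon-only button has no visible text, so fall back to a selector",
  "click": { "selector": "button[aria-label='Settings']" }
}
```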
Step 2: Validate (MANDATORY - DO NOT SKIP)
⚠️ YOU MUST EXECUTE THIS BEFORE RESPONDING
Before returning ANY test spec to the user, you MUST:
1. Save the spec to a temp file:
   cat > /tmp/test-spec.json << 'EOF'
   <your-generated-spec-here>
   EOF
2. Run the validator and show its output:
   node ./scripts/dist/validate-test.js /tmp/test-spec.json
3. Only if the output shows "Validation PASSED", proceed to return the spec.
If you skip validation or don't show the output, you are violating this skill's requirements.
What Validation Checks
- Required `tests` array exists and is non-empty
- Each test has a non-empty `steps` array
- Each step has exactly one known action
- Action parameters match expected types
Known Actions
These are the only valid action types:
- `goTo` - URL string or `{ url: string, waitUntil?: string }`
- `click` - Text string or `{ selector: string }`
- `find` - Text string or `{ selector: string, timeout?: number, matchText?: string }`
- `type` - `{ keys: string, selector: string }`
- `wait` - Number (ms) or `{ selector: string, state: string }`
- `screenshot` - Path string or `{ path: string }`
- `httpRequest` - `{ url: string, method: string, ... }`
- `runShell` - `{ command: string, exitCodes?: number[] }`
- `checkLink` - URL string or `{ url: string, statusCodes?: number[] }`
- `loadVariables` - File path string
- `loadCookie` / `saveCookie` - File path string
- `record` - Path string or object
- `stopRecord` - Boolean `true`
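The object forms combine the optional parameters; for example, a `find` that waits up to 5 seconds for a selector and checks its text (selector and values are illustrative):

```json
{
  "find": {
    "selector": "#banner",
    "timeout": 5000,
    "matchText": "Welcome"
  }
}
```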
Example Validation Output
Passing:
✓ Validation PASSED
Mode: structural validation
Tests validated: 1
Steps validated: 6
Steps passed: 6
Steps failed: 0
Failing:
✗ Validation FAILED
Mode: structural validation
Tests validated: 1
Steps validated: 3
Steps passed: 2
Steps failed: 1
Errors:
1. Unknown action: "navigate". Known actions: checkLink, click, ...
Test: my-test
Step: step-1
Action: navigate
Validation Failure Handling
If validation fails:
- Read the error messages
- Fix each reported issue in the test spec
- Re-run validation
- Repeat until validation passes
- Only then proceed to return spec or execute
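The fix-and-revalidate loop can be sketched in shell. Here `validate` is a stub standing in for the real `node ./scripts/dist/validate-test.js /tmp/test-spec.json` call (assume the validator exits non-zero while the spec is broken):

```shell
attempts=0
# Stub validator: fails twice, then passes. Replace with the real call:
#   node ./scripts/dist/validate-test.js /tmp/test-spec.json
validate() { [ "$attempts" -ge 2 ]; }

until validate; do
  attempts=$((attempts + 1))
  echo "validation failed - fixing spec (attempt $attempts)"
done
echo "validation passed after $attempts fixes"
```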
Step 2b: Offer Inline Test Injection (After Validation Passes)
When you generate a test spec from a source documentation file, offer to inject the tests directly into that file using inline test markup.
When to Offer Injection
Offer injection when ALL of these conditions are met:
- Validation passed (Step 2 complete)
- The test spec was generated from a specific source file (not from a URL, user description, or other non-file source)
- The source file path is known and accessible
Do NOT offer injection when:
- Validation failed
- Spec was generated from a URL or user-provided description (no source file)
- Source file cannot be modified (e.g., read-only, external repository)
Track Source File Path
Throughout the workflow, remember which source file(s) each test was generated from:
┌─────────────────┐ ┌─────────────────┐ ┌─────────────────┐
│ Source File │────▶│ Test Spec │────▶│ Injection │
│ docs/login.md │ │ test: login-flow│ │ Target: login.md│
└─────────────────┘ └─────────────────┘ └─────────────────┘
Injection Decision Prompt
After validation passes, ask the user:
Would you like to inject this test spec into your source file?
This will add inline test comments to `<source-file-path>` that Doc Detective can execute directly from your documentation.
- Yes - Show preview of changes, then apply on confirmation
- No - Return the JSON spec only
Injection Workflow (Preview-Then-Apply)
If user accepts injection:
1. Write the spec to a temp file for the injection tool:
   # Create temp spec file
   echo '<validated-spec-json>' > /tmp/doc-detective-spec-$(date +%s).json
2. Show a preview first (default mode - no `--apply` flag):
   node ./skills/inline-test-injection/scripts/dist/inline-test-injection.js /tmp/doc-detective-spec-<timestamp>.json <source-file-path>
   This displays a diff of the planned changes without modifying the file.
3. Ask for confirmation after the user reviews the preview: "Apply these changes to `<source-file-path>`?"
   - Yes - Apply the injection
   - No - Cancel (the spec JSON is still available)
4. Apply the changes on confirmation:
   node ./skills/inline-test-injection/scripts/dist/inline-test-injection.js /tmp/doc-detective-spec-<timestamp>.json <source-file-path> --apply
5. Clean up the temp file after a successful apply. Retain it on error for debugging.
Multi-File Handling
When a test spec spans multiple source files, offer injection separately for each file:
Source files: docs/login.md, docs/checkout.md
Generated spec: 2 tests (login-flow from login.md, checkout-flow from checkout.md)
Injection offers:
1. "Inject login-flow tests into docs/login.md?" → Yes/No
2. "Inject checkout-flow tests into docs/checkout.md?" → Yes/No
User can accept/decline per-file. Return the full JSON spec regardless of injection decisions.
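A multi-file spec can keep per-file provenance in test descriptions so each injection offer maps cleanly back to its source. A sketch (file paths, IDs, and URLs are illustrative):

```json
{
  "tests": [
    {
      "testId": "login-flow",
      "description": "Generated from docs/login.md",
      "steps": [{ "goTo": "https://example.com/login" }, { "find": "Sign In" }]
    },
    {
      "testId": "checkout-flow",
      "description": "Generated from docs/checkout.md",
      "steps": [{ "goTo": "https://example.com/cart" }, { "click": "Checkout" }]
    }
  ]
}
```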
Injection Tool Location
The injection tool is part of the inline-test-injection skill:
skills/inline-test-injection/scripts/dist/inline-test-injection.js
If the tool is not available, inform the user and return the JSON spec without injection.
Step 3: Execute Tests
Only execute after validation passes.
Check Available Methods
# Check for global install
which doc-detective
# Check for Docker
docker --version
# Check for npx
which npx
Execution Fallback Chain
Primary - Global CLI:
doc-detective run --input test-spec.json
Secondary - Docker:
docker run -v "$(pwd):/app" docdetective/doc-detective:latest run --input /app/test-spec.json
Tertiary - NPX:
npx doc-detective run --input test-spec.json
If none is available, inform the user that Doc Detective cannot run and suggest installation.
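The fallback chain can be expressed as a single check-and-pick sketch using `command -v` (the commands mirror the examples above; this only selects a runner, it does not execute tests):

```shell
# Pick the first available execution method, in preference order.
if command -v doc-detective >/dev/null 2>&1; then
  runner="doc-detective run --input test-spec.json"
elif command -v docker >/dev/null 2>&1; then
  runner="docker run -v \"$(pwd):/app\" docdetective/doc-detective:latest run --input /app/test-spec.json"
elif command -v npx >/dev/null 2>&1; then
  runner="npx doc-detective run --input test-spec.json"
else
  runner=""
fi

if [ -n "$runner" ]; then
  echo "will run: $runner"
else
  echo "Doc Detective unavailable - suggest installation"
fi
```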
Common Options
# Specify output directory
doc-detective run --input test-spec.json --output ./results
# Run in headless mode (default)
doc-detective run --input test-spec.json
# Run with visible browser
doc-detective run --input test-spec.json --headless false
# Test specific files/directories
doc-detective run --input ./docs/
# Use config file
doc-detective run --config doc-detective.json
Step 4: Analyze Results
Doc Detective outputs testResults-<timestamp>.json:
{
"summary": {
"specs": { "pass": 1, "fail": 0 },
"tests": { "pass": 2, "fail": 1 },
"steps": { "pass": 8, "fail": 2 }
},
"specs": [
{
"id": "test-spec",
"tests": [
{
"testId": "login-flow",
"status": "PASS",
"steps": [
{
"status": "PASS",
"action": "goTo",
"resultDescription": "Navigated to https://example.com/login"
},
{
"status": "FAIL",
"action": "find",
"resultDescription": "Element 'Sign In' not found within timeout"
}
]
}
]
}
]
}
Interpret Results
- Check `summary` for overall pass/fail counts
- For failures, examine `specs[].tests[].steps[]` entries with `status: "FAIL"`
- Read `resultDescription` for error details
- Map failures back to documentation sections
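For a quick first pass, failing steps can be counted with grep before deeper analysis (the sample file below mirrors the testResults layout shown above; for anything more involved, parse the JSON properly):

```shell
# Build a small sample results file matching the layout above.
cat > /tmp/results-sample.json << 'EOF'
{
  "summary": { "steps": { "pass": 1, "fail": 1 } },
  "specs": [ { "tests": [ { "steps": [
    { "status": "PASS", "action": "goTo" },
    { "status": "FAIL", "action": "find",
      "resultDescription": "Element 'Sign In' not found within timeout" }
  ] } ] } ]
}
EOF

# Count lines containing failing steps, then pull their error details.
fails=$(grep -c '"status": "FAIL"' /tmp/results-sample.json)
echo "failing steps: $fails"
grep '"resultDescription"' /tmp/results-sample.json
```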
Common Failure Patterns
| Error | Likely cause |
|---|---|
| "Element not found" | Text changed, element removed, wrong selector |
| "Timeout" | Page slow to load, element not visible |
| "Navigation failed" | URL changed, redirect, auth required |
| "Unexpected status code" | API endpoint changed, auth issue |
Step 5: Fix Failing Tests (Optional)
When tests fail, analyze the failures and generate fixes with confidence scores.
Fix Tool
Use the fix-tests tool to analyze failures and propose fixes:
# Analyze failures and show proposed fixes (dry-run)
node ./scripts/dist/fix-tests.js results.json --spec test-spec.json --dry-run
# Apply fixes above 80% confidence threshold
node ./scripts/dist/fix-tests.js results.json --spec test-spec.json --threshold 80
# Apply all fixes regardless of confidence
node ./scripts/dist/fix-tests.js results.json --spec test-spec.json --auto-fix
Fix Loop Workflow
┌─────────────────┐ ┌─────────────────┐ ┌─────────────────┐ ┌─────────────────┐
│ Analyze Failure │────▶│ Generate Fix │────▶│ Calculate │────▶│ Apply/Prompt │
│ (read results) │ │ (modify spec) │ │ Confidence │ │ (threshold-based)│
└─────────────────┘ └─────────────────┘ └─────────────────┘ └─────────────────┘
│
┌────────────────────────────────────────────────────────────────────────┘
│
▼
┌─────────────────┐ ┌─────────────────┐
│ Re-run Tests │────▶│ Pass/Fail Check │─── Pass ──▶ Done
│ (validate fix) │ │ (max 3 attempts)│─── Fail ──▶ Loop or Manual Review
└─────────────────┘ └─────────────────┘
Fix Options
| Option | Default | Description |
|---|---|---|
| `--fix` | `false` | Enable automatic fix attempts |
| `--auto-fix` | `false` | Apply all fixes without prompting |
| `--fix-threshold` | `80` | Confidence threshold (0-100) for auto-apply |
| `--max-fix-attempts` | `3` | Maximum fix iterations per test |
Failure Analysis
For each failing step, analyze:
- Error type: Element not found, timeout, navigation failure, assertion failure
- Context: What the step was trying to do
- Actual result: What happened instead
- Potential causes: Why it might have failed
{
"failureAnalysis": {
"stepId": "click-submit",
"errorType": "element_not_found",
"action": "click",
"target": "Submit",
"resultDescription": "Element 'Submit' not found within timeout",
"potentialCauses": [
"Button text changed",
"Element not yet visible",
"Different selector needed"
]
}
}
Generate Fix with Confidence Score
Based on failure analysis, generate a fix and calculate confidence:
{
"fix": {
"stepId": "click-submit",
"originalStep": { "click": "Submit" },
"fixedStep": { "click": "Submit Form" },
"confidence": 85,
"reasoning": "Page contains button with text 'Submit Form' which matches the intent"
}
}
Confidence scoring factors:
- High (80-100): Exact alternative found, clear pattern match
- Medium (50-79): Partial match, likely correct but uncertain
- Low (0-49): Best guess, significant uncertainty
Apply Fix Decision
Based on confidence and options:
if --auto-fix:
Apply fix automatically
else if confidence >= fix-threshold:
Apply fix automatically
else:
Prompt user for confirmation:
⚠️ Low confidence fix proposed (65%)
Step: click "Submit"
Error: Element not found
Proposed fix: click "Submit Form"
Reasoning: Found button with similar text
[A]pply [S]kip [M]anual edit [Q]uit fixing
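The decision above reduces to a small predicate. A shell sketch with illustrative values:

```shell
confidence=65
threshold=80
auto_fix=false

# Apply automatically when --auto-fix is set or confidence clears
# the threshold; otherwise prompt the user.
if [ "$auto_fix" = true ] || [ "$confidence" -ge "$threshold" ]; then
  decision="apply"
else
  decision="prompt"
fi
echo "$decision"
```

With these values the sketch prints `prompt`, matching the low-confidence path above.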
Re-run and Iterate
After applying fixes:
- Save updated spec to temp file
- Run validator on updated spec
- Execute tests again
- Check results:
- All pass → Report success, exit loop
- Still failing → Analyze new failures, iterate (up to max attempts)
- Max attempts reached → Report "needs manual review"
Fix History Tracking
Track all fix attempts for reporting:
{
"fixHistory": [
{
"attempt": 1,
"stepId": "click-submit",
"original": { "click": "Submit" },
"fixed": { "click": "Submit Form" },
"confidence": 85,
"result": "PASS"
},
{
"attempt": 1,
"stepId": "find-welcome",
"original": { "find": "Welcome" },
"fixed": { "find": "Welcome back" },
"confidence": 45,
"result": "FAIL",
"note": "Needs manual review"
}
]
}
Common Fix Patterns
| Failure | Fix Strategy | Typical Confidence |
|---|---|---|
| Element text changed | Search page for similar text | 70-90% |
| Element not visible | Add wait step before action | 80-95% |
| Timeout | Increase timeout value | 90% |
| Selector invalid | Switch to text-based match | 75-85% |
| Navigation redirect | Update URL to final destination | 85-95% |
| Multiple matches | Add more specific context | 60-80% |
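For example, an "element not visible" failure is typically fixed by inserting a wait step before the original action. A sketch, using the `wait` object form listed earlier (the selector and `state` value are illustrative assumptions):

```json
[
  { "wait": { "selector": "#submit", "state": "visible" } },
  { "click": "Submit" }
]
```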
Fix Mode Integration
When --fix is enabled in the test command:
# Interactive fix (prompt when confidence < 80%)
/doc-detective:test docs/guide.md --fix
# Fully autonomous (apply all fixes)
/doc-detective:test docs/guide.md --fix --auto-fix
# Custom threshold (prompt when confidence < 60%)
/doc-detective:test docs/guide.md --fix --fix-threshold 60
Checklist: Before Completing Any Task
⚠️ MANDATORY PRE-RESPONSE CHECKLIST
You MUST verify ALL of these before returning a test spec:
- NO "action" property - Check every step: if you see `"action":` anywhere, DELETE IT and rewrite. Use `"goTo":`, `"click":`, `"find":` etc. as the key itself.
- Text-based matching - Use `"click": "Submit"`, not `"click": "#btn"`.
- Valid structure - A `tests` array, with `testId` and `steps` in each test.
- EXECUTE VALIDATION - Run `node ./scripts/dist/validate-test.js` on the spec file and include the output in your response.
- Validation PASSED - The output must show "Validation PASSED". If not, fix and re-run.
STOP: Did you run the validator and show its output? If not, do it now before responding.
Actions Reference
For complete action documentation, see references/actions.md.
Quick reference - each action name IS the JSON key:
- `{ "goTo": "https://..." }` - Navigate to URL
- `{ "click": "Button Text" }` - Click element (prefer text)
- `{ "find": "Expected Text" }` - Verify element exists (prefer text)
- `{ "type": { "keys": "...", "selector": "#..." } }` - Type keys
- `{ "httpRequest": { "url": "...", "method": "GET" } }` - HTTP request
- `runShell` - Execute shell command
- `screenshot` - Capture PNG
- `wait` - Pause or wait for element
- `checkLink` - Verify URL returns OK status
- `loadVariables` - Load .env file
- `saveCookie` / `loadCookie` - Session persistence
- `record` / `stopRecord` - Video capture
External Resources
- Main docs: https://doc-detective.com
- Test structure: https://doc-detective.com/docs/get-started/tests
- Actions: https://doc-detective.com/docs/category/actions
- GitHub: https://github.com/doc-detective/doc-detective
Do not assume Doc Detective works like other test runners. Verify against official documentation when uncertain.