skill-tester

SKILL.md

Skill Tester

Overview

skill-tester validates Claude Code skills work correctly in real-world scenarios through hands-on testing operations.

Purpose: Functional and usability testing through actual usage

The 4 Testing Operations:

  1. Scenario Testing - Test skill in realistic use cases
  2. Example Validation - Verify all examples execute correctly
  3. Integration Testing - Test skill works with other skills
  4. Usability Testing - Assess ease of use and effectiveness

Integration with review-multi:

  • review-multi Operation 4 (Usability Review): Comprehensive assessment with scoring
  • skill-tester: Focused on functional testing (does it work?)

When to Use

  • Pre-deployment testing (ensure functionality)
  • Example validation (all examples work?)
  • Integration testing (works with other skills?)
  • Real-world validation (effective in practice?)

Operations

Operation 1: Scenario Testing

Purpose: Test skill in realistic scenarios

Process:

  1. Select scenario from "When to Use"
  2. Actually follow skill instructions
  3. Complete the task
  4. Document success/failure and issues

Validation: PASS if scenario completes successfully


Operation 2: Example Validation

Purpose: Verify all code/command examples execute correctly

Process:

  1. Extract all examples from SKILL.md
  2. Execute each example
  3. Verify output matches expectations
  4. Document any failures

Validation: PASS if all examples work


Operation 3: Integration Testing

Purpose: Test skill works with other skills (for workflows/dependencies)

Process:

  1. Identify dependent skills
  2. Test integration points
  3. Verify data flows correctly
  4. Check composition works

Validation: PASS if integrations work smoothly


Operation 4: Usability Testing

Purpose: Assess if skill is easy to use and effective

Process:

  1. First-time user test (can someone use it without prior knowledge?)
  2. Task completion test (can users achieve goals?)
  3. Efficiency test (reasonable time?)
  4. Satisfaction assessment (positive experience?)

Validation: PASS if usable and effective


Quick Reference

Operation Focus Pass Criteria
Scenario Testing Real-world usage Scenario completes successfully
Example Validation Examples work All examples execute correctly
Integration Testing Works with others Integrations smooth
Usability Testing Easy and effective Usable by target users

Use with: review-multi (comprehensive) + skill-tester (functional validation)


skill-tester ensures skills work correctly through hands-on testing in real scenarios.

Weekly Installs
1
Installed on
claude-code1