qa

Installation

SKILL.md

QA Test Plan Generator

!IMPORTANT: Follow this process exactly. Do not skip steps.

Step 0: Parse Input

Parse $ARGUMENTS in two passes: first extract the output destination, then detect the diff target.

Pass A — Output mode

Strip and capture a trailing --terminal, --issue, or --file token from $ARGUMENTS. Store it as OUTPUT_MODE. If absent, default to terminal. The remaining tokens form the diff-target argument for Pass B.

Flag	`OUTPUT_MODE`	Effect
`--terminal` (or none)	`terminal`	Render the plan inline in the chat
`--issue`	`issue`	Create a GitHub issue (requires a GitHub remote)
`--file`	`file`	Write the plan to `./qa-plan-<timestamp>.md`

Pass B — Diff target

With the output flag removed, inspect what's left to determine the mode:

Pattern	Detection	Mode
40-char hex or short hash	Matches `/^[0-9a-f]{7,40}$/`	commit — diff from hash to HEAD
`#N` or bare number with valid PR	`gh pr view N` succeeds	pr — PR commit range
Bare number or `last N`	Matches `/^(last\s+)?\d+$/i`	count — last N commits
Empty	No argument	count with N=1 (last commit)

Disambiguation for bare numbers: Try gh pr view N --json number 2>/dev/null first. If it returns a valid PR, use pr mode. Otherwise, treat as "last N commits."

Step 1: Verify GitHub Remote (issue mode only)

Only run this check when OUTPUT_MODE == "issue":

gh repo view --json nameWithOwner --jq '.nameWithOwner'

If this fails, abort: "No GitHub remote detected. --issue requires a GitHub repository. Re-run with --terminal or --file."

For terminal and file modes, skip this step — they work in any repository.

Step 2: Extract Diff and Commit Log

Based on mode:

Commit mode

git log --oneline $HASH..HEAD
git diff --name-only $HASH..HEAD
git diff $HASH..HEAD

Count mode

git log --oneline -$N
git diff --name-only HEAD~$N..HEAD
git diff HEAD~$N..HEAD

PR mode

gh pr view $PR_NUMBER --json title,url,baseRefName,headRefName,number
gh pr diff $PR_NUMBER

Store the commit log, changed file list, and full diff for the next steps.

Step 3: Explore Codebase

Spawn 2 parallel Explore agents (Agent tool with subagent_type: "Explore") to build contextual understanding beyond the raw diff:

Agent 1: `change-analyzer`

Prompt: Read each changed file in full (not just diff hunks). For each file identify:

What feature area it belongs to (auth, payments, API, UI, etc.)
What user-facing behaviors changed
What API endpoints or routes were modified
What database/state changes occur
What the happy-path flow looks like for the change

Agent 2: `dependency-tracer`

Prompt: For each changed file, trace callers and consumers using Grep and Read. Identify:

Which UI screens, components, or flows consume the changed code
What existing tests cover the changed code (unit, integration, e2e) and what testing patterns/frameworks the project uses
What config changes, new env vars, or migrations are in the diff
What new dependencies were added (package.json, requirements.txt, etc.)
What feature flags or permission changes are present

Provide both agents with the changed file list and commit log from Step 2.

Step 4: Generate QA Plan

Synthesize the exploration findings into a structured QA plan. This plan targets manual-only scenarios — things that require human eyes, real environments, or multi-step user interaction to verify.

Exclusion filter — do NOT include scenarios that are:

Pure logic/computation (unit-testable)
Input validation & boundary values for individual fields (unit-testable)
Individual API endpoint request/response correctness (integration-testable)
Database CRUD correctness (integration-testable)
Error handling for specific error codes/messages (unit-testable)
Permission checks on individual endpoints (integration-testable)

If the dependency-tracer found existing automated test coverage for a behavior, exclude it.

Grouping: Cluster changes by feature area. Derive areas from directory structure, component/module boundaries, API route groups, or file naming patterns.

For each feature area, generate scenarios in these categories:

User flows & interactions — multi-step workflows, navigation paths, form submission sequences spanning multiple components/pages
Visual & layout — UI rendering, responsive behavior, content overflow, visual regressions, animation/transition correctness
Cross-system integration — behavior across service boundaries, third-party integrations, webhook flows, real external API behavior
State & data consistency — race conditions, concurrent user actions, stale data, cache invalidation, optimistic update rollbacks
Environment-specific behavior — browser/device differences, network conditions (slow/offline), timezone/locale effects, feature flag combinations

Only include categories relevant to the changes — omit empty categories.

Additionally generate:

Prerequisites — inferred from config changes: env vars, migrations, feature flags, required access/roles, test data setup
Regression checks — user-visible regressions only (not internal correctness already covered by automated tests)
Error handling — graceful degradation visible to the user (not API error codes or programmatic error responses)

Step 5: Output QA Plan

Render the plan body from the shared template below, then route it to the destination chosen by OUTPUT_MODE.

Shared body template

All three output modes use the same body. Substitute bracketed placeholders; omit **Related PR:** #N unless diff-target mode was pr.

## QA Test Plan

**Source:** [commit range or PR reference]
**Generated:** [today's date]
**Related PR:** #N  <!-- include only if PR mode -->

---

### Prerequisites

> Complete before starting QA.

- [ ] [prerequisite 1]
- [ ] [prerequisite 2]

---

### 1. [Feature Area Name]

#### 1.1 [Scenario Title]
**Preconditions:** [state/setup needed]

| Step | Action | Expected Result |
|------|--------|-----------------|
| 1 | [action] | [expected] |
| 2 | [action] | [expected] |

- [ ] Verified

**Edge cases:**
- [ ] [boundary / negative / error scenario]
- [ ] [another edge case]

#### 1.2 [Next Scenario]
...

---

### 2. [Next Feature Area]
...

---

### Regression Checks

> Existing behavior that must remain unchanged.

- [ ] [regression scenario 1]
- [ ] [regression scenario 2]

---

### Error Handling

- [ ] [error state 1 — expected graceful behavior]
- [ ] [error state 2 — expected graceful behavior]

---

<sub>Generated by /qa from [input description]</sub>

Branch on `OUTPUT_MODE`

`terminal` (default)

Print the rendered plan directly in the chat response, wrapped in a fenced markdown code block so it renders as-is. Do not call gh or write files.

`issue`

Ensure the qa label exists, then create the issue. Use a HEREDOC with single-quoted delimiter to prevent shell interpolation:

gh label create qa --description "Manual QA test plan" --color "0E8A16" 2>/dev/null || true

gh issue create \
  --title "QA: [concise feature summary]" \
  --label "qa" \
  --body "$(cat <<'EOF'
[rendered body from the shared template above]
EOF
)"

`file`

Write the rendered plan to ./qa-plan-<timestamp>.md in the current working directory, where <timestamp> is date +%Y%m%d-%H%M%S. Use the Write tool (not shell redirection). Report the absolute path in Step 6.

Step 6: Confirm

Tailor the summary to OUTPUT_MODE:

terminal:

QA plan rendered above.
- [N] feature areas
- [N] test scenarios
- [N] edge cases

issue:

QA issue created: [URL]
- [N] feature areas
- [N] test scenarios
- [N] edge cases

file:

QA plan written to [absolute path]
- [N] feature areas
- [N] test scenarios
- [N] edge cases

Execution Notes

Output mode precedence: the suffix flag wins; if absent, default to terminal. --issue aborts early when no GitHub remote is configured (Step 1); --terminal and --file don't require one.
Large diffs (>50KB): Pass only the changed file list to explore agents and instruct them to read files directly instead of embedding the full diff.
PR from fork: If gh pr diff fails, fall back to git diff origin/$BASE...origin/$HEAD.
No prerequisites detected: Omit the Prerequisites section rather than leaving it empty.
Single small change: Still group under a feature area — even one scenario should follow the template for consistency.

Related skills

More from benjaming/ai-skills

Installs

Repository

benjaming/ai-skills

GitHub Stars

First Seen

Mar 21, 2026

Security Audits

Gen Agent Trust HubPass

SocketPass

SnykWarn

qa

QA Test Plan Generator

Step 0: Parse Input

Pass A — Output mode

Pass B — Diff target

Step 1: Verify GitHub Remote (issue mode only)

Step 2: Extract Diff and Commit Log

Commit mode

Count mode

PR mode

Step 3: Explore Codebase

Agent 1: `change-analyzer`

Agent 2: `dependency-tracer`

Step 4: Generate QA Plan

Step 5: Output QA Plan

Shared body template

Branch on `OUTPUT_MODE`

`terminal` (default)

`issue`

`file`

Step 6: Confirm

Execution Notes

More from benjaming/ai-skills

confluence-cli

atlassian-cli-jira

ralph-loop

interview

codex-cli

daily-standup

qa

QA Test Plan Generator

Step 0: Parse Input

Pass A — Output mode

Pass B — Diff target

Step 1: Verify GitHub Remote (issue mode only)

Step 2: Extract Diff and Commit Log

Commit mode

Count mode

PR mode

Step 3: Explore Codebase

Agent 1: change-analyzer

Agent 2: dependency-tracer

Step 4: Generate QA Plan

Step 5: Output QA Plan

Shared body template

Branch on OUTPUT_MODE

terminal (default)

issue

file

Step 6: Confirm

Execution Notes

More from benjaming/ai-skills

confluence-cli

atlassian-cli-jira

ralph-loop

interview

codex-cli

daily-standup

Agent 1: `change-analyzer`

Agent 2: `dependency-tracer`

Branch on `OUTPUT_MODE`

`terminal` (default)

`issue`

`file`