experiment-designer
Experiment Designer Skill
Produce rigorous experiment designs from product hypotheses, and interpret results with statistical and practical significance — so you can defend every decision to a sceptical engineering lead or data scientist.
Required Inputs
Ask the user for these if not provided: For experiment design:
- Hypothesis (what change, what metric, what expected movement)
- Current baseline metric value
- Minimum detectable effect (MDE) — the smallest lift worth caring about
- Available daily sample size
For results interpretation:
- Control and variant results (raw numbers or percentages)
- P-value or confidence interval
- Run duration (days)
- Any anomalies observed during the test
Two-Phase Process
Phase 1: Experiment Design
- Restate hypothesis as: "If we [change], we expect [metric] to [move by X%] because [reason]"
- Define control and variant clearly
- Select primary metric (one only) and secondary guardrail metrics (2-3 max)
- Calculate required sample size from MDE and baseline
- Estimate run time in days
- Set pre-defined success criteria before the test runs — no moving goalposts
- Flag design risks: novelty effects, seasonal confounds, multiple testing issues, network effects, sample ratio mismatch
Phase 2: Results Interpretation
- Assess statistical significance (p < 0.05 threshold)
- Assess practical significance: was the lift meaningful for the business, not just real?
- Interpret confidence intervals
- Investigate confounding factors
- Recommend: Ship / Iterate / Kill / Run follow-up test
- Validate — Confirm the test ran for the full planned duration. Flag if it was stopped early (peeking problem). Confirm sample ratio mismatch did not occur.
Output Structure
[Design or Results header based on phase]
Hypothesis: "If we [change], we expect [metric] to [move by X%] because [reason]"
Primary metric: [One metric only] Guardrail metrics: [2-3 max] Required sample size: [n per variant] Estimated run time: [days] Pre-defined success threshold: [specific number] Design risk flags: [any concerns]
Results (Phase 2 only): Statistical significance: [p-value and conclusion] Practical significance: [lift size vs. business threshold] Recommendation: Ship / Iterate / Kill / Follow-up — [rationale]
Quality Checks
- Hypothesis specifies the change, the metric, the direction, and the reason
- Primary metric is singular — guardrail metrics are secondary
- Success criteria are defined before the test launches (not after seeing results)
- Test was not stopped early (or flagged clearly if it was)
- Practical significance assessed separately from statistical significance
- Sample ratio mismatch is checked in results interpretation
More from mohitagw15856/pm-claude-skills
user-research-synthesis
Analyze and synthesize user research findings into structured, actionable insights. Use when given user research data, interview transcripts, survey results, or user feedback that needs to be analyzed and summarised. Produces a themed synthesis with prevalence data, supporting quotes, pain points analysis, feature request prioritisation, and recommended next steps.
26prd-template
Create a Product Requirements Document following proven PM template structure. Use when asked to write a PRD, product spec, feature specification, or requirements document for a new feature or product. Produces a complete PRD with problem statement, user stories, functional requirements, technical considerations, and success metrics.
20stakeholder-update
Create executive stakeholder updates following proven communication frameworks. Use when the user needs to create a status update, progress report, executive summary, or communication for leadership, stakeholders, or executives.
19competitive-analysis
Analyze competitors and create competitive landscape documentation with feature matrices, positioning maps, and strategic recommendations. Use when asked to analyze competitors, create competitive analysis, compare features with competitors, build a competitive landscape, track competitive positioning, or prepare sales battlecard inputs. Produces structured competitor profiles, feature comparison matrix, win/loss analysis, and prioritised strategic recommendations.
18meeting-notes
Structure and format meeting notes following PM best practices. Use when asked to create meeting notes, format discussion notes, capture action items, or document decisions from any meeting type. Produces structured notes with decisions, action items (owner + deadline), open questions, and next steps.
17executive-summary
Write an executive summary for any document, report, or proposal. Use when asked to write an executive summary, management summary, briefing paper, or one-pager for senior stakeholders. Produces a structured summary that busy executives can read in under 3 minutes and act on.
15