scientific-thinking
Scientific Thinking
Enforces the scientific method as an investigation discipline. Governs hypothesis formulation, predictions, experiment design, and evidence-based conclusions.
Shared Resources
Load these references at the start of any investigation. A references index is available for a quick map of all shared files.
- Unified Investigation Template — output structure for all investigation work
- Investigation Workflow — mermaid diagrams of the full scientific method flow
Companion Skill
evidence-first-debugging handles observation recording, evidence IDs, and causality
validation. Both skills use the same Unified Investigation Template output structure. Load
evidence-first-debugging when the task requires structured evidence tracking alongside
hypothesis work.
Activation Triggers
Activate this skill when the prompt contains:
- "unknown cause", "strange behavior", "intermittent"
- "architecture decision", "previous attempts failed"
- "root cause", "investigation"
Scope Boundary
Apply this skill to problems with genuine uncertainty — unknowns, failed attempts, complex architecture trade-offs.
Tasks outside scope: typo fixes, simple additions, tasks with explicit step-by-step instructions already provided. For those, execute directly.
Investigation Stages
Enforce stages in this order. Each stage gates the next.
flowchart TD
S0[Stage 0-3: Observation<br>Record ONLY factual observations<br>No interpretation] --> S4
S4[Stage 4: Hypothesis Formulation<br>State H0 null and H1 alternative<br>Both must be falsifiable] --> S5
S5[Stage 5: Prediction<br>Write: If H1 is true, we should observe...] --> S6
S6[Stage 6-7: Experiment Design<br>Define Path A and Path B<br>Identify confounds] --> S8
S8[Stage 8-9: Execute<br>Run experiments<br>Record results verbatim] --> S10
S10[Stage 10-11: Conclusion<br>Classify causality<br>State decision<br>Cite evidence IDs] --> Done
Done{status?}
Done -->|resolved-verified| Retro[Notify user: retrospective-analyst<br>agent can produce mermaid timeline<br>and retrospective from iteration log]
Done -->|unresolved or mitigated| S0
Stage Rules
Observation (sections 0-3):
- Record only what is directly observable — no causal language, no interpretations
- Each observation requires an evidence ID (delegated to
evidence-first-debugging)
Hypothesis Formulation (section 4):
- H0 (null): the default assumption — no effect, no cause
- H1 (alternative): the proposed cause or mechanism
- Both must be falsifiable — if a hypothesis cannot be tested, rewrite it
Prediction (section 5):
- Write one sentence per hypothesis: "If H1 is true, we should observe X when we do Y"
- Predictions must reference observable, measurable outcomes
Experiment Design (sections 6-7):
- Path A: test that would confirm H1
- Path B: test that would refute H1 (controls for confounds)
- List confounds explicitly — variables that could produce the same result without H1 being true
Execute (sections 8-9):
- Run experiments exactly as designed — record results verbatim, not summarized
- If execution diverges from design, restart from Experiment Design with updated constraints
Conclusion (sections 10-11):
- Classify each action-result link:
causal-supported,correlated-only,unrelated, orunknown - State the decision derived from evidence
- Every claim must cite an evidence ID — no unsupported assertions
More from jamie-bitflight/claude_skills
perl-lint
This skill should be used when the user asks to lint Perl code, run perlcritic, check Perl style, format Perl code, run perltidy, or mentions Perl Critic policies, code formatting, or style checking.
24brainstorming-skill
You MUST use this before any creative work - creating features, building components, adding functionality, modifying behavior, or when users request help with ideation, marketing, and strategic planning. Explores user intent, requirements, and design before implementation using 30+ research-validated prompt patterns.
11design-anti-patterns
Enforce anti-AI UI design rules based on the Uncodixfy methodology. Use when generating HTML, CSS, React, Vue, Svelte, or any frontend UI code. Prevents "Codex UI" — the generic AI aesthetic of soft gradients, floating panels, oversized rounded corners, glassmorphism, hero sections in dashboards, and decorative copy. Applies constraints from Linear/Raycast/Stripe/GitHub design philosophy: functional, honest, human-designed interfaces. Triggers on: UI generation, dashboard building, frontend component creation, CSS styling, landing page design, or any task producing visual interface code.
7python3-review
Comprehensive Python code review checking patterns, types, security, and performance. Use when reviewing Python code for quality issues, when auditing code before merge, or when assessing technical debt in a Python codebase.
7hooks-guide
Cross-platform hooks reference for AI coding assistants — Claude Code, GitHub Copilot, Cursor, Windsurf, Amp. Covers hook authoring in Node.js CJS and Python, per-platform event schemas, inline-agent hooks and MCP in agent frontmatter, common JSON I/O, exit codes, best practices, and a fetch script to refresh docs from official sources. Use when writing, reviewing, or debugging hooks for any AI assistant.
7agent-creator
Create high-quality Claude Code agents from scratch or by adapting existing agents as templates. Use when the user wants to create a new agent, modify agent configurations, build specialized subagents, or design agent architectures. Guides through requirements gathering, template selection, and agent file generation following Anthropic best practices (v2.1.63+).
6