Incident Responder
Handle production incidents with urgency and precision, from initial triage through resolution to post-mortem. The workflows below aim to minimize downtime and prevent recurrence.
Core Workflows
Workflow 1: Incident Triage
- Detection - Confirm the incident and determine its scope
- Severity Assessment - Classify the impact level (SEV1-4)
- Communication - Notify stakeholders
- Team Assembly - Bring in the required responders
- Initial Diagnosis - Identify the likely cause
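The severity assessment step can be sketched as a small classifier. This is a minimal illustration only: the `Impact` fields and every threshold below are hypothetical assumptions, since this document does not define what separates SEV1 from SEV4. Replace them with your organization's own definitions.

```python
from dataclasses import dataclass


@dataclass
class Impact:
    """Observed impact of an incident. All fields are illustrative assumptions."""
    users_affected_pct: float  # share of users affected, 0-100
    core_function_down: bool   # is a critical user path unavailable?
    workaround_exists: bool    # can users still get their work done?


def classify_severity(impact: Impact) -> str:
    """Map impact to SEV1-SEV4 using hypothetical thresholds."""
    # SEV1: a critical path is down for most users.
    if impact.core_function_down and impact.users_affected_pct >= 50:
        return "SEV1"
    # SEV2: a critical path is down, or a large share of users is affected.
    if impact.core_function_down or impact.users_affected_pct >= 25:
        return "SEV2"
    # SEV3: a noticeable share of users is affected and has no workaround.
    if impact.users_affected_pct >= 5 and not impact.workaround_exists:
        return "SEV3"
    # SEV4: everything else (minor or cosmetic impact).
    return "SEV4"


if __name__ == "__main__":
    print(classify_severity(Impact(80, True, False)))
```

Encoding the rules as code keeps classification consistent between responders, but the thresholds still need to be agreed on by the team in advance.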
Workflow 2: Resolution
- Containment - Limit the impact and stop it from spreading
- Root Cause - Identify the underlying issue
- Fix Implementation - Deploy the solution
- Verification - Confirm the resolution
- Status Update - Communicate the resolution
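One way to make the verification step concrete is to require several consecutive healthy samples before declaring the incident resolved. The sketch below assumes you can collect a list of recent error rates from your monitoring system. The function name, the 1% threshold, and the three-sample window are all hypothetical choices, not values taken from this document.

```python
def is_resolved(error_rates, threshold=0.01, required_consecutive=3):
    """Return True if the latest samples are all at or below the threshold.

    error_rates: error-rate samples as fractions (0.01 == 1%), oldest first.
    threshold and required_consecutive are illustrative defaults.
    """
    # Not enough data yet to make a call.
    if len(error_rates) < required_consecutive:
        return False
    recent = error_rates[-required_consecutive:]
    return all(rate <= threshold for rate in recent)


if __name__ == "__main__":
    samples = [0.20, 0.005, 0.004, 0.003]  # made-up sample data
    print(is_resolved(samples))
```

Requiring a run of healthy samples rather than a single one reduces the chance of announcing resolution during a brief dip in errors.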
Workflow 3: Post-Mortem
- Timeline - Document what happened and when
- Root Cause Analysis - Run a 5 Whys analysis
- Action Items - Identify preventive measures
- Documentation - Write the post-mortem report
- Review - Share learnings with the team
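The timeline step lends itself to a small helper that sorts recorded events and renders them as a bullet list for the report. This is a sketch under stated assumptions: events are `(ISO 8601 timestamp, description)` pairs, and the function name and sample events are hypothetical.

```python
from datetime import datetime


def render_timeline(events):
    """Render (iso_timestamp, description) pairs as a chronological bullet list."""
    # Parse once, then sort by time so out-of-order notes end up in sequence.
    parsed = [(datetime.fromisoformat(ts), desc) for ts, desc in events]
    parsed.sort(key=lambda item: item[0])
    return "\n".join(
        f"- {when.strftime('%Y-%m-%d %H:%M')} {desc}" for when, desc in parsed
    )


if __name__ == "__main__":
    # Made-up events for illustration only.
    notes = [
        ("2024-01-01T10:20:00", "Rollback deployed"),
        ("2024-01-01T10:00:00", "Alert fired"),
        ("2024-01-01T10:05:00", "Incident declared"),
    ]
    print(render_timeline(notes))
```

Capturing timestamps as events happen, then sorting afterwards, tends to be more reliable than reconstructing the sequence from memory during the review.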
Quick Reference
| Action | Example prompt |
|---|---|
| Start incident | "We have a production incident" |
| Triage | "What's the severity and impact?" |
| Post-mortem | "Create post-mortem for incident" |