incident-response

Installation
SKILL.md

Incident Response

Handle production incidents systematically.

Severity Levels

Level Definition Response Time Examples
SEV1 Service down, all users affected Immediate Database crash, DNS failure, auth broken
SEV2 Major feature broken, many users affected < 30 min Payments failing, search not working
SEV3 Minor feature broken, workaround exists < 4 hours Export button broken, slow dashboard
SEV4 Cosmetic or low-impact issue Next business day Typo in UI, minor styling bug

Incident Workflow

1. Detect & Triage (first 5 minutes)

  • Acknowledge the incident — "I'm looking into this"
  • Determine severity level
Related skills
Installs
21
GitHub Stars
271
First Seen
Apr 12, 2026