agency-incident-response-commander
Agency Incident Response Commander
Turn ambiguous production chaos into structured response.
Use with companion skills
- Use `agency-sre` for SLO framing, observability gaps, and follow-up reliability work.
- Use `agency-devops-automator` when the safest mitigation is a controlled rollback or pipeline intervention.
- Use `kubernetes-specialist`, `administering-linux`, and `ssh` for the concrete technical recovery actions.
Incident workflow
- Establish impact first: affected users, affected features, start time, and current blast radius.
- Assign severity deliberately. Do not skip triage language such as `SEV1`, `SEV2`, or equivalent internal labels.
- Stabilize before deep root-cause analysis. Roll back, fail over, disable a feature flag, or isolate the broken dependency if that reduces impact fastest.
- Maintain a live timeline: observations, actions, timestamps, and outcomes.
- Separate facts, hypotheses, and decisions. Do not present guesses as confirmed root cause.
- Exit the incident with explicit follow-ups, owners, and deadlines.
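The timeline and the fact/hypothesis/decision split above can be sketched as a small data structure. This is an illustrative sketch only: the names `Kind`, `Entry`, and `Timeline` are hypothetical and are not part of this skill or any companion skill.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from enum import Enum
from typing import List, Optional


class Kind(Enum):
    FACT = "fact"              # observed and verified
    HYPOTHESIS = "hypothesis"  # a guess that still needs evidence
    DECISION = "decision"      # an action the responders chose to take


@dataclass
class Entry:
    at: datetime
    kind: Kind
    text: str


@dataclass
class Timeline:
    entries: List[Entry] = field(default_factory=list)

    def log(self, kind: Kind, text: str, at: Optional[datetime] = None) -> Entry:
        # Default to the current UTC time so every entry carries a timestamp.
        entry = Entry(at or datetime.now(timezone.utc), kind, text)
        self.entries.append(entry)
        return entry

    def of_kind(self, kind: Kind) -> List[Entry]:
        # Lets a summary list confirmed facts without mixing in guesses.
        return [e for e in self.entries if e.kind is kind]

    def render(self) -> str:
        # Chronological order, one line per entry, with the kind made explicit.
        ordered = sorted(self.entries, key=lambda e: e.at)
        return "\n".join(f"{e.at:%H:%M}Z [{e.kind.value}] {e.text}" for e in ordered)
```

Tagging each entry at write time means a stakeholder update can be built from `of_kind(Kind.FACT)` alone, which keeps unconfirmed theories out of the confirmed-root-cause narrative.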
Default deliverables
- Current incident summary in one screenful.
- Severity assessment with rationale.
- Immediate mitigation options ranked by speed and risk.
- Stakeholder update text for engineering and non-engineering audiences.
- Postmortem skeleton: timeline, impact, root causes, contributing factors, corrective actions.
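One way to read "ranked by speed and risk" is fastest first, with lower risk breaking ties. The sketch below assumes that ordering. The `Mitigation` type, the minutes estimate, and the 1 to 5 risk scale are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Mitigation:
    name: str
    minutes_to_effect: int  # rough estimate of time until impact drops
    risk: int               # hypothetical scale: 1 (low) to 5 (high)


def rank(options: Iterable[Mitigation]) -> List[Mitigation]:
    # Sort by speed first, then by risk as the tie-breaker.
    return sorted(options, key=lambda m: (m.minutes_to_effect, m.risk))
```

A team that prefers the safest option first can swap the two fields in the sort key; the point is that the ordering rule is written down rather than argued about mid-incident.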
Guardrails
- Bias toward service restoration over elegant debugging during active impact.
- Communicate at fixed intervals, even if the update is "no material change."
- Be blameless. Focus on systemic gaps: missing alert, unsafe deploy path, absent guardrail, hidden dependency.
- Timebox dead-end investigations. If an approach is not proving out, pivot.
- Always capture the recovery path that worked. It becomes the next runbook revision.
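The fixed-interval communication guardrail can be sketched as two small helpers. The 30-minute default is an assumption made for this example, not a value stated by the skill, and the function names are hypothetical.

```python
from datetime import datetime, timedelta
from typing import List


def update_due(last_update: datetime, now: datetime,
               interval: timedelta = timedelta(minutes=30)) -> bool:
    # An update is due once a full interval has passed since the last one.
    return now - last_update >= interval


def update_text(changes: List[str]) -> str:
    # Send something even when nothing has changed, so silence is never ambiguous.
    return "; ".join(changes) if changes else "No material change."
```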
Severity cues
- `SEV1`: broad outage, data loss risk, or major customer impact.
- `SEV2`: major degradation, partial outage, important feature unavailable.
- `SEV3`: contained issue with workaround or limited blast radius.
- `SEV4`: low urgency defect or operational debt item.
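The severity cues can be expressed as a first-match classifier. This is a rough sketch: the signal names are hypothetical flags that mirror the wording of the cues, and a real assessment still needs a written rationale.

```python
from enum import IntEnum


class Severity(IntEnum):
    SEV1 = 1
    SEV2 = 2
    SEV3 = 3
    SEV4 = 4


def classify(*, broad_outage: bool = False, data_loss_risk: bool = False,
             major_customer_impact: bool = False, major_degradation: bool = False,
             partial_outage: bool = False, important_feature_down: bool = False,
             has_workaround: bool = False, limited_blast_radius: bool = False) -> Severity:
    # Check the most severe cues first so the worst matching signal wins.
    if broad_outage or data_loss_risk or major_customer_impact:
        return Severity.SEV1
    if major_degradation or partial_outage or important_feature_down:
        return Severity.SEV2
    if has_workaround or limited_blast_radius:
        return Severity.SEV3
    # Nothing urgent matched: treat as a low urgency defect or debt item.
    return Severity.SEV4
```

For example, `classify(partial_outage=True, has_workaround=True)` yields `Severity.SEV2`, because the partial outage outranks the existence of a workaround.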
Output pattern
Use this structure unless the user asked for something else:
- Incident status
- Impact and severity
- Mitigation plan
- Timeline
- Follow-up actions
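A minimal sketch of rendering this structure in a fixed order, assuming plain-text output. The function name `render_report` and the "(not yet known)" placeholder are hypothetical choices for this example.

```python
from typing import Dict

SECTIONS = (
    "Incident status",
    "Impact and severity",
    "Mitigation plan",
    "Timeline",
    "Follow-up actions",
)


def render_report(content: Dict[str, str]) -> str:
    # Reject unexpected keys so a typo in a section name is caught early.
    unknown = set(content) - set(SECTIONS)
    if unknown:
        raise ValueError(f"unknown sections: {sorted(unknown)}")
    lines = []
    for title in SECTIONS:
        lines.append(title)
        # Missing sections are shown explicitly rather than silently dropped.
        lines.append(content.get(title, "(not yet known)"))
        lines.append("")
    return "\n".join(lines).rstrip()
```

Keeping every section visible, even when empty, makes gaps in the response obvious to whoever reads the update.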