observability-alerting
Observability Alerting
Overview
Use this skill to design alerting that catches real incidents quickly without overwhelming responders.
Scope Boundaries
- Use this skill when the task matches the trigger condition described in the skill description.
- Do not use this skill when the primary task falls outside this skill's domain.
Shared References
- Alert threshold actionability rules: references/alert-threshold-actionability-rules.md
Templates And Assets
- Alert catalog template: assets/alert-catalog-template.csv
- Alert noise review checklist: assets/alert-noise-review-checklist.md
Inputs To Gather
- Critical user/system failure modes.
- Available telemetry signals and quality.
- On-call routing and escalation policy.
- Historical false-positive/false-negative patterns.
Deliverables
- Alert catalog with severity, owner, and runbook linkage.
- Threshold and routing policy.
- Noise-control and tuning plan.
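The first deliverable can be checked mechanically. The following is a minimal sketch, assuming the catalog is a CSV with columns named `name`, `severity`, `owner`, and `runbook_url`; these column names are hypothetical and may not match assets/alert-catalog-template.csv.

```python
import csv
import io

# Hypothetical column names; the real template's columns may differ.
REQUIRED_FIELDS = ("name", "severity", "owner", "runbook_url")


def find_incomplete_alerts(catalog_csv: str) -> dict[str, list[str]]:
    """Map each alert name to the required fields it leaves blank."""
    problems: dict[str, list[str]] = {}
    reader = csv.DictReader(io.StringIO(catalog_csv))
    for row_number, row in enumerate(reader, start=1):
        missing = [f for f in REQUIRED_FIELDS if not (row.get(f) or "").strip()]
        if missing:
            # Fall back to the row position when the name itself is blank.
            name = (row.get("name") or "").strip() or f"<row {row_number}>"
            problems[name] = missing
    return problems


# Illustrative data only; not taken from a real catalog.
SAMPLE_CATALOG = """name,severity,owner,runbook_url
checkout-error-rate,critical,payments-oncall,https://runbooks.example.com/checkout-errors
queue-depth-high,warning,,
"""

incomplete = find_incomplete_alerts(SAMPLE_CATALOG)
```

Here `incomplete` flags `queue-depth-high` as lacking an owner and a runbook link, which are the gaps the catalog deliverable is meant to rule out.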
Workflow
- Build initial alert catalog in assets/alert-catalog-template.csv.
- Set thresholds using references/alert-threshold-actionability-rules.md.
- Define routing/escalation by severity.
- Validate with assets/alert-noise-review-checklist.md.
- Publish tuning backlog and ownership.
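The routing step above can be sketched as a small severity-to-route table. The severity names, channels, and escalation delay below are hypothetical placeholders, not values taken from the referenced rules file; the point is only that every severity resolves to an explicit paging or non-paging route.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Route:
    channel: str                           # where the notification is delivered
    pages: bool                            # True if a human is paged
    escalate_after_minutes: Optional[int]  # None means no automatic escalation


# Hypothetical policy; real severities and channels come from the on-call policy.
ROUTING_POLICY = {
    "critical": Route(channel="pager", pages=True, escalate_after_minutes=15),
    "warning": Route(channel="team-chat", pages=False, escalate_after_minutes=None),
    "info": Route(channel="ticket-queue", pages=False, escalate_after_minutes=None),
}


def route_for(severity: str) -> Route:
    """Return the route for a severity, failing loudly on unknown values."""
    key = severity.strip().lower()
    if key not in ROUTING_POLICY:
        raise ValueError(f"no routing rule for severity {severity!r}")
    return ROUTING_POLICY[key]
```

Raising on an unknown severity, rather than falling back to a default route, keeps an unclassified alert from silently paging nobody.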
Quality Standard
- Alerts are actionable and owned.
- Critical paths have coverage with bounded noise.
- Paging vs non-paging intent is explicit.
Failure Conditions
- Stop when alerts are noisy, non-actionable, or ownerless.
- Stop when critical failure modes lack alert coverage.
- Escalate when poor alert quality puts the response to an SLO breach at risk.
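The stop conditions above can be approximated with a simple review pass. This is a sketch under stated assumptions: each alert record carries hypothetical `owner`, `covers`, `fired`, and `actionable` fields, and the 50% noise cutoff is an arbitrary illustration, not a threshold from this skill.

```python
def stop_reasons(alerts, critical_failure_modes, max_noise_ratio=0.5):
    """List reasons to stop: ownerless alerts, noisy alerts, uncovered failure modes."""
    reasons = []
    for alert in alerts:
        if not alert.get("owner"):
            reasons.append(f"{alert['name']}: no owner")
        fired = alert.get("fired", 0)
        if fired:
            # Share of firings that required no action from the responder.
            noise = (fired - alert.get("actionable", 0)) / fired
            if noise > max_noise_ratio:
                reasons.append(f"{alert['name']}: noisy ({noise:.0%} of firings needed no action)")
    covered = {mode for alert in alerts for mode in alert.get("covers", ())}
    for mode in sorted(set(critical_failure_modes) - covered):
        reasons.append(f"{mode}: no alert coverage")
    return reasons


# Illustrative data only.
alerts = [
    {"name": "checkout-error-rate", "owner": "payments-oncall",
     "covers": ["checkout-failure"], "fired": 10, "actionable": 9},
    {"name": "disk-usage-high", "owner": "",
     "covers": [], "fired": 20, "actionable": 2},
]
reasons = stop_reasons(alerts, ["checkout-failure", "login-failure"])
```

An empty result means none of the stop conditions were detected by this check; it does not by itself show that the alerts are actionable.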