eng-observability
Observability and Debugging Discipline
Intent
- Make it trivial to answer “what is happening” and “why” without attaching a debugger in production.
- Ensure logs, metrics, events, and traces capture user intent, environment, and failure context while protecting sensitive data.
Guiding Principles
- Prefer structured logs + correlation IDs over ad-hoc strings.
- Emit signals at every boundary (client, API, worker, contract invocation).
- Include context (user/session/network/chain) necessary to reproduce issues.
- Keep signal cost reasonable—throttle chatty paths, sample intelligently.
- Build fast local debugging loops (trace replay, state inspectors, dev wallets).
Workflow
- Identify critical paths affected and define success/error signals per path.
- Add/extend tracing spans or log blocks with consistent field names.
- Validate observability locally by simulating successes, errors, and timeouts; ensure signals reach the sink (console, APM, analytics, chain explorer).
- Document dashboards, queries, or CLI commands useful for post-deploy verification.
- For on-chain logic, emit events with canonical schema so downstream indexers can consume them.
Verification
- Run the code with verbose logging/tracing enabled; inspect outputs for clarity and privacy.
- Confirm metrics/counters appear where expected (APM, telemetry pipeline, analytics, chain explorer).
- Dry-run incident response: can you locate a test failure or simulated outage using only emitted signals? If not, iterate.
More from tjboudreaux/cc-plugin-engineering-excellence
eng-tdd
Enforces disciplined RED-GREEN-REFACTOR cycle—write failing test first, watch it fail, write minimal code to pass, then refactor.
2eng-verification
Mandatory checklist before claiming work is complete—run the evidence-producing command, inspect output, and only then state a result.
2eng-performance
Guard latency, memory, battery, bandwidth, and gas/compute budgets by measuring before and after every change.
2eng-user-impact
Anchor every engineering decision in user value, measurable outcomes, accessibility, and cross-platform experience coherence.
2meta-superpowers
Startup protocol for every task—discover applicable skills, run them, announce usage, and follow required workflows (brainstorming, TodoWrite checklists, etc.).
2eng-quality-handoff
Deliver changes with clear communication, reviewer-ready context, and follow-up accountability.
2