hightriad
HighTriad
Build professional, production-grade system designs that balance high concurrency, high performance, and high availability.
Core Workflow
-
Clarify requirements Collect workload shape, critical paths, and failure tolerance. Ask for absolute targets: RPS/QPS, p95/p99 latency, peak traffic, growth rate, error budget, RTO/RPO, data consistency needs.
-
Define SLIs/SLOs Choose 3 to 5 primary SLIs and map them to explicit SLOs. Prefer latency percentiles, availability, throughput, and freshness over averages.
-
Model the system Sketch request flow and identify bottlenecks across compute, network, storage, and dependency call chains. Enumerate concurrency boundaries: queues, pools, locks, partitions, and external rate limits.
-
Design for scale Select scaling axis: horizontal, vertical, data partitioning, or event-driven async. Define partitioning keys, load balancing strategy, and caching boundaries.
-
Design for performance Minimize critical path length, reduce tail latency, and cut remote calls. Choose data access patterns, indexing, caching tiers, and compression tradeoffs.
-
Design for availability Add redundancy, fault isolation, and graceful degradation. Define failover paths, health checks, circuit breakers, and data durability strategy.
-
Validate with tests Create load, stress, soak, and chaos test plans aligned to SLOs. Plan rollback and mitigation steps for regression risk.
-
Operationalize Define observability, alerting, runbooks, and capacity review cadence. Prepare incident response playbooks and on-call readiness.
Reference Map
- Read
references/tech-kubernetes.mdwhen the system runs on Kubernetes or needs autoscaling, multi-zone placement, or service mesh guidance. - Read
references/tech-redis.mdwhen using Redis for caching, rate limiting, queues, or session storage. - Read
references/tech-postgresql.mdwhen PostgreSQL is the primary datastore or when designing replicas, partitioning, and indexing. - Read
references/tech-kafka.mdwhen using Kafka for event streaming, async pipelines, or decoupling services. - Read
references/tech-nginx.mdwhen edge routing, TLS termination, or L7 load balancing is required. - Read
references/industry-finance.mdfor trading, payments, or regulated workloads. - Read
references/industry-ecommerce.mdfor flash sales, promotions, and cart/checkout workloads. - Read
references/industry-iot.mdfor device fleets, bursty telemetry, or edge connectivity constraints. - Read
references/industry-realtime.mdfor chat, gaming, or real-time collaboration systems. - Read
references/templates.mdwhen the user needs architecture, SLO, or capacity plan templates. - Read
references/testing-drills.mdwhen load testing, chaos testing, or DR drills are requested.
Concurrency Design Checklist
- Define concurrency target by peak RPS and concurrent users.
- Bound resource usage with worker pools, queues, and backpressure.
- Partition workload by tenant, shard key, or request type.
- Use async I/O for network and storage operations.
- Limit shared-state contention with sharding or lock-free structures.
- Apply rate limiting at edge and internal dependencies.
- Protect downstream services with bulkheads and timeouts.
Performance Design Checklist
- Reduce critical path by collapsing or parallelizing remote calls.
- Minimize p99 latency contributors: cold starts, GC pauses, locks, slow queries.
- Add caching with explicit invalidation rules.
- Use read replicas or materialized views for read-heavy workloads.
- Choose data formats and compression based on CPU vs bandwidth tradeoff.
- Optimize queries with indexes and selective projections.
- Warm pools and caches for predictable latency.
Availability Design Checklist
- Eliminate single points of failure with redundancy across zones.
- Use health checks and automated failover.
- Separate control plane and data plane failure domains.
- Support graceful degradation for non-critical features.
- Define RTO/RPO per subsystem and validate with DR drills.
- Ensure idempotency for retries and at-least-once delivery.
- Protect data with backups, versioning, and restore verification.
Validation Plan
- Run load tests to p95/p99 targets at expected peak.
- Run stress tests beyond peak to validate backpressure behavior.
- Run soak tests to surface memory leaks and queue buildup.
- Run chaos tests on dependencies and network partitions.
- Validate auto-scaling and failover timing against RTO.
Deliverables
- Architecture diagram with data flow and failure domains.
- SLI/SLO document with error budgets and alert thresholds.
- Capacity plan with scaling triggers and cost projections.
- Risk register with mitigations and rollback plans.
- Test plan covering load, stress, soak, and chaos.
- Operational runbook with on-call actions and dashboards.
Red Flags
- SLOs not defined or only averages tracked.
- Unbounded queues or unlimited fiber/thread spawning.
- Single shared database without partitioning plan at scale.
- No clear rollback or mitigation plan for deploys.
- No chaos testing or failover verification.
Output Template
Provide a concise plan with headings in this order:
- Targets (SLIs/SLOs, RTO/RPO)
- Workload model (traffic shape, hotspots, dependencies)
- Architecture (flow, scaling axis, partitions)
- Performance (critical path, caching, data access)
- Availability (redundancy, failover, degradation)
- Validation (tests and success criteria)
- Ops (observability, runbooks, incident response)
More from jwcodewrote/agent_skills_plugin
meihua-yishu
A comprehensive traditional Chinese metaphysics agent ("Yi Jing" expert) that combines Mei Hua Yi Shu (Timing) with Gemini AI for modern interpretation.
77ai-search-browser-use
Use this skill when a task needs AI-assisted web research via a real browser. Uses Chrome CDP (Chrome DevTools Protocol) as the primary automation method, with browser-use as fallback. Supports Gemini + Qwen queries with consolidated answers and citations.
11li-er
A persona-based skill for simulating an "Ideological Inquisitor" - an extremist agent who prioritizes ideology over facts. Used for roleplay, debate simulation, and critical thinking training (detecting manipulation).
9shaoji-style-writer
A specialized scriptwriting persona (Shaoji Style) that packages deep existential tragedy within a "wholesome, positive" facade. Expert in constructing emotional traps, tragic aesthetics, and philosophical dialogues about "human resilience.
7daipai-execution-master(带派)
A high-energy, combat-style execution agent ("Yu Jie" style) that prioritizes speed, efficiency, and aggressive positivity. Best for rapid prototyping, clearing backlogs, and boosting morale.
6dialectical-reasoning(唯物辩证法)
A cognitive operating system based on Dialectical Materialism (Materialism, Universal Connection, Eternal Development, Unity of Opposites) to force objective, non-linear analysis of complex engineering and logic problems.
5