valkey-ops
Valkey Operations Reference
Routing
- Install, build from source, package manager, Docker, Compose, systemd, bare metal, multi-instance -> Deployment
- Config tuning, maxmemory, eviction, encoding thresholds, lazyfree, logging, CPU pinning, workload presets, pubsub buffers -> Configuration
- High availability, Sentinel, failover detection, quorum, split-brain, min-replicas -> Sentinel
- Cluster setup, hash slots, resharding, node add/remove, atomic migration, replica migration, consistency -> Cluster
- Persistence, RDB, AOF, hybrid, fsync, BGSAVE, backup, restore, disaster recovery -> Persistence
- Replication, primary-replica, REPLICAOF, backlog, diskless sync, dual-channel, replication lag -> Replication
- ACL, TLS, certificates, mutual TLS, protected mode, rename-command, hardening, network security -> Security
- Monitoring, INFO, metrics, Prometheus, Grafana, alerting, commandlog, slow log -> Monitoring
- Performance, I/O threads, memory fragmentation, defragmentation, latency, durability, client-side caching, CLIENT TRACKING, benchmarking -> Performance
- OOM, out of memory, crashes, slow commands, replication lag diagnosis, cluster partitions, network splits, diagnostics -> Troubleshooting
- Version upgrades, compatibility, Redis to Valkey migration, rolling upgrade -> Upgrades
- Kubernetes, Helm, operators, StatefulSet, PVC, probes, resource sizing, kernel tuning -> Kubernetes
- Capacity planning, memory sizing, connection planning, cluster sizing -> Operations
- Pre-launch check, production readiness, go-live checklist -> Production Checklist
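The routing list above is effectively a keyword-to-section lookup. A minimal sketch of that idea follows, assuming simple case-insensitive substring matching. The `ROUTES` table and the `route` function are hypothetical names introduced for illustration, and the keywords are only a small subset copied from the list, not the full set.

```python
# Hypothetical keyword-to-section lookup. Keywords are a subset copied from
# the routing list above; section names match the reference headings.
ROUTES = {
    "Deployment": ["docker", "compose", "systemd", "bare metal"],
    "Configuration": ["maxmemory", "eviction", "lazyfree"],
    "Sentinel": ["sentinel", "quorum", "split-brain"],
    "Cluster": ["hash slots", "resharding"],
    "Persistence": ["rdb", "aof", "bgsave", "backup"],
    "Troubleshooting": ["oom", "out of memory", "slow commands"],
}


def route(query):
    """Return every section whose keywords occur in the query, in table order."""
    text = query.lower()
    # Plain substring matching: short keywords such as "oom" can also match
    # inside unrelated words, so a real router would likely tokenize first.
    return [
        section
        for section, keywords in ROUTES.items()
        if any(keyword in text for keyword in keywords)
    ]
```

A query can match more than one section, for example a failed BGSAVE caused by memory pressure touches both Persistence and Troubleshooting, so the sketch returns a list rather than a single section.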
Deployment
| Topic | Reference |
|---|---|
| Package managers, building from source, build flags | install |
| Docker images (official, Bitnami), cluster example | docker-images |
| Compose patterns, volumes, networking, config injection | docker-patterns |
| systemd service, kernel tuning, multi-instance | bare-metal |
Configuration
| Topic | Reference |
|---|---|
| Essential parameters with verified defaults | essentials |
| Eviction policies, LRU/LFU tuning | eviction |
| Memory encoding thresholds per data type | encoding |
| Config presets by workload (cache, store, session, queue, rate limiter) | workload-presets |
| Lazy free config (UNLINK, async eviction/expiry) | lazyfree |
| Logging, OOM score, shutdown, CPU pinning, unix sockets, active expiration, protocol limits | advanced |
| Pub/Sub buffer limits, keyspace notifications, sharded pub/sub, subscriber memory | pubsub |
Sentinel (High Availability)
| Topic | Reference |
|---|---|
| How Sentinel works, failure detection, election | architecture |
| Step-by-step deployment, config directives | sentinel-deployment |
| Tuning, cross-DC, Docker/NAT, coordinated failover, systemd | sentinel-advanced |
| Split-brain prevention, min-replicas settings | split-brain |
Cluster
| Topic | Reference |
|---|---|
| Network requirements, config, cluster creation, hash slots | setup |
| Resharding, adding/removing nodes, atomic migration (Valkey 9.0) | resharding |
| Manual failover, health checks, replica migration, scaling, rolling restart runbook | operations |
| Consistency guarantees, write safety, partition behavior | consistency |
Persistence
| Topic | Reference |
|---|---|
| RDB configuration, save directives, BGSAVE | rdb |
| AOF configuration, fsync policies, hybrid mode | aof |
| Automated backup scripts, off-site backup, retention | backup-strategies |
| Disaster recovery, FLUSHALL recovery, verification | disaster-recovery |
Replication
| Topic | Reference |
|---|---|
| Primary-replica setup, REPLICAOF, sync mechanisms | setup |
| Backlog sizing, diskless sync, dual-channel, Docker/NAT | tuning |
| min-replicas safety, critical warnings, data loss prevention | safety |
Security
| Topic | Reference |
|---|---|
| ACL users, roles, categories, practical examples | acl |
| TLS setup, certificates, mutual TLS, replication encryption, cluster bus encryption | tls |
| Defense in depth, protected mode, network hardening | hardening |
| Disabling dangerous commands via rename-command and ACL | rename-commands |
Monitoring
| Topic | Reference |
|---|---|
| INFO sections, critical metrics, diagnostic commands | metrics |
| Prometheus exporter setup, scrape configs | prometheus |
| Grafana dashboards, panel definitions, PromQL queries | grafana |
| Alert rules YAML (complete rule set) | alerting-rules |
| Threshold tuning, recording rules, Alertmanager routing | alerting-config |
| Commandlog (slow/large request/reply tracking) | commandlog |
Performance
| Topic | Reference |
|---|---|
| I/O threads config, when to enable, thread count | io-threads |
| maxmemory, eviction, encoding, fragmentation | memory |
| Latency diagnosis workflow, LATENCY DOCTOR, watchdog | latency |
| Durability vs performance spectrum, TCP tuning, client connection tuning, kernel tuning, Valkey 9.0 features | durability |
| Active defragmentation config and monitoring | defragmentation |
| Client-side caching (CLIENT TRACKING) | client-caching |
| valkey-benchmark, valkey-perf-benchmark, best practices | benchmarking |
Troubleshooting
| Topic | Reference |
|---|---|
| Out of memory: symptoms, diagnosis, resolution | oom |
| Replication lag: diagnosis, backlog, buffer tuning | replication-lag |
| Slow commands: commandlog, common culprits, fixes | slow-commands |
| Cluster partitions: network splits, recovery | cluster-partitions |
| 7-phase runbook, fork latency, memory testing | diagnostics-runbook |
| Diagnostic commands, incident patterns, health script | diagnostics-commands |
Upgrades
| Topic | Reference |
|---|---|
| Version compatibility, RDB versions, feature matrix | compatibility |
| Redis to Valkey migration (three methods) | migration |
| Rolling upgrades for Sentinel and Cluster | rolling-upgrade |
Kubernetes
| Topic | Reference |
|---|---|
| Official and Bitnami Helm charts, key values | helm |
| Official and Hyperspike operators, CRD examples | operators-overview |
| SAP operator, day-2 operations, choosing operators | operators-day2 |
| StatefulSet PVCs, probes, resource sizing, PDB | statefulset-config |
| Complete StatefulSet manifest, common gotchas | statefulset-example |
| Kernel tuning, Docker/NAT, monitoring sidecars | tuning-k8s |
Operations
| Topic | Reference |
|---|---|
| Memory sizing, connection planning, cluster sizing | capacity-planning |
Production Checklist
| Topic | Reference |
|---|---|
| Pre-launch checklist: system, config, security, monitoring, backup, HA | production-checklist |