valkey-ops

Installation
SKILL.md

Valkey Operations Reference

Routing

  • Install, build from source, package manager, Docker, Compose, systemd, bare metal, multi-instance -> Deployment
  • Config tuning, maxmemory, eviction, encoding thresholds, lazyfree, logging, CPU pinning, workload presets, pubsub buffers -> Configuration
  • High availability, Sentinel, failover detection, quorum, split-brain, min-replicas -> Sentinel
  • Cluster setup, hash slots, resharding, node add/remove, atomic migration, replica migration, consistency -> Cluster
  • Persistence, RDB, AOF, hybrid, fsync, BGSAVE, backup, restore, disaster recovery -> Persistence
  • Replication, primary-replica, REPLICAOF, backlog, diskless sync, dual-channel, replication lag -> Replication
  • ACL, TLS, certificates, mutual TLS, protected mode, rename-command, hardening, network security -> Security
  • Monitoring, INFO, metrics, Prometheus, Grafana, alerting, commandlog, slow log -> Monitoring
  • Performance, I/O threads, memory fragmentation, defragmentation, latency, durability, client-side caching, CLIENT TRACKING, benchmarking -> Performance
  • OOM, out of memory, crashes, slow commands, replication lag diagnosis, cluster partitions, network splits, diagnostics -> Troubleshooting
  • Version upgrades, compatibility, Redis to Valkey migration, rolling upgrade -> Upgrades
  • Kubernetes, Helm, operators, StatefulSet, PVC, probes, resource sizing, kernel tuning -> Kubernetes
  • Capacity planning, memory sizing, connection planning, cluster sizing -> Operations
  • Pre-launch check, production readiness, go-live checklist -> Production Checklist

Deployment

Topic Reference
Package managers, building from source, build flags install
Docker images (official, Bitnami), cluster example docker-images
Compose patterns, volumes, networking, config injection docker-patterns
systemd service, kernel tuning, multi-instance bare-metal

Configuration

Topic Reference
Essential parameters with verified defaults essentials
Eviction policies, LRU/LFU tuning eviction
Memory encoding thresholds per data type encoding
Config presets by workload (cache, store, session, queue, rate limiter) workload-presets
Lazy free config (UNLINK, async eviction/expiry) lazyfree
Logging, OOM score, shutdown, CPU pinning, unix sockets, active expiration, protocol limits advanced
Pub/Sub buffer limits, keyspace notifications, sharded pub/sub, subscriber memory pubsub

Sentinel (High Availability)

Topic Reference
How Sentinel works, failure detection, election architecture
Step-by-step deployment, config directives sentinel-deployment
Tuning, cross-DC, Docker/NAT, coordinated failover, systemd sentinel-advanced
Split-brain prevention, min-replicas settings split-brain

Cluster

Topic Reference
Network requirements, config, cluster creation, hash slots setup
Resharding, adding/removing nodes, atomic migration (9.0) resharding
Manual failover, health checks, replica migration, scaling, rolling restart runbook operations
Consistency guarantees, write safety, partition behavior consistency

Persistence

Topic Reference
RDB configuration, save directives, BGSAVE rdb
AOF configuration, fsync policies, hybrid mode aof
Automated backup scripts, off-site backup, retention backup-strategies
Disaster recovery, FLUSHALL recovery, verification disaster-recovery

Replication

Topic Reference
Primary-replica setup, REPLICAOF, sync mechanisms setup
Backlog sizing, diskless sync, dual-channel, Docker/NAT tuning
min-replicas safety, critical warnings, data loss prevention safety

Security

Topic Reference
ACL users, roles, categories, practical examples acl
TLS setup, certificates, mutual TLS, replication encryption, cluster bus encryption tls
Defense in depth, protected mode, network hardening hardening
Disabling dangerous commands via rename-command and ACL rename-commands

Monitoring

Topic Reference
INFO sections, critical metrics, diagnostic commands metrics
Prometheus exporter setup, scrape configs prometheus
Grafana dashboards, panel definitions, PromQL queries grafana
Alert rules YAML (complete rule set) alerting-rules
Threshold tuning, recording rules, Alertmanager routing alerting-config
Commandlog (slow/large request/reply tracking) commandlog

Performance

Topic Reference
I/O threads config, when to enable, thread count io-threads
maxmemory, eviction, encoding, fragmentation memory
Latency diagnosis workflow, LATENCY DOCTOR, watchdog latency
Durability vs performance spectrum, TCP tuning, client connection tuning, kernel tuning, 9.0 features durability
Active defragmentation config and monitoring defragmentation
Client-side caching (CLIENT TRACKING) client-caching
valkey-benchmark, valkey-perf-benchmark, best practices benchmarking

Troubleshooting

Topic Reference
Out of memory: symptoms, diagnosis, resolution oom
Replication lag: diagnosis, backlog, buffer tuning replication-lag
Slow commands: commandlog, common culprits, fixes slow-commands
Cluster partitions: network splits, recovery cluster-partitions
7-phase runbook, fork latency, memory testing diagnostics-runbook
Diagnostic commands, incident patterns, health script diagnostics-commands

Upgrades

Topic Reference
Version compatibility, RDB versions, feature matrix compatibility
Redis to Valkey migration, 3 methods migration
Rolling upgrades for Sentinel and Cluster rolling-upgrade

Kubernetes

Topic Reference
Official and Bitnami Helm charts, key values helm
Official and Hyperspike operators, CRD examples operators-overview
SAP operator, day-2 operations, choosing operators operators-day2
StatefulSet PVCs, probes, resource sizing, PDB statefulset-config
Complete StatefulSet manifest, common gotchas statefulset-example
Kernel tuning, Docker/NAT, monitoring sidecars tuning-k8s

Operations

Topic Reference
Memory sizing, connection planning, cluster sizing capacity-planning

Production Checklist

Topic Reference
Pre-launch checklist: system, config, security, monitoring, backup, HA production-checklist
Related skills

More from avifenesh/valkey-skills

Installs
5
GitHub Stars
2
First Seen
Mar 30, 2026