Master Redis high availability - replication, Sentinel for automatic failover, Redis Cluster for horizontal scaling, and distributed architecture patterns
Configures Redis high availability with replication, Sentinel auto-failover, and Cluster horizontal scaling.
/plugin marketplace add pluginagentmarketplace/custom-plugin-redis
/plugin install pluginagentmarketplace-developer-roadmap-interactive@pluginagentmarketplace/custom-plugin-redis
Model: sonnet
Production-grade agent for Redis high availability and horizontal scaling. Master replication for redundancy, Sentinel for automatic failover, and Redis Cluster for distributed data.
┌─────────────────────────────────────────────────────────────────────┐
│ HA ARCHITECTURE DECISION TREE │
├─────────────────────────────────────────────────────────────────────┤
│ │
│ What's your primary need? │
│ │ │
│ ├── Data fits in single node? │
│ │ │ │
│ │ ├── Need automatic failover? ──> SENTINEL (3+ nodes) │
│ │ │ │
│ │ └── Manual failover OK? ───────> REPLICATION │
│ │ │
│ └── Data exceeds single node capacity? │
│ │ │
│ └── Need horizontal scaling? ──> CLUSTER (6+ nodes) │
│ │
│ ┌─────────────────────────────────────────────────────────────┐ │
│ │ QUICK REFERENCE: │ │
│ │ • <100GB, <100K ops/s → Sentinel │ │
│ │ • >100GB or >100K ops/s → Cluster │ │
│ └─────────────────────────────────────────────────────────────┘ │
└─────────────────────────────────────────────────────────────────────┘
| Feature | Replication | Sentinel | Cluster |
|---|---|---|---|
| Auto Failover | ❌ | ✅ | ✅ |
| Read Scaling | ✅ | ✅ | ✅ |
| Write Scaling | ❌ | ❌ | ✅ |
| Data Sharding | ❌ | ❌ | ✅ |
| Min Nodes (Prod) | 2 | 5 (3 Sentinel + 2 Redis) | 6 |
| Complexity | Low | Medium | High |
| Client Support | All | Most | Cluster-aware |
# On replica
redis-cli REPLICAOF master-host 6379
# Check status
redis-cli INFO replication
# Master (redis.conf)
bind 0.0.0.0
protected-mode yes
requirepass "master-password"
masterauth "master-password" # Needed if this node is later demoted to a replica (failover) or for chained replication
# Replica (redis.conf)
replicaof master-host 6379
masterauth "master-password"
replica-read-only yes
replica-serve-stale-data yes
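For application code, a common pattern with this setup is read/write splitting: writes go to the master, reads to a replica. A minimal redis-py sketch, assuming the hostnames and password above are reachable (and that requirepass is also set on the replica):

import redis

# Hypothetical hostnames/password matching the config above
master = redis.Redis(host="master-host", port=6379, password="master-password")
replica = redis.Redis(host="replica-host", port=6379, password="master-password")

master.set("user:123:name", "Alice")   # writes always go to the master
print(replica.get("user:123:name"))    # reads can be served by any replica
# Replication is asynchronous: a read immediately after a write may return
# stale data until the replica catches up.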
┌─────────────────────────────────────────────────────────────────────┐
│ REPLICATION ARCHITECTURE │
├─────────────────────────────────────────────────────────────────────┤
│ │
│ ┌─────────────┐ │
│ Writes ───────>│ MASTER │<───────── Reads │
│ └──────┬──────┘ (optional) │
│ │ │
│ ┌──────────────┼──────────────┐ │
│ │ │ │ │
│ ▼ ▼ ▼ │
│ ┌──────────┐ ┌──────────┐ ┌──────────┐ │
│ │ REPLICA 1│ │ REPLICA 2│ │ REPLICA 3│ │
│ │ (reads) │ │ (reads) │ │ (reads) │ │
│ └──────────┘ └──────────┘ └──────────┘ │
│ │
└─────────────────────────────────────────────────────────────────────┘
┌─────────────────────────────────────────────────────────────────────┐
│ SENTINEL ARCHITECTURE │
├─────────────────────────────────────────────────────────────────────┤
│ │
│ ┌────────────┐ ┌────────────┐ ┌────────────┐ │
│ │ Sentinel 1 │ │ Sentinel 2 │ │ Sentinel 3 │ │
│ └─────┬──────┘ └──────┬─────┘ └──────┬─────┘ │
│ │ │ │ │
│ └────────────┬────┴─────────────────┘ │
│ │ Monitor + Failover │
│ ▼ │
│ ┌─────────────┐ │
│ │ MASTER │ │
│ └──────┬──────┘ │
│ │ │
│ ┌───────────┴───────────┐ │
│ ▼ ▼ │
│ ┌──────────┐ ┌──────────┐ │
│ │ REPLICA 1│ │ REPLICA 2│ │
│ └──────────┘ └──────────┘ │
│ │
└─────────────────────────────────────────────────────────────────────┘
# sentinel.conf
port 26379
sentinel monitor mymaster 192.168.1.10 6379 2
sentinel auth-pass mymaster master-password
sentinel down-after-milliseconds mymaster 5000
sentinel parallel-syncs mymaster 1
sentinel failover-timeout mymaster 60000
# Notification
sentinel notification-script mymaster /opt/notify.sh
sentinel client-reconfig-script mymaster /opt/reconfig.sh
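Sentinel calls the notification script with two arguments: the event type and the event description. Despite the .sh path above, any executable works; a minimal Python sketch (the alerting target is a placeholder):

#!/usr/bin/env python3
# Minimal Sentinel notification handler (sketch): log the event to syslog.
# Sentinel invokes it as: <script> <event-type> <event-description>
import sys
import syslog

event_type, event_description = sys.argv[1], sys.argv[2]
syslog.syslog(syslog.LOG_WARNING, f"Sentinel event {event_type}: {event_description}")
# Swap the syslog call for your paging or chat integration as needed.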
# Check master
redis-cli -p 26379 SENTINEL master mymaster
# Get current master address
redis-cli -p 26379 SENTINEL get-master-addr-by-name mymaster
# Force failover
redis-cli -p 26379 SENTINEL failover mymaster
# Check sentinels
redis-cli -p 26379 SENTINEL sentinels mymaster
# Check replicas
redis-cli -p 26379 SENTINEL replicas mymaster
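Application clients should discover the current master through Sentinel instead of hard-coding an address, so they follow failovers automatically. A sketch using redis-py's Sentinel support (Sentinel hostnames and the password are placeholders):

from redis.sentinel import Sentinel

# Placeholder Sentinel addresses; list all Sentinels you run
sentinel = Sentinel(
    [("sentinel1", 26379), ("sentinel2", 26379), ("sentinel3", 26379)],
    socket_timeout=0.5,
)

# These handles always resolve to the current master / a healthy replica,
# even after Sentinel promotes a new master.
master = sentinel.master_for("mymaster", password="master-password", socket_timeout=0.5)
replica = sentinel.slave_for("mymaster", password="master-password", socket_timeout=0.5)

master.set("counter", 1)
print(replica.get("counter"))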
# Create 6-node cluster (3 masters + 3 replicas)
redis-cli --cluster create \
192.168.1.1:6379 192.168.1.2:6379 192.168.1.3:6379 \
192.168.1.4:6379 192.168.1.5:6379 192.168.1.6:6379 \
--cluster-replicas 1
# redis.conf for cluster node
port 6379
cluster-enabled yes
cluster-config-file nodes.conf
cluster-node-timeout 5000
cluster-replica-validity-factor 10
cluster-require-full-coverage yes
appendonly yes
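A cluster-aware client fetches the slot map and routes each key to the node that owns its slot, following MOVED and ASK redirects transparently. A sketch with redis-py's RedisCluster (redis-py 4.1+; the seed address is a placeholder):

from redis.cluster import RedisCluster

# Any reachable node works as a seed; the client discovers the rest
rc = RedisCluster(host="192.168.1.1", port=6379)

rc.set("user:123:profile", "...")     # routed to the master owning the key's slot
print(rc.get("user:123:profile"))     # redirects, if any, are handled by the client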
┌─────────────────────────────────────────────────────────────────────┐
│ CLUSTER HASH SLOTS │
├─────────────────────────────────────────────────────────────────────┤
│ │
│ Total slots: 16384 │
│ │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ Master 1 │ │ Master 2 │ │ Master 3 │ │
│ │ Slots 0-5460 │ │ Slots 5461- │ │ Slots 10923- │ │
│ │ │ │ 10922 │ │ 16383 │ │
│ └──────┬───────┘ └──────┬───────┘ └──────┬───────┘ │
│ │ │ │ │
│ ▼ ▼ ▼ │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │ Replica 1 │ │ Replica 2 │ │ Replica 3 │ │
│ └──────────────┘ └──────────────┘ └──────────────┘ │
│ │
│ Key "user:123" → CRC16("user:123") % 16384 → Slot 5649 → Master 2 │
│ │
└─────────────────────────────────────────────────────────────────────┘
# Check cluster status
redis-cli -c -h 192.168.1.1 CLUSTER INFO
# View nodes
redis-cli -c -h 192.168.1.1 CLUSTER NODES
# Add node
redis-cli --cluster add-node new-node:6379 existing-node:6379
# Add as replica
redis-cli --cluster add-node new-node:6379 existing-node:6379 \
--cluster-slave --cluster-master-id <master-node-id>
# Reshard
redis-cli --cluster reshard existing-node:6379
# Remove node (empty slots first!)
redis-cli --cluster del-node host:6379 <node-id>
# Rebalance
redis-cli --cluster rebalance existing-node:6379
# Same slot guaranteed with hash tags
SET {user:123}:profile "..."
SET {user:123}:sessions "..."
MGET {user:123}:profile {user:123}:sessions # Works!
# Without hash tags - multi-key commands may fail
MGET user:123:profile user:456:sessions # CROSSSLOT error unless keys happen to hash to the same slot
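The slot mapping is easy to reproduce client-side. A sketch of the CRC16 (XMODEM) routine Redis Cluster uses, including the hash-tag rule, so you can predict which node owns a key:

def crc16(data: bytes) -> int:
    # CRC16-CCITT (XMODEM): poly 0x1021, init 0x0000 - the variant Redis uses
    crc = 0
    for byte in data:
        crc ^= byte << 8
        for _ in range(8):
            crc = ((crc << 1) ^ 0x1021) if crc & 0x8000 else (crc << 1)
            crc &= 0xFFFF
    return crc

def key_hash_slot(key: str) -> int:
    # If the key has a non-empty {...} hash tag, only the tag is hashed
    start = key.find("{")
    if start != -1:
        end = key.find("}", start + 1)
        if end != -1 and end != start + 1:
            key = key[start + 1:end]
    return crc16(key.encode()) % 16384

# Keys sharing a hash tag land in the same slot, so the MGET above succeeds
print(key_hash_slot("{user:123}:profile") == key_hash_slot("{user:123}:sessions"))  # True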
redis-replication - Replication deep dive (PRIMARY_BOND)
redis-cluster - Cluster configuration (PRIMARY_BOND)
CLUSTERDOWN The cluster is down
Diagnosis:
redis-cli -c CLUSTER INFO | grep cluster_state
redis-cli -c CLUSTER NODES | grep -v "connected"
Causes & Fixes:
| Cause | Fix |
|---|---|
| Node down | Restart node or failover |
| Network partition | Fix network, check firewall |
| No quorum | Ensure majority of masters up |
| Slots uncovered | Set cluster-require-full-coverage no, or reassign the missing slots |
# Force failover of a master
redis-cli -c -h replica-host CLUSTER FAILOVER
(error) MOVED 5649 192.168.1.2:6379
Cause: Client sent command to wrong node
Fix:
# CLI auto-follows with -c flag
redis-cli -c -h any-node SET key value
Replication lag
Diagnosis:
# Check lag
redis-cli INFO replication | grep lag
Causes & Fixes:
| Cause | Fix |
|---|---|
| Slow network | Improve bandwidth |
| Large writes | Throttle or batch |
| Slow replica disk | Use SSD |
| Replica overloaded | Add more replicas |
Prevention:
# In redis.conf
min-replicas-to-write 1
min-replicas-max-lag 10
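Lag can also be watched programmatically by comparing the master's replication offset with each replica's acknowledged offset. A redis-py sketch (host and password are placeholders):

import redis

# Placeholder connection details; point this at the current master
master = redis.Redis(host="master-host", port=6379, password="master-password")

info = master.info("replication")
master_offset = info["master_repl_offset"]
for field, value in info.items():
    # redis-py parses each slaveN line into a dict of ip/port/state/offset/lag
    if field.startswith("slave") and isinstance(value, dict):
        behind = master_offset - value.get("offset", 0)
        print(f"{field} {value.get('ip')}:{value.get('port')} "
              f"state={value.get('state')} behind={behind} bytes")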
Split-brain (multiple masters)
Detection:
# Check if multiple masters exist
redis-cli -p 26379 SENTINEL masters
Pre-deployment checklist (a scripted spot-check follows the list):
□ All nodes reachable (ping, telnet)?
□ Correct bind addresses?
□ Firewall allows Redis ports (6379, 16379)?
□ Password/auth configured on all nodes?
□ Cluster bus port open (port+10000)?
□ Time synchronized (NTP)?
□ Enough memory on each node?
□ Persistence configured?
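A few of these checks can be scripted. A sketch that verifies reachability and authentication for every node in the topology (node list and password are placeholders):

import redis

# Placeholder node list and password for the topology above
NODES = [("192.168.1.1", 6379), ("192.168.1.2", 6379), ("192.168.1.3", 6379)]
PASSWORD = "master-password"

for host, port in NODES:
    try:
        r = redis.Redis(host=host, port=port, password=PASSWORD, socket_timeout=2)
        r.ping()                                   # reachability + auth in one call
        role = r.info("replication").get("role")   # master or slave
        print(f"{host}:{port} OK role={role}")
    except redis.exceptions.RedisError as exc:
        print(f"{host}:{port} FAILED: {exc}")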
# Simulate master failure (Sentinel)
redis-cli -p 6379 DEBUG SLEEP 30
# Force Sentinel failover
redis-cli -p 26379 SENTINEL failover mymaster
# Simulate cluster node failure
redis-cli -c -h master-node DEBUG SEGFAULT
# Manual cluster failover
redis-cli -c -h replica-node CLUSTER FAILOVER TAKEOVER
| Code | Name | Description | Recovery |
|---|---|---|---|
| E501 | CLUSTERDOWN | Cluster unavailable | Check nodes, restore quorum |
| E502 | MOVED | Slot on different node | Follow redirect |
| E503 | ASK | Slot being migrated | ASKING + retry |
| E504 | CROSSSLOT | Keys in different slots | Use hash tags |
| E505 | NOTLEADER | Sentinel not leader | Use different Sentinel |
| E506 | READONLY | Writing to replica | Route to master |
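On the client side these surface as redis-py exceptions. A hedged sketch of handling the most common ones (the replica address is a placeholder; a cluster-aware client already follows MOVED and ASK for you):

import redis
from redis.exceptions import ReadOnlyError, ResponseError

replica = redis.Redis(host="192.168.1.4", port=6379)   # placeholder replica address
try:
    replica.set("user:123:profile", "...")
except ReadOnlyError:
    # E506: the write hit a replica; look up the current master
    # (e.g. SENTINEL get-master-addr-by-name) and retry there.
    print("READONLY: routed a write to a replica")
except ResponseError as exc:
    # CLUSTERDOWN / MOVED / ASK / CROSSSLOT also arrive as ResponseError
    # (or subclasses of it); see the recovery column above for each.
    print(f"Cluster response error: {exc}")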
Pattern 1 - Sentinel across three zones:
Zone A: Master + Sentinel
Zone B: Replica + Sentinel
Zone C: Replica + Sentinel
Quorum: 2 (survives 1 zone failure)
Pattern 2 - Cluster, one replica per master:
Zone A: Master 1 + Replica 2
Zone B: Master 2 + Replica 3
Zone C: Master 3 + Replica 1
Survives: 1 zone failure, 1 node failure per zone
Pattern 3 - Cluster, two replicas per master:
Zone A: Master 1 + Replica 2 + Replica 3
Zone B: Master 2 + Replica 3 + Replica 1
Zone C: Master 3 + Replica 1 + Replica 2
Survives: 1 zone failure + additional node failures