Skill

caching-strategy

Designs cache architectures using cache-aside, write-through, and write-back patterns with invalidation and CDN strategies. Trigger: "caching strategy", "cache architecture", "cache invalidation", "CDN design", "cache patterns".

From sovereign-architect

Install

Run in your terminal

npx claudepluginhub javimontano/mao-sovereign-architect

Tool Access

This skill is limited to using the following tools:

Read, Glob, Grep, Bash, Agent

Supporting Assets

evals/evals.json

examples/sample-output.md

prompts/use-case-prompts.md

references/body-of-knowledge.md

Stats

Stars: 0

Forks: 0

Last Commit: Mar 28, 2026

Caching Strategy

Designs multi-layer caching architectures by selecting appropriate patterns (cache-aside, write-through, write-behind, read-through), invalidation strategies, TTL policies, and CDN configurations to optimize latency and reduce backend load.

Guiding Principle

"There are only two hard things in computer science: cache invalidation and naming things."

Procedure

Step 1 — Profile Access Patterns

Identify the read/write ratio for each data domain (caching benefits read-heavy workloads).
Map the hot data set: which 20% of data serves 80% of requests?
Determine staleness tolerance per data type (real-time, seconds, minutes, hours).
Measure current latency and throughput bottlenecks to establish a baseline.
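
As a rough illustration of the profiling step, the sketch below derives a read/write ratio and a hot key set from an access log. The log format (a sequence of `(operation, key)` pairs) and the function names `profile_access` and `hot_set` are assumptions made for this example; real traces would come from whatever database or proxy logs are available.

```python
from collections import Counter


def profile_access(events):
    """Summarize an access log.

    `events` is assumed to be an iterable of (operation, key) pairs,
    where operation is "read" or "write".
    """
    reads = Counter()
    writes = 0
    for op, key in events:
        if op == "read":
            reads[key] += 1
        elif op == "write":
            writes += 1
    total_reads = sum(reads.values())
    # With no writes at all the ratio is unbounded.
    ratio = total_reads / writes if writes else float("inf")
    return reads, total_reads, ratio


def hot_set(reads, coverage=0.8):
    """Return the most-read keys that together serve `coverage` of all reads."""
    total = sum(reads.values())
    hot, served = [], 0
    for key, count in reads.most_common():
        if served >= coverage * total:
            break
        hot.append(key)
        served += count
    return hot
```

A small hot set relative to the total key count, combined with a high read/write ratio, suggests the domain is a good caching candidate.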

Step 2 — Select Cache Patterns

Cache-Aside (Lazy Loading): Application checks the cache first, loads from the DB on a miss, then writes the result to the cache. Best for read-heavy workloads that tolerate some stale data.
Write-Through: Application writes to the cache and the DB synchronously, as part of the same operation. Keeps the cache consistent with the DB for data written through it, but adds write latency.
Write-Behind (Write-Back): Application writes to the cache; the cache asynchronously flushes to the DB. Highest write throughput, but writes not yet flushed are lost if the cache fails.
Read-Through: The cache itself fetches from the DB on a miss (the cache acts as the data source for the application). Simplifies application code.
Select the pattern per data domain based on consistency requirements and staleness tolerance.
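
The difference between cache-aside and write-through can be sketched in a few lines. This is a minimal in-memory illustration, not a production implementation: `Database` is a hypothetical stand-in that counts reads, and the cache is a plain dictionary where a real system would use something like Redis.

```python
class Database:
    """Hypothetical stand-in for a real database; counts reads for illustration."""

    def __init__(self):
        self.rows = {}
        self.reads = 0

    def get(self, key):
        self.reads += 1
        return self.rows.get(key)

    def put(self, key, value):
        self.rows[key] = value


class CacheAside:
    """The application owns the caching logic: check cache, fall back to DB."""

    def __init__(self, db):
        self.db = db
        self.cache = {}

    def read(self, key):
        if key in self.cache:
            return self.cache[key]          # cache hit
        value = self.db.get(key)            # cache miss: load from DB
        if value is not None:
            self.cache[key] = value         # populate for later reads
        return value

    def write(self, key, value):
        self.db.put(key, value)
        self.cache.pop(key, None)           # invalidate; next read repopulates


class WriteThrough(CacheAside):
    """Same read path, but writes update the DB and the cache together."""

    def write(self, key, value):
        self.db.put(key, value)
        self.cache[key] = value             # cache is fresh right after the write
```

With cache-aside, the first read after a write pays a DB round trip; with write-through, that cost moves to the write itself.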

Step 3 — Design Invalidation Strategy

TTL-Based: Set a time-to-live per cache entry; simple, but may serve stale data for up to the TTL duration.
Event-Driven Invalidation: Publish cache-invalidation events on data changes; fresher but requires event infrastructure.
Version-Based: Embed a version number in the cache key; incrementing it on writes makes all entries under the old version unreachable, so they age out through TTL or eviction.
Tag-Based Invalidation: Group cache entries by tags; invalidate all entries with a given tag.
Define the cache warming strategy for cold starts and deployments.
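
The TTL, tag-based, and version-based strategies above can be combined in a small sketch. The class `TaggedTTLCache` and the helper `versioned_key` are hypothetical names introduced for illustration; the clock is injectable so that expiry can be exercised without waiting.

```python
import time


class TaggedTTLCache:
    """In-memory cache with per-entry TTL and tag-based invalidation."""

    def __init__(self, clock=time.monotonic):
        self._clock = clock
        self._entries = {}   # key -> (value, expires_at)
        self._tags = {}      # tag -> set of keys carrying that tag

    def set(self, key, value, ttl, tags=()):
        self._entries[key] = (value, self._clock() + ttl)
        for tag in tags:
            self._tags.setdefault(tag, set()).add(key)

    def get(self, key):
        entry = self._entries.get(key)
        if entry is None:
            return None
        value, expires_at = entry
        if self._clock() >= expires_at:
            del self._entries[key]   # expired: drop lazily on access
            return None
        return value

    def invalidate_tag(self, tag):
        """Remove every entry that was stored with the given tag."""
        for key in self._tags.pop(tag, set()):
            self._entries.pop(key, None)


def versioned_key(namespace, version, key):
    """Build a key whose namespace version can be bumped to orphan old entries."""
    return f"{namespace}:v{version}:{key}"
```

Event-driven invalidation would typically call something like `invalidate_tag` from a message handler; that wiring depends on the event infrastructure in use and is left out here.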

Step 4 — Multi-Layer Architecture

L1 — In-Process Cache: Local memory (e.g., Caffeine, lru-cache) for ultra-low latency on hot keys.
L2 — Distributed Cache: Redis or Memcached for shared cache across application instances.
L3 — CDN/Edge Cache: CloudFront, Fastly, or Cloudflare for static assets and cacheable API responses.
Define cache key naming conventions to prevent collisions across services.
Document the eviction policy per layer (LRU, LFU, FIFO) and maximum memory allocation.
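
A two-layer lookup (L1 in-process, L2 shared) might look like the sketch below. It assumes a small LRU cache for L1 and uses a plain dictionary as a stand-in for the distributed L2 store; `cache_key`, `LRUCache`, and `TwoLayerCache` are illustrative names, and the `service:entity:id` key format is one possible convention rather than a prescribed one.

```python
from collections import OrderedDict


def cache_key(service, entity, entity_id):
    """Namespace keys by service and entity type to avoid collisions."""
    return f"{service}:{entity}:{entity_id}"


class LRUCache:
    """Bounded in-process cache that evicts the least recently used entry."""

    def __init__(self, max_entries):
        self.max_entries = max_entries
        self._data = OrderedDict()

    def get(self, key):
        if key not in self._data:
            return None
        self._data.move_to_end(key)          # mark as most recently used
        return self._data[key]

    def set(self, key, value):
        self._data[key] = value
        self._data.move_to_end(key)
        if len(self._data) > self.max_entries:
            self._data.popitem(last=False)   # evict least recently used


class TwoLayerCache:
    """Check L1, then L2, then the loader; fill the layers on the way back."""

    def __init__(self, l1, l2, loader):
        self.l1 = l1          # LRUCache instance
        self.l2 = l2          # assumption: a dict standing in for Redis/Memcached
        self.loader = loader  # function that reads from the backing store

    def get(self, key):
        # Note: None is treated as "missing", so None values are never cached.
        value = self.l1.get(key)
        if value is not None:
            return value
        value = self.l2.get(key)
        if value is None:
            value = self.loader(key)
            if value is None:
                return None
            self.l2[key] = value
        self.l1.set(key, value)
        return value
```

Keeping L1 small limits how much stale data each instance can hold, since L1 entries are not visible to invalidations issued against L2 unless those are propagated separately.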

Quality Criteria

Every cached data type has a documented TTL with staleness-tolerance justification.
Cache hit ratio targets are defined (typically >90% for hot data).
Invalidation strategy is tested for race conditions and for stampede behavior under concurrent misses (thundering herd, dog-pile effect).
Cache failure is handled gracefully — the system degrades to direct DB reads, not errors.
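
The last two criteria, hit-ratio tracking and graceful degradation, can be sketched together. `FlakyCache` and `CacheUnavailable` are hypothetical stand-ins for a cache client and its connection error; a real client would raise its own exception type.

```python
class CacheUnavailable(Exception):
    """Hypothetical error raised when the cache cannot be reached."""


class FlakyCache:
    """Stand-in cache client that can be switched offline for testing."""

    def __init__(self):
        self.data = {}
        self.online = True

    def get(self, key):
        if not self.online:
            raise CacheUnavailable(key)
        return self.data.get(key)

    def set(self, key, value):
        if not self.online:
            raise CacheUnavailable(key)
        self.data[key] = value


class ResilientReader:
    """Reads through the cache, but falls back to the DB if the cache fails."""

    def __init__(self, cache, load_from_db):
        self.cache = cache
        self.load_from_db = load_from_db
        self.hits = 0
        self.misses = 0
        self.cache_errors = 0

    def read(self, key):
        try:
            value = self.cache.get(key)
        except CacheUnavailable:
            self.cache_errors += 1
            return self.load_from_db(key)    # degrade to a direct DB read
        if value is not None:
            self.hits += 1
            return value
        self.misses += 1
        value = self.load_from_db(key)
        try:
            self.cache.set(key, value)
        except CacheUnavailable:
            self.cache_errors += 1           # serving the value matters more
        return value

    def hit_ratio(self):
        lookups = self.hits + self.misses
        return self.hits / lookups if lookups else 0.0
```

Counting cache errors separately from misses keeps the hit ratio meaningful during an outage and gives an obvious signal to alert on.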

Anti-Patterns

Caching everything without profiling access patterns ("cache-all" strategy that wastes memory).
No invalidation strategy — relying solely on TTL expiry for data that requires real-time freshness.
Cache stampede: all instances simultaneously miss the cache and hammer the database.
Storing large objects in the cache that consume most of the memory budget and force eviction of small, hot entries.
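
One common mitigation for the cache stampede anti-pattern is to let only one caller per key rebuild a missing entry while the others wait. The sketch below does this with per-key locks inside a single process; `SingleFlightCache` is an illustrative name, and coordinating across multiple application instances would need a distributed lock or similar mechanism that is not shown here.

```python
import threading


class SingleFlightCache:
    """On a miss, only one thread per key calls the loader; others wait."""

    def __init__(self, loader):
        self._loader = loader
        self._cache = {}
        self._locks = {}
        self._guard = threading.Lock()   # protects the lock table itself

    def _lock_for(self, key):
        with self._guard:
            return self._locks.setdefault(key, threading.Lock())

    def get(self, key):
        if key in self._cache:
            return self._cache[key]          # fast path: no locking on a hit
        with self._lock_for(key):
            if key in self._cache:           # another thread filled it meanwhile
                return self._cache[key]
            value = self._loader(key)        # only one thread per key gets here
            self._cache[key] = value
            return value
```

The second membership check inside the lock is what prevents waiting threads from repeating the load once the first thread has finished.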