Skill

capacity-plan

Create a capacity plan — analyse current load, project growth, calculate headroom, and recommend scaling actions.

Install

npx claudepluginhub hpsgd/turtlestack --plugin performance-engineer

Tool Access

This skill is limited to using the following tools:

ReadBashGlobGrep

Preview

Create a capacity plan for $ARGUMENTS.

SKILL.md

Similar Skills

android-clean-architecture

Implements Clean Architecture in Android and Kotlin Multiplatform projects: module layouts, dependency rules, UseCases, Repositories, domain models, and data layers with Room, SQLDelight, Ktor.

everything-claude-code

157.7k

ai-regression-testing

Delivers DB-free sandbox API regression tests for Next.js/Vitest to catch AI blind spots in self-reviewed code changes like API routes and backend logic.

everything-claude-code

157.7k

ai-first-engineering

Provides process, architecture, review, hiring, and testing guidelines for engineering teams relying on AI code generation.

everything-claude-code

157.7k

Stats

Parent Repo Stars0

Parent Repo Forks0

Last CommitApr 16, 2026

Actions

View Source View Plugin View on GitHub View README

Metric	How to measure	What to capture
Requests/sec	APM, load balancer logs, application metrics	Average, peak, by endpoint
Concurrent users	Session count, WebSocket connections	Average, peak, by time of day
Data volume	Database size, storage metrics	Total size, growth rate per day/week
Bandwidth	Network metrics, CDN analytics	Inbound, outbound, by service

Metric

How to measure

What to capture

Requests/sec

APM, load balancer logs, application metrics

Average, peak, by endpoint

Concurrent users

Session count, WebSocket connections

Average, peak, by time of day

Data volume

Database size, storage metrics

Total size, growth rate per day/week

Bandwidth

Network metrics, CDN analytics

Inbound, outbound, by service

Resource	Current utilisation	Target	Headroom
CPU	[%] at normal, [%] at peak	< 70% normal, < 90% peak	[remaining %]
Memory	[GB] at normal, [GB] at peak	< 70% normal, < 85% peak	[remaining GB]
Disk I/O	[IOPS] at normal	< 60% provisioned IOPS	[remaining IOPS]
Network	[Mbps] at normal	< 50% bandwidth limit	[remaining Mbps]
Connection pools	[n] active / [n] max	< 70% pool size	[remaining connections]
Storage	[GB] used / [GB] provisioned	< 80% provisioned	[remaining GB, days until full]

Resource

Current utilisation

Target

Headroom

CPU

[%] at normal, [%] at peak

< 70% normal, < 90% peak

[remaining %]

Memory

[GB] at normal, [GB] at peak

< 70% normal, < 85% peak

[remaining GB]

Disk I/O

[IOPS] at normal

< 60% provisioned IOPS

[remaining IOPS]

Network

[Mbps] at normal

< 50% bandwidth limit

[remaining Mbps]

Connection pools

[n] active / [n] max

< 70% pool size

[remaining connections]

Storage

[GB] used / [GB] provisioned

< 80% provisioned

[remaining GB, days until full]

Growth model	When to use	How to calculate
Linear	Steady user acquisition, mature product	Current growth rate × time
Exponential	Viral growth, new market, post-launch	Compound growth rate, with a ceiling estimate
Step function	Sales-driven (enterprise customers), geographic expansion	Planned customer additions × per-customer load
Seasonal	E-commerce, education, tax/financial	Historical seasonal multiplier × base growth

Growth model

When to use

How to calculate

Linear

Steady user acquisition, mature product

Current growth rate × time

Exponential

Viral growth, new market, post-launch

Compound growth rate, with a ceiling estimate

Step function

Sales-driven (enterprise customers), geographic expansion

Planned customer additions × per-customer load

Seasonal

E-commerce, education, tax/financial

Historical seasonal multiplier × base growth

Question	Answer
At what load does the system degrade?	[requests/sec or concurrent users]
What component fails first?	[database, application, cache, network, external API]
What is the failure mode?	[errors, latency spike, timeout, crash, data corruption]
How far is current peak from the breaking point?	[multiplier — e.g., "3.2x headroom"]

Question

Answer

At what load does the system degrade?

[requests/sec or concurrent users]

What component fails first?

[database, application, cache, network, external API]

What is the failure mode?

[errors, latency spike, timeout, crash, data corruption]

How far is current peak from the breaking point?

[multiplier — e.g., "3.2x headroom"]

Component	Current peak	Breaking point	Headroom	Time to scale (at projected growth)
[component]	[metric]	[metric]	[Nx]	[months]

Component

Current peak

Breaking point

Headroom

Time to scale (at projected growth)

[component]

[metric]

[Nx]

[months]

Strategy	When to use	Lead time	Cost impact
Vertical (bigger instance)	Single bottleneck, simple architecture	Hours–days	Linear increase
Horizontal (more instances)	Stateless services, read-heavy workloads	Minutes (auto-scale) to days (manual)	Linear increase
Caching (CDN, Redis, application cache)	Read-heavy, cacheable content	Days–weeks	Reduces load on origin
Read replicas	Database read bottleneck	Days	Database cost increase
Async processing (queues, workers)	Write-heavy, batch operations	Weeks	Decouples peak from processing
Architectural (sharding, microservices, CQRS)	Fundamental scalability limit reached	Months	Significant engineering investment

Strategy

When to use

Lead time

Cost impact

Vertical (bigger instance)

Single bottleneck, simple architecture

Hours–days

Linear increase

Horizontal (more instances)

Stateless services, read-heavy workloads

Minutes (auto-scale) to days (manual)

Linear increase

Caching (CDN, Redis, application cache)

Read-heavy, cacheable content

Days–weeks

Reduces load on origin

Read replicas

Database read bottleneck

Days

Database cost increase

Async processing (queues, workers)

Write-heavy, batch operations

Weeks

Decouples peak from processing

Architectural (sharding, microservices, CQRS)

Fundamental scalability limit reached

Months

Significant engineering investment

Scaling method	Lead time	Automation
Auto-scaling (cloud)	Minutes	Fully automated — configure scaling policies
Manual horizontal scaling	Hours–days	Requires provisioning and deployment
Vertical scaling	Hours (cloud) to weeks (on-prem)	Requires downtime for some configurations
Database scaling	Days–weeks	Requires migration planning, potential downtime
Architectural changes	Sprints–quarters	Requires design, implementation, migration

Scaling method

Lead time

Automation

Auto-scaling (cloud)

Minutes

Fully automated — configure scaling policies

Manual horizontal scaling

Hours–days

Requires provisioning and deployment

Vertical scaling

Hours (cloud) to weeks (on-prem)

Requires downtime for some configurations

Database scaling

Days–weeks

Requires migration planning, potential downtime

Architectural changes

Sprints–quarters

Requires design, implementation, migration

# Capacity Plan: [system/service] ## Current State | Metric | Average | Peak | Trend | |---|---|---|---| | Requests/sec | [n] | [n] | [growing/stable/declining] | | Concurrent users | [n] | [n] | [trend] | | Data volume | [GB] | — | [growth rate/day] | ## Resource Utilisation | Resource | Normal | Peak | Headroom | Status | |---|---|---|---|---| | CPU | [%] | [%] | [%] | OK / WARNING / CRITICAL | | Memory | [GB] | [GB] | [GB] | OK / WARNING / CRITICAL | | Storage | [GB] | — | [days until full] | OK / WARNING / CRITICAL | ## Growth Projections | Horizon | Requests/sec | Users | Data volume | |---|---|---|---| | 3 months | [n] | [n] | [GB] | | 6 months | [n] | [n] | [GB] | | 12 months | [n] | [n] | [GB] | ## Bottleneck Analysis | Component | Current peak | Breaking point | Headroom | Time to scale | |---|---|---|---|---| | [component] | [metric] | [metric] | [Nx] | [months] | ## Recommendations | Priority | Action | Cost/month | Capacity gain | Lead time | |---|---|---|---|---| | 1 | [action] | [$] | [Nx] | [time] | ## Decision Timeline - **Immediate:** [actions needed now] - **30 days:** [decisions that must be made] - **90 days:** [planned scaling activities]

Metric	How to measure	What to capture
Requests/sec	APM, load balancer logs, application metrics	Average, peak, by endpoint
Concurrent users	Session count, WebSocket connections	Average, peak, by time of day
Data volume	Database size, storage metrics	Total size, growth rate per day/week
Bandwidth	Network metrics, CDN analytics	Inbound, outbound, by service

Metric

How to measure

What to capture

Requests/sec

APM, load balancer logs, application metrics

Average, peak, by endpoint

Concurrent users

Session count, WebSocket connections

Average, peak, by time of day

Data volume

Database size, storage metrics

Total size, growth rate per day/week

Bandwidth

Network metrics, CDN analytics

Inbound, outbound, by service

Resource	Current utilisation	Target	Headroom
CPU	[%] at normal, [%] at peak	< 70% normal, < 90% peak	[remaining %]
Memory	[GB] at normal, [GB] at peak	< 70% normal, < 85% peak	[remaining GB]
Disk I/O	[IOPS] at normal	< 60% provisioned IOPS	[remaining IOPS]
Network	[Mbps] at normal	< 50% bandwidth limit	[remaining Mbps]
Connection pools	[n] active / [n] max	< 70% pool size	[remaining connections]
Storage	[GB] used / [GB] provisioned	< 80% provisioned	[remaining GB, days until full]

Resource

Current utilisation

Target

Headroom

CPU

[%] at normal, [%] at peak

< 70% normal, < 90% peak

[remaining %]

Memory

[GB] at normal, [GB] at peak

< 70% normal, < 85% peak

[remaining GB]

Disk I/O

[IOPS] at normal

< 60% provisioned IOPS

[remaining IOPS]

Network

[Mbps] at normal

< 50% bandwidth limit

[remaining Mbps]

Connection pools

[n] active / [n] max

< 70% pool size

[remaining connections]

Storage

[GB] used / [GB] provisioned

< 80% provisioned

[remaining GB, days until full]

Growth model	When to use	How to calculate
Linear	Steady user acquisition, mature product	Current growth rate × time
Exponential	Viral growth, new market, post-launch	Compound growth rate, with a ceiling estimate
Step function	Sales-driven (enterprise customers), geographic expansion	Planned customer additions × per-customer load
Seasonal	E-commerce, education, tax/financial	Historical seasonal multiplier × base growth

Growth model

When to use

How to calculate

Linear

Steady user acquisition, mature product

Current growth rate × time

Exponential

Viral growth, new market, post-launch

Compound growth rate, with a ceiling estimate

Step function

Sales-driven (enterprise customers), geographic expansion

Planned customer additions × per-customer load

Seasonal

E-commerce, education, tax/financial

Historical seasonal multiplier × base growth

Question	Answer
At what load does the system degrade?	[requests/sec or concurrent users]
What component fails first?	[database, application, cache, network, external API]
What is the failure mode?	[errors, latency spike, timeout, crash, data corruption]
How far is current peak from the breaking point?	[multiplier — e.g., "3.2x headroom"]

Question

Answer

At what load does the system degrade?

[requests/sec or concurrent users]

What component fails first?

[database, application, cache, network, external API]

What is the failure mode?

[errors, latency spike, timeout, crash, data corruption]

How far is current peak from the breaking point?

[multiplier — e.g., "3.2x headroom"]

Component	Current peak	Breaking point	Headroom	Time to scale (at projected growth)
[component]	[metric]	[metric]	[Nx]	[months]

Component

Current peak

Breaking point

Headroom

Time to scale (at projected growth)

[component]

[metric]

[Nx]

[months]

Strategy	When to use	Lead time	Cost impact
Vertical (bigger instance)	Single bottleneck, simple architecture	Hours–days	Linear increase
Horizontal (more instances)	Stateless services, read-heavy workloads	Minutes (auto-scale) to days (manual)	Linear increase
Caching (CDN, Redis, application cache)	Read-heavy, cacheable content	Days–weeks	Reduces load on origin
Read replicas	Database read bottleneck	Days	Database cost increase
Async processing (queues, workers)	Write-heavy, batch operations	Weeks	Decouples peak from processing
Architectural (sharding, microservices, CQRS)	Fundamental scalability limit reached	Months	Significant engineering investment

Strategy

When to use

Lead time

Cost impact

Vertical (bigger instance)

Single bottleneck, simple architecture

Hours–days

Linear increase

Horizontal (more instances)

Stateless services, read-heavy workloads

Minutes (auto-scale) to days (manual)

Linear increase

Caching (CDN, Redis, application cache)

Read-heavy, cacheable content

Days–weeks

Reduces load on origin

Read replicas

Database read bottleneck

Days

Database cost increase

Async processing (queues, workers)

Write-heavy, batch operations

Weeks

Decouples peak from processing

Architectural (sharding, microservices, CQRS)

Fundamental scalability limit reached

Months

Significant engineering investment

Scaling method	Lead time	Automation
Auto-scaling (cloud)	Minutes	Fully automated — configure scaling policies
Manual horizontal scaling	Hours–days	Requires provisioning and deployment
Vertical scaling	Hours (cloud) to weeks (on-prem)	Requires downtime for some configurations
Database scaling	Days–weeks	Requires migration planning, potential downtime
Architectural changes	Sprints–quarters	Requires design, implementation, migration

Scaling method

Lead time

Automation

Auto-scaling (cloud)

Minutes

Fully automated — configure scaling policies

Manual horizontal scaling

Hours–days

Requires provisioning and deployment

Vertical scaling

Hours (cloud) to weeks (on-prem)

Requires downtime for some configurations

Database scaling

Days–weeks

Requires migration planning, potential downtime

Architectural changes

Sprints–quarters

Requires design, implementation, migration

capacity-plan

Install

Tool Access

Preview

SKILL.md

Similar Skills

capacity-plan

Install

Tool Access

Preview

SKILL.md

Process (sequential — do not skip steps)

Step 1: Current Load Profile

Step 2: Resource Utilisation Baseline

Step 3: Growth Projection

Step 4: Breaking Point Identification

Step 5: Headroom Calculation

Step 6: Scaling Options

Step 7: Lead Time Assessment

Step 8: Recommendation

Anti-Patterns (NEVER do these)

Output Format

Related Skills

Similar Skills

Process (sequential — do not skip steps)

Step 1: Current Load Profile

Step 2: Resource Utilisation Baseline

Step 3: Growth Projection

Step 4: Breaking Point Identification

Step 5: Headroom Calculation

Step 6: Scaling Options

Step 7: Lead Time Assessment

Step 8: Recommendation

Anti-Patterns (NEVER do these)

Output Format

Related Skills