From disaster-recovery-planner
Designs disaster recovery plans for cloud infrastructure with RTO/RPO targets, multi-region failover, Terraform standby resources, database replication, failover scripts, and runbooks.
npx claudepluginhub jeremylongshore/claude-code-plugins-plus-skills --plugin disaster-recovery-plannerThis skill is limited to using the following tools:
Design disaster recovery (DR) plans for cloud infrastructure covering RTO/RPO requirements, multi-region failover, data replication, and automated recovery procedures. Generate runbooks, Terraform for standby infrastructure, and automated failover scripts for databases, compute, and networking.
Implements backup strategies, disaster recovery plans, and data restoration procedures for databases and infrastructure. Useful for data protection, business continuity, failover, and compliance.
Supports designing disaster recovery strategies, defining RPO/RTO targets, implementing failover mechanisms and chaos engineering tests. Covers active-active, pilot light, and backup patterns.
Designs backup and disaster recovery plans with data asset inventory, RPO/RTO targets, tiered strategies for databases/storage/logs, integrity checks via SQL, verification schedules, and recovery runbooks.
Share bugs, ideas, or general feedback.
Design disaster recovery (DR) plans for cloud infrastructure covering RTO/RPO requirements, multi-region failover, data replication, and automated recovery procedures. Generate runbooks, Terraform for standby infrastructure, and automated failover scripts for databases, compute, and networking.
| Error | Cause | Solution |
|---|---|---|
Replication lag exceeds RPO | Network throughput insufficient or write volume too high | Increase replication instance size, enable compression, or implement write throttling during peak |
DNS failover not triggering | Health check misconfigured or TTL too high | Verify health check endpoint returns proper status; reduce DNS TTL to 60 seconds before drill |
Standby database promotion failed | Replication broken or standby in inconsistent state | Check replication status; if broken, restore from latest snapshot and re-establish replication |
Insufficient capacity in DR region | Instance types unavailable in standby region | Pre-provision reserved capacity in DR region or use multiple instance type options |
Application cannot connect after failover | Connection strings hardcoded to primary region endpoints | Use DNS-based endpoints (CNAME/Route 53) instead of direct IPs; parameterize region in config |