From nickcrew-claude-ctx-plugin
Guides incident response workflows from detection and triage through containment, resolution, and postmortem, with severity frameworks and resilience patterns like circuit breakers.
npx claudepluginhub nickcrew/claude-cortexThis skill uses the workspace's default tool permissions.
Structured incident management from detection through postmortem, with resilience patterns for preventing and containing cascading failures.
Triages production incidents by severity (P1-P3), contains blast radius via rollback or strategies, root-causes issues, documents timeline, generates postmortems. Triggers on outages, errors, or 'incident' mentions.
Orchestrates multi-agent SRE incident response for outages, security breaches, and degradations: triage, observability sweeps, mitigation, and blameless postmortems.
Guides incident management with lifecycle stages, severity levels, roles, metrics like MTTR. Use for runbooks, on-call rotations, postmortems.
Share bugs, ideas, or general feedback.
Structured incident management from detection through postmortem, with resilience patterns for preventing and containing cascading failures.
Avoid when:
| Topic | Load reference |
|---|---|
| Triage Framework | skills/incident-response/references/triage-framework.md |
| Postmortem Patterns | skills/incident-response/references/postmortem-patterns.md |
| Level | Impact | Response Time | Examples |
|---|---|---|---|
| P0 | Complete outage, data loss, security breach | Immediate (< 5 min) | Service down, data corruption, credential leak |
| P1 | Major feature broken, significant user impact | < 30 min | Payment processing failed, auth broken for region |
| P2 | Degraded performance, partial feature loss | < 4 hours | Elevated latency, non-critical feature unavailable |
| P3 | Minor issue, workaround available | Next business day | UI glitch, slow report generation, cosmetic error |