npx claudepluginhub mayurpise/draft --plugin draftThis skill uses the workspace's default tool permissions.
You are managing an incident through its full lifecycle using structured incident management practices.
Runs incident response workflow: triage severity and roles, draft communications, track mitigation, generate blameless postmortem from alerts or status updates.
Classifies incidents by severity (SEV1-4), constructs timelines, assesses impact, performs 5 Whys root cause analysis, and generates blameless post-mortems for production issues.
Guide incident response, root cause analysis, and post-mortem documentation. Use when: production incident, outage response, post-mortem writing, RCA. Keywords: incident, outage, post-mortem, RCA, root cause, 事故, 故障, 根因分析.
Share bugs, ideas, or general feedback.
You are managing an incident through its full lifecycle using structured incident management practices.
Communicate first. Fix second. Learn always.
ls draft/ 2>/dev/null
This skill works standalone — incidents don't wait for project setup.
core/shared/draft-context-loading.md./draft:incident-response new <description> — Start new incident/draft:incident-response update <status> — Post status update/draft:incident-response postmortem — Generate postmortem report/draft:incident-response (no args) — Interactive: ask which modeClassify severity:
| Level | Response Time | Who | Examples |
|---|---|---|---|
| SEV1 | Immediate, all-hands | Entire team | Data loss, complete outage, security breach |
| SEV2 | 15 minutes | On-call + team lead | Major feature broken, significant degradation |
| SEV3 | 1 hour | On-call | Minor feature broken, workaround exists |
| SEV4 | Next business day | Assigned engineer | Cosmetic issue, minor inconvenience |
Assess:
draft/product.md user types if available)draft/.ai-context.md service topology if available)Generate initial status update:
INCIDENT: {description}
Severity: SEV{1-4}
Impact: {who/what is affected}
Status: Investigating
Commander: {name or "unassigned"}
Next update: {time — SEV1: 15min, SEV2: 30min, SEV3: 1hr}
get_issue, get_issue_description, get_issue_comments)curl/wget to fetch dashboards or error pages mentionedssh to access remote log paths if mentionedgh CLI available: check recent deployments and merged PRs (gh pr list --state merged --search "merged:>2024-01-01")Following core/agents/ops.md production-safety mindset:
Document all actions taken with timestamps.
Save to: draft/incidents/incident-<timestamp>.md or draft/tracks/<id>/incident.md
# Incident: {description}
| Field | Value |
|-------|-------|
| **Severity** | SEV{N} |
| **Status** | {Investigating/Mitigating/Resolved} |
| **Started** | {timestamp} |
| **Commander** | {name} |
## Timeline
| Time | Action |
|------|--------|
| {time} | Incident detected |
| {time} | Triage: classified as SEV{N} |
| {time} | {mitigation action} |
## Evidence
| Source | Finding |
|--------|---------|
| {source} | {finding} |
## Status Updates
{chronological updates}
git log for related commits during incident windowgh CLI: pull PRs submitted during/after incidentReference core/agents/rca.md methodology:
5 Whys Analysis:
Root Cause Classification:
Detection Lag: When was the bug introduced vs when was it detected?
SLO Impact: Which SLOs were affected and by how much?
MANDATORY: Include YAML frontmatter with git metadata. Follow core/shared/git-report-metadata.md.
Save to: draft/incidents/postmortem-<timestamp>.md with symlink postmortem-latest.md
Or track-scoped: draft/tracks/<id>/postmortem.md
# Postmortem: {incident title}
## Summary
{2-3 sentences: what happened, impact, duration}
## Impact
- **Duration:** {start} to {end} ({total time})
- **Users affected:** {count or percentage}
- **SLO impact:** {which SLOs, by how much}
- **Data impact:** {any data loss or corruption}
## Timeline
| Time | Event |
|------|-------|
| {time} | {event} |
## Root Cause
{1-2 sentence root cause statement}
### 5 Whys
1. Why? → {answer}
2. Why? → {answer}
...
### Classification
- **Type:** {classification}
- **Detection Lag:** {introduced} → {detected} = {gap}
## What Went Well
- {positive observations}
## What Went Wrong
- {things that made the incident worse}
## Action Items
| # | Action | Owner | Deadline | Status |
|---|--------|-------|----------|--------|
| 1 | {detection improvement} | {name} | {date} | [ ] |
| 2 | {process improvement} | {name} | {date} | [ ] |
| 3 | {code improvement} | {name} | {date} | [ ] |
Follow core/shared/jira-sync.md:
⚠️ Test Writing Guardrail: If postmortem identifies missing tests, ASK: "Want me to create regression test tasks? [Y/n]"
/draft:new-track when incident keywords detected in descriptiongit bisect (find the breaking commit), /draft:learn (update guardrails)/draft:new-track for the fixIf no incident file found (update/postmortem mode): List available incidents, ask which one If no Jira ticket: Proceed without sync, note: "Link a Jira ticket for automatic sync"