From supervibe
Use WHEN production is broken (outage, data issue, security event) to triage, mitigate, root-cause, and write a postmortem with a timeline and action items. Triggers: 'incident', 'постмортем' (postmortem), 'упал прод' (prod is down), 'разбор инцидента' (incident review).
npx claudepluginhub vtrka/supervibe --plugin supervibe

This skill is limited to using the following tools:
Use WHEN production is broken: outage, data corruption, security event, user-facing degradation, or an on-call page. Also applies when simulating an incident in a drill or tabletop exercise.
This skill prioritizes mitigation > investigation > postmortem (in that order, time-pressured).
Follow docs/references/skill-expert-operating-standard.md: start from source of truth, preserve retrieval evidence, apply scope safety, use real producers with runtime receipts for durable delegated outputs, verify before completion claims, and keep confidence below gate when evidence is partial.
- Recent changes (`git log --since='2 hours ago'`)
- `runbooks/` for the affected service

Severity?
├─ SEV1 (everything down, data loss imminent) → mitigate FIRST, investigate after
├─ SEV2 (partial outage, customer-facing) → mitigate within 30 min, investigate in parallel
├─ SEV3 (degradation, minor) → investigate first, mitigate once the root cause is found
└─ SEV4 (internal-only) → schedule for normal hours
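
The severity tree above can be read as a small lookup table. The following is a minimal sketch of that reading, not part of the skill itself: the `Severity`, `TriagePlan`, and `triage` names and the field layout are hypothetical, and only the 30-minute figure for SEV2 comes from the tree.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Severity(Enum):
    SEV1 = 1  # everything down, data loss imminent
    SEV2 = 2  # partial outage, customer-facing
    SEV3 = 3  # degradation, minor
    SEV4 = 4  # internal-only


@dataclass(frozen=True)
class TriagePlan:
    first_step: str                         # what to do first
    mitigation_deadline_min: Optional[int]  # None where the tree states no deadline
    then: str                               # what follows the first step


# Transcription of the severity tree; the structure is an assumption.
TRIAGE = {
    Severity.SEV1: TriagePlan("mitigate", None, "investigate after mitigation"),
    Severity.SEV2: TriagePlan("mitigate", 30, "investigate in parallel"),
    Severity.SEV3: TriagePlan("investigate", None, "mitigate once the root cause is found"),
    Severity.SEV4: TriagePlan("schedule", None, "handle during normal hours"),
}


def triage(severity: Severity) -> TriagePlan:
    """Return the plan the severity tree prescribes for this level."""
    return TRIAGE[severity]
```

Keeping the policy in a table rather than in branching code makes it easy to compare against the tree line by line when the policy changes.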
Mitigation options (SEV1/2):
├─ Roll back recent deploy → fastest if cause is recent
├─ Failover to replica → if database/region issue
├─ Disable feature flag → if isolatable to feature
├─ Scale up → if load-related
└─ Enable circuit breaker → if downstream issue
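
One way to apply the mitigation options during a SEV1/2 is to match observed signals against the list and keep the list's order. This is a sketch under assumptions: the signal labels (`recent_deploy`, `load_related`, and so on) and the `candidate_mitigations` helper are hypothetical names introduced for illustration; only the actions and their conditions come from the list above.

```python
from __future__ import annotations

# (signal, action) pairs in the same order as the mitigation options list.
# The signal names are hypothetical labels for the conditions in that list.
MITIGATIONS: list[tuple[str, str]] = [
    ("recent_deploy", "roll back recent deploy"),
    ("database_or_region_issue", "fail over to replica"),
    ("isolated_to_feature", "disable feature flag"),
    ("load_related", "scale up"),
    ("downstream_issue", "enable circuit breaker"),
]


def candidate_mitigations(signals: set[str]) -> list[str]:
    """Return the actions whose condition was observed, in list order."""
    return [action for signal, action in MITIGATIONS if signal in signals]


if __name__ == "__main__":
    # Example: a deploy went out recently and traffic is unusually high.
    for action in candidate_mitigations({"load_related", "recent_deploy"}):
        print(action)
```

The helper only narrows the options; choosing among the remaining candidates is still a judgment call for whoever is handling the incident.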
Use `supervibe:systematic-debugging` for root cause.

Write the postmortem to `.supervibe/artifacts/postmortems/YYYY-MM-DD-<incident>.md`:
# Postmortem: <Title>
**Date:** YYYY-MM-DD
**Duration:** <start> to <end> (X minutes)
**Severity:** SEV1/2/3/4
**Author:** <name>
## Summary
<2-3 sentences: what happened, impact, resolution>
## Timeline
- HH:MM — <event>
- HH:MM — <event>
## Root cause
<technical explanation>
## Detection
<how was it noticed; could it have been detected earlier?>
## Mitigation
<what stopped the bleeding>
## Action items
- [ ] <prevent recurrence> — owner — due
- [ ] <improve detection> — owner — due
- [ ] <improve runbook> — owner — due
## What went well / poorly
<blame-free retrospective>
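
The file name and header fields of the template can be derived from the incident's start and end times. The sketch below is illustrative only: `postmortem_path` and `render_header` are hypothetical helpers, the directory constant mirrors the path given for the postmortem file and is assumed to be relative to the repository root, and the `<incident>` part is assumed to be a short slug.

```python
from datetime import datetime
from pathlib import Path

# Assumption: directory taken from the postmortem path, relative to the repo root.
POSTMORTEM_DIR = Path(".supervibe/artifacts/postmortems")


def postmortem_path(start: datetime, slug: str) -> Path:
    """Build YYYY-MM-DD-<incident>.md from the incident start date and a slug."""
    return POSTMORTEM_DIR / f"{start:%Y-%m-%d}-{slug}.md"


def render_header(title: str, start: datetime, end: datetime,
                  severity: str, author: str) -> str:
    """Fill in the header lines of the template, computing the duration."""
    minutes = int((end - start).total_seconds() // 60)
    return "\n".join([
        f"# Postmortem: {title}",
        f"**Date:** {start:%Y-%m-%d}",
        f"**Duration:** {start:%H:%M} to {end:%H:%M} ({minutes} minutes)",
        f"**Severity:** {severity}",
        f"**Author:** {author}",
    ])
```

Computing the duration from the two timestamps avoids a mismatch between the stated window and the minute count when the timeline is revised later.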
`supervibe:confidence-scoring` agent-output ≥9

Returns:
- `supervibe:systematic-debugging`: invoked for root-cause investigation
- `devops-sre` agent: primary user
- `supervibe:prd`: for architectural changes resulting from the postmortem