From superpowers-plus
Tests debugging hypotheses via controlled reproduction experiments. Designs tests, executes 3+ attempts with environment comparisons, and reports evidence-based verdicts.
`npx claudepluginhub bordenet/superpowers-plus --plugin superpowers-plus`

This skill uses the workspace's default tool permissions.
> **Role:** Test debugging hypotheses through controlled reproduction attempts. Confirm or reject with evidence.
> **Dispatched by:** `debug-conductor`; never invoked directly by the user.
> **Evidence type:** `ExperimentEvidence` (see `skills/_shared/evidence-schema.md`)
Dispatched by debug-conductor when a hypothesis needs testing — controlled reproduction, environment comparison, or A/B verification of a suspected root cause.
From the conductor, receive:
Refine into a testable experiment:
For each attempt:
Minimum 3 attempts — intermittent bugs need statistical confidence.
| Reproduced / Attempts | Reproductions | Confidence |
|---|---|---|
| 3/3 | 3 | High (>0.8) — hypothesis strongly supported |
| 2/3 | 2 | Medium (0.5–0.8) — likely correct but intermittent |
| 1/3 | 1 | Low (0.3–0.5) — possible but unreliable |
| 0/3 | 0 | Very Low (<0.3) — hypothesis likely wrong OR environment mismatch |
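The table above can be read as a lookup from reproduction count to confidence band. The following is a minimal sketch, not part of the skill itself: the function name `confidence_band` is hypothetical, and it assumes the success rate (reproductions divided by attempts) is compared directly against the table's thresholds, with 0.5 and 0.3 treated as inclusive lower bounds.

```python
def confidence_band(reproductions: int, attempts: int = 3) -> tuple[float, str]:
    """Map a reproduction count to (success rate, confidence band)."""
    if attempts < 3:
        raise ValueError("at least 3 attempts are required")
    if not 0 <= reproductions <= attempts:
        raise ValueError("reproductions must be between 0 and attempts")

    rate = reproductions / attempts

    # Assumption: the rate itself is used as the confidence score.
    # 0.8 exactly counts as medium because the table says "> 0.8" for high.
    if rate > 0.8:
        band = "high"
    elif rate >= 0.5:
        band = "medium"
    elif rate >= 0.3:
        band = "low"
    else:
        band = "very low"
    return rate, band


if __name__ == "__main__":
    for n in range(4):
        print(n, confidence_band(n))
```

With three attempts this gives one band per row of the table; with more attempts the same thresholds apply, but whether the skill intends that is not stated in the source.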
If reproduction succeeded:
```json
{
  "hypothesis": "Event ordering bug in async pipeline under load",
  "steps": [
    { "action": "Set event processing to async mode", "result": "Config applied", "success": true },
    { "action": "Send 50 concurrent call events", "result": "Events arrived out of order in 12/50 cases", "success": true },
    { "action": "Verify call state machine diverged", "result": "3 calls in disconnected state prematurely", "success": true }
  ],
  "outcome": "reproduced",
  "reproduced": true,
  "attempts": 3,
  "successRate": 1.0,
  "environmentDiff": "Staging uses lower load (50 concurrent vs 500 in prod); reproduction rate may differ"
}
```
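Before handing a record like the one above back to the conductor, it may help to check that its fields agree with each other. The sketch below is an illustration only: the field names are copied from the example, the authoritative schema lives in `skills/_shared/evidence-schema.md` and may differ, and `check_evidence` is a hypothetical helper. It also assumes that `reproduced` should be true exactly when at least one attempt reproduced the bug.

```python
# Field names taken from the example record; the real schema may require more or fewer.
REQUIRED_FIELDS = {"hypothesis", "steps", "outcome", "reproduced", "attempts", "successRate"}
STEP_FIELDS = {"action", "result", "success"}


def check_evidence(record: dict) -> list[str]:
    """Return a list of consistency problems; an empty list means none were found."""
    missing = REQUIRED_FIELDS - set(record)
    if missing:
        return [f"missing fields: {sorted(missing)}"]

    problems = []
    if record["attempts"] < 3:
        problems.append("fewer than 3 attempts")
    if not 0.0 <= record["successRate"] <= 1.0:
        problems.append("successRate outside [0, 1]")
    # Assumption: "reproduced" means at least one attempt reproduced the bug.
    if record["reproduced"] != (record["successRate"] > 0):
        problems.append("reproduced flag disagrees with successRate")
    for i, step in enumerate(record["steps"]):
        if not STEP_FIELDS.issubset(step):
            problems.append(f"step {i} lacks action/result/success")
    return problems
```

Calling `check_evidence` on the parsed example returns an empty list; lowering `attempts` to 2 would add a "fewer than 3 attempts" entry.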
| Pattern | Evidence Shape |
|---|---|
| Deterministic bug | 3/3 reproduction, 0/3 without condition → confirmed |
| Load-dependent bug | Reproduces only above certain concurrency threshold |
| Environment-specific | Reproduces in prod-like environment but not staging → config/infra difference |
| Intermittent / race condition | 1–2/3 reproduction → timing-dependent |
| Hypothesis disproven | 0/3 reproduction even with condition, and no environment mismatch → reject hypothesis |
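Three of the patterns above can be told apart from two counts: reproductions with the suspected condition applied and reproductions in control runs without it. The following sketch assumes that framing; `classify_pattern` and its labels are hypothetical names, and the load-dependent and environment-specific rows are left out because they need comparisons across load levels or environments rather than a single pair of counts.

```python
def classify_pattern(with_condition: int, without_condition: int, attempts: int = 3) -> str:
    """Classify an experiment from reproduction counts with and without the suspected condition."""
    for count in (with_condition, without_condition):
        if not 0 <= count <= attempts:
            raise ValueError("counts must be between 0 and attempts")

    if with_condition == 0:
        # Never reproduced even with the condition applied.
        return "hypothesis disproven"
    if with_condition < attempts:
        # Reproduced in some attempts only: timing-dependent.
        return "intermittent / race condition"
    if without_condition == 0:
        # Every attempt with the condition, none without it.
        return "deterministic bug"
    # Reproduces every time with the condition but also without it:
    # the condition is not the only trigger, so this sketch does not decide.
    return "inconclusive"
```

For example, `classify_pattern(3, 0)` returns `"deterministic bug"` and `classify_pattern(2, 0)` returns `"intermittent / race condition"`.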
| Mode | Symptom | Recovery |
|---|---|---|
| Incomplete isolation | Test affected by shared state | Reset environment between experiments |
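| False confirmation | Coincidental success in reproduction | Run at least 3 trials and compare against runs without the suspected condition |
| Wrong variable | Experiment varies a factor unrelated to the reported symptoms | Verify the hypothesis explains the observed symptoms before testing |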
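Two of the recoveries in the table above, resetting the environment between experiments and running multiple trials, can be combined in one small loop. This is a sketch under stated assumptions rather than the skill's actual runner: `run_trials`, `attempt`, and `reset` are hypothetical names, and the caller is assumed to supply a callable that returns `True` when the bug reproduces.

```python
from typing import Callable


def run_trials(
    attempt: Callable[[], bool],
    reset: Callable[[], None],
    attempts: int = 3,
) -> list[bool]:
    """Run `attempts` trials, resetting shared state before each one."""
    if attempts < 3:
        raise ValueError("at least 3 attempts are required")
    results = []
    for _ in range(attempts):
        reset()  # start every trial from a clean state to avoid incomplete isolation
        results.append(bool(attempt()))
    return results


if __name__ == "__main__":
    # Toy environment: the "bug" only shows on the first event after a reset.
    state = {"events": 0}

    def reset() -> None:
        state["events"] = 0

    def attempt() -> bool:
        state["events"] += 1
        return state["events"] == 1

    print(run_trials(attempt, reset))
```

In the toy environment, skipping the reset would let state leak from one trial into the next, which is the "incomplete isolation" failure mode in the first row.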