Skill

diagnose

Performs disciplined root-cause analysis for software bugs and crashes using a systematic hypothesis-falsification workflow. Accepts a symptom description or error trace, and produces a structured diagnosis report detailing the symptom, root cause, fix, and reproduction feedback loop. Trigger on: 'debug', 'fix crash', 'why is this failing', 'unexpected output', 'diagnose bug', 'root cause analysis', 'feedback loop', 'instrumentation'. Also triggers when troubleshooting test failures or diagnosing unexpected runtime exceptions. Always prefer this skill over test-driven-development or refactor when diagnosing a bug prior to implementing a fix.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/claude-agent-dev:diagnose [symptom description or error trace]

User invocable

Model invocable

Inline context

Default effort

Argument hint[symptom description or error trace]

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Identify true root cause through systematic falsification. **DO NOT GUESS.**

Supporting Files

evals/evals.jsonreferences/feedback-loops.mdreferences/phases.md

SKILL.md

106 lines · ~1k tokens

Stats

LanguagePython

Stars0

MaintenanceExcellent

Last CommitJun 24, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

diagnose

Identify true root cause through systematic falsification. DO NOT GUESS.

Process Flow

Phase 1: Build Feedback Loop (pass/fail signal)
  -> Phase 2: Reproduce (confirm bug)
  -> Phase 3: Hypothesize & Falsify (3-5 hypotheses)
       -- falsified --> retry Phase 3 with new hypotheses
  -> Phase 4: Instrumentation (targeted probes)
  -> Phase 5: Red-Green Fix (regression test)
  -> Phase 6: Finalization (de-instrument / verify)

trigger: debug, fix crash, unexpected behavior constraint: apply 1 hypothesis per run constraint: modify working copy only constraint: reject "works on my machine"

Phase 1: Feedback Loop

action: create <2s deterministic pass/fail signal action: isolate filesystem, pin seeds/time mandatory: read references/feedback-loops.md (do NOT load references/phases.md) gate: require loop or request telemetry/logs

Phase 2: Reproduce

action: achieve >50% reproduction rate gate: require logged repro signal before Phase 3

Phase 3: Hypothesize & Falsify

mandatory: read references/phases.md (do NOT load references/feedback-loops.md) action: propose 3-5 falsifiable hypotheses via AskUserQuestion (surface top 3, queue rest) format: "If [X] is the cause, then [Y] will change when I do [Z]." dispatch: use multi-agent-dispatch for independent hypotheses (require isolation: worktree) gate: require confirmed probe result (no guessing by elimination)

Phase 4: Instrumentation

action: instrument decision boundaries dynamically format: prefix logs with [DEBUG-XXXX] constraint: target logs strictly; use profilers for performance

Phase 5: Red-Green Fix

action: write regression test targeting failing seam action: confirm RED action: apply minimal fix on working copy action: confirm GREEN action: execute N-1 test (revert fix -> confirm RED -> restore fix)

Phase 6: Finalization

action: remove all [DEBUG-XXXX] tags action: verify fix via Phase 1 loop action: promote scripts to test suite or delete

Next Skills

test-driven-development: implement new logic/tests refactor: clean up 1 file/function architecting: clean up multiple files/modules planning: address major specification gaps context-optimizer: if context bloats mid-skill (long reads, many tool calls)

Transitions

verification-before-completion: re-verify in same skill test-driven-development: resume current task/phase multi-agent-development: resume current task/phase refactor: resume refactor cycle multi-agent-dispatch: resume INTEGRATE step receive-code-review: resume Step 4 Implement codebase-init: resume Failure Recovery step github-automation: resume failed script/PR step

Exclusions

test-driven-development: use for writing new feature tests refactor: use for non-bug structural issues

References

references/feedback-loops.md: setup patterns by system type references/phases.md: detailed phases, hypothesis prioritization, profiling

Output Format

symptom: [Description] root_cause: [Correct Hypothesis] fix: [Changes] feedback_loop: [Reproduction Script] prevention: [Architecture/Test improvement] next_steps: [Follow-up tasks]

diagnose

Invocation

Context Preview

Supporting Files

SKILL.md

diagnose

Invocation

Context Preview

Supporting Files

SKILL.md

diagnose

Process Flow

Phase 1: Feedback Loop

Phase 2: Reproduce

Phase 3: Hypothesize & Falsify

Phase 4: Instrumentation

Phase 5: Red-Green Fix

Phase 6: Finalization

Next Skills

Transitions

Exclusions

References

Output Format

Similar Skills

diagnose

Process Flow

Phase 1: Feedback Loop

Phase 2: Reproduce

Phase 3: Hypothesize & Falsify

Phase 4: Instrumentation

Phase 5: Red-Green Fix

Phase 6: Finalization

Next Skills

Transitions

Exclusions

References

Output Format

Similar Skills