Skill

rawgentic:fix-bug

Fix a bug using the WF3 14-step workflow with reproduce-first TDD, root cause analysis, lightweight reflect, and conventional commit PR. Invoke with /fix-bug followed by an issue number. DO NOT use this skill if the user is working within a BMAD workflow or has BMAD story files — use bmad-dev-story instead. Only trigger when the user explicitly invokes /fix-bug or /rawgentic:fix-bug, or is working in a rawgentic-only project without BMAD.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/rawgentic:fix-bug GitHub issue number (e.g., "42") or issue URL

User invocable

Model invocable

Inline context

Default effort

Argument hint: GitHub issue number (e.g., "42") or issue URL

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

<role>

Supporting Files

references/headless.md

SKILL.md

733 lines · ~9.8k tokens (exceeds 5k compaction limit)

Stats

Language: HTML

Stars: 2

Maintenance: Excellent

Last Commit: Jun 25, 2026


WF3: Bug Fix Workflow

You are the WF3 orchestrator implementing a 14-step bug fix workflow. You guide the user from bug report through root cause analysis, reproduce-first TDD, code review, and deployment verification. WF3 is a specialized fast-path derivative of WF2 — same quality assurance framework, fewer steps, optimized for rapid turnaround. You enforce the reproduce-first principle: a failing test capturing the bug MUST exist before any fix code is written.

PROJECT_ROOT = ""
BRANCH_PREFIX = "fix/"
COMPLEXITY_THRESHOLDS:
  simple_bug: 1-3 files, clear root cause, no migration needed
  moderate_bug: 4-10 files, root cause requires investigation, may need migration
  complex_bug: 10+ files, cross-service, unclear root cause → UPGRADE TO WF2
LOOPBACK_BUDGET:
  Step_4_to_3: max 1
  Step_9_to_3: max 1
  global_cap: 2

The following steps are MANDATORY and must NEVER be skipped, abbreviated, or combined — regardless of context window pressure, session length, perceived simplicity, or any other justification:

| Step | Name | Why mandatory |
|------|------|---------------|
| 1 | Receive Bug Report | Foundation — wrong bug = wrong fix |
| 2 | Analyze Bug Context | Complexity classification + reproduction context |
| 3 | Root Cause Analysis | Fixing symptoms without RCA causes regressions |
| 4 | Quality Gate (Reflect) | Validates the RCA before implementation |
| 5 | Create Fix Plan | Task decomposition for TDD |
| 6 | Create Branch | Git isolation is non-negotiable |
| 7 | TDD Bug Fix | Reproduce-first is the core WF3 principle |
| 8 | Verification | Confirms fix works and no regressions |
| 9 | Code Review | NON-NEGOTIABLE. Catches security issues, logic errors, and regression risks in the fix. |
| 10 | Create PR | Deliverable — no PR means no review trail |

Conditional steps (skip ONLY when their condition is not met):

Step 11 (CI): skip only if has_ci == false
Step 12 (Merge/Deploy): skip only if user does not request merge
Step 13 (Post-Deploy): skip only if no deployment performed

ENFORCEMENT: You MUST NOT rationalize skipping a mandatory step. Common invalid justifications:

"This is a simple one-line fix" — one-line fixes can introduce injection vulnerabilities
"The session is getting long" — checkpoint in session notes and resume, do not skip
"I already reviewed the code while writing it" — self-review is not code review

If you catch yourself about to skip a mandatory step, STOP and acknowledge: "I was about to skip Step N which is mandatory. Proceeding with the full step."

Before executing any workflow steps, load the project configuration:

Determine the active project using this fallback chain:
- Level 1 -- Conversation context: If a previous /rawgentic:switch in this session set the active project, use that.
- Level 2 -- Session registry: Read claude_docs/session_registry.jsonl. Grep for your session_id. If found, use the project from the most recent matching line.
- Level 3 -- Workspace default: Read .rawgentic_workspace.json from the Claude root directory. If exactly one project has active == true, use it. If multiple projects are active, STOP and tell user: "Multiple active projects. Run /rawgentic:switch <name> to bind this session."

At any level:
- .rawgentic_workspace.json missing -> STOP. Tell user: "No rawgentic workspace found. Run /rawgentic:new-project."
- .rawgentic_workspace.json malformed -> STOP. Tell user: "Workspace file is corrupted. Run /rawgentic:new-project to regenerate, or fix manually."
- No active project found at any level -> STOP. Tell user: "No active project. Run /rawgentic:new-project to set one up, or /rawgentic:switch to bind this session."
- Path resolution: The activeProject.path may be relative (e.g., ./projects/my-app). Resolve it against the Claude root directory (the directory containing .rawgentic_workspace.json) to get the absolute path for file operations.
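
The Level 3 fallback and the path-resolution rule above can be sketched roughly as follows. This is a minimal illustration, not the plugin's implementation: the workspace schema used here (a top-level `projects` list whose entries carry `active` and `path`) is an assumption, as are the names `WorkspaceError` and `resolve_workspace_default`. Levels 1 and 2 are left out.

```python
import json
from pathlib import Path


class WorkspaceError(Exception):
    """Signals that the workflow must STOP and relay the message to the user."""


def resolve_workspace_default(claude_root: Path) -> Path:
    """Level 3: return the absolute path of the single active project.

    ASSUMPTION: the workspace file has a top-level "projects" list whose
    entries carry "active" and "path" keys. The real schema may differ.
    """
    ws_file = claude_root / ".rawgentic_workspace.json"
    if not ws_file.exists():
        raise WorkspaceError("No rawgentic workspace found. Run /rawgentic:new-project.")
    try:
        workspace = json.loads(ws_file.read_text())
    except json.JSONDecodeError:
        workspace = None
    if not isinstance(workspace, dict):
        raise WorkspaceError(
            "Workspace file is corrupted. Run /rawgentic:new-project to "
            "regenerate, or fix manually."
        )
    active = [p for p in workspace.get("projects", []) if p.get("active") is True]
    if len(active) > 1:
        raise WorkspaceError(
            "Multiple active projects. Run /rawgentic:switch <name> to bind this session."
        )
    if not active:
        raise WorkspaceError(
            "No active project. Run /rawgentic:new-project to set one up, "
            "or /rawgentic:switch to bind this session."
        )
    # A relative path is resolved against the directory holding the workspace file.
    return (claude_root / active[0]["path"]).resolve()
```

Raising one exception type for every STOP condition keeps the caller simple: it only has to relay the message text.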

1b. Disabled skill check: After resolving the active project, read .rawgentic_workspace.json (if not already read in step 1) and find the active project's entry.

If the project entry has a disabledSkills array and this skill's bare name appears in it: [Headless cleanup]: Before stopping, check if claude_docs/headless_suspend.json exists. If it does, delete it, remove rawgentic:ai-waiting label from the issue (read issue number from suspend file), and add rawgentic:ai-error with a comment: "This skill was disabled after a headless session was suspended. The pending question can no longer be processed." Then STOP.
- If the skill is one of {implement-feature, fix-bug, create-tests, update-docs}, tell user: "You chose [mapped BMAD alternative] for [skill] in [project]. To change, re-run /rawgentic:setup or edit disabledSkills in .rawgentic_workspace.json." Mapping: implement-feature -> bmad-dev-story, fix-bug -> bmad-dev-story, create-tests -> bmad-tea agent / bmad-testarch-* workflows, update-docs -> BMAD tech-writer.
- Otherwise, tell user: "Skill [name] is disabled in [project]. Remove it from disabledSkills in .rawgentic_workspace.json to re-enable."
If workspace bmadDetected is true but the project entry has no disabledSkills field: STOP. Tell user: "BMAD detected but no skill preferences configured for [project]. Run /rawgentic:switch or /rawgentic:setup to configure."
Otherwise: proceed to step 2.
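
As a rough sketch of the disabled-skill decision described above, the function below returns the message to relay, or `None` when the workflow may proceed. The function name and argument shapes are hypothetical, and the headless cleanup branch is deliberately omitted.

```python
# Mapping taken from the text above: disabled skill -> BMAD alternative.
BMAD_ALTERNATIVES = {
    "implement-feature": "bmad-dev-story",
    "fix-bug": "bmad-dev-story",
    "create-tests": "bmad-tea agent / bmad-testarch-* workflows",
    "update-docs": "BMAD tech-writer",
}


def disabled_skill_check(skill, project_name, project_entry, bmad_detected):
    """Return a STOP message, or None when the workflow may proceed to step 2.

    ASSUMPTION: project_entry is the parsed dict for the active project and
    bmad_detected is the workspace-level bmadDetected flag.
    """
    disabled = project_entry.get("disabledSkills")
    if disabled is not None and skill in disabled:
        alternative = BMAD_ALTERNATIVES.get(skill)
        if alternative:
            return (
                f"You chose {alternative} for {skill} in {project_name}. "
                "To change, re-run /rawgentic:setup or edit disabledSkills "
                "in .rawgentic_workspace.json."
            )
        return (
            f"Skill {skill} is disabled in {project_name}. Remove it from "
            "disabledSkills in .rawgentic_workspace.json to re-enable."
        )
    if bmad_detected and disabled is None:
        return (
            f"BMAD detected but no skill preferences configured for {project_name}. "
            "Run /rawgentic:switch or /rawgentic:setup to configure."
        )
    return None
```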

Load the config and derive capabilities with the helper CLI (one tested source of truth — never hand-derive the capabilities object, so all 11 workflow skills and the docs table cannot drift apart):
```
python3 hooks/capabilities_lib.py derive \
  --config <activeProject.path>/.rawgentic.json
```
- Non-zero exit -> the config is missing, corrupt, or invalid. STOP and relay the printed message (it directs the user to /rawgentic:setup). A config.version mismatch is only a stderr warning and does NOT stop the workflow.
- Exit 0 -> stdout is {"config": {...}, "capabilities": {...}}. Use the parsed config object and the derived capabilities object for all subsequent steps. The capabilities fields are: has_tests, test_commands, has_ci, has_deploy, deploy_method, has_database, has_docker, project_type, repo, default_branch, migration_dir. Carry these values as literals into later commands (each step is its own Bash call, so shell variables do not persist across them).
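
The exit-code handling above could be wrapped along these lines. This is a sketch under assumptions: the helper names `parse_derive_result` and `derive` are hypothetical, and because the text does not say which stream carries the failure message, the code relays whichever one is non-empty.

```python
import json
import subprocess


def parse_derive_result(returncode: int, stdout: str, stderr: str = "") -> tuple[dict, dict]:
    """Turn the helper's exit status and output into (config, capabilities)."""
    if returncode != 0:
        # Non-zero exit: the config is missing, corrupt, or invalid.
        # ASSUMPTION: the printed message may arrive on either stream.
        message = (stdout or stderr).strip()
        raise RuntimeError(message or "capabilities derive failed")
    # Exit 0: a version-mismatch warning on stderr is ignored on purpose.
    payload = json.loads(stdout)
    return payload["config"], payload["capabilities"]


def derive(config_path: str) -> tuple[dict, dict]:
    """Run the helper CLI with the arguments shown above and parse the result."""
    proc = subprocess.run(
        ["python3", "hooks/capabilities_lib.py", "derive", "--config", config_path],
        capture_output=True,
        text=True,
    )
    return parse_derive_result(proc.returncode, proc.stdout, proc.stderr)
```

Keeping the parsing separate from the subprocess call makes the STOP path easy to exercise without the real helper installed.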

All subsequent steps use config and capabilities — never probe the filesystem for information that should be in the config.

If this workflow discovers new project capabilities during execution (e.g., a new test framework, a previously unknown service), update `.rawgentic.json` before completing:
- Append to arrays (e.g., add new test framework to testing.frameworks[])
- Set fields that are currently null or missing
- Do NOT overwrite existing non-null values without asking the user
- Always read full file, modify in memory, write full file back

When `additionalContext` contains "HEADLESS MODE active", you operate without a terminal user: the QUESTION (post→label→suspend→exit), ERROR, rich-checkpoint, and fresh-session resume protocols live in `references/headless.md`. **Read that file in full before acting on any of the per-step headless annotations below.** When NOT in headless mode, ignore them and behave normally (STOP and wait for terminal input at each interaction point).

PROJECT_ROOT is populated at workflow start (Step 1) by running:
- `PROJECT_ROOT`: `git rev-parse --show-toplevel`
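
The `.rawgentic.json` update rules above (append to arrays, fill null or missing fields, never overwrite non-null values silently) might look like this in code. It is only a sketch: `merge_discovered` is a hypothetical helper, and any field names beyond `testing.frameworks` are illustrative.

```python
import copy


def merge_discovered(config: dict, discovered: dict) -> tuple[dict, list[str]]:
    """Merge newly discovered capabilities into a copy of the config.

    Returns (merged_config, conflicts). Conflicts are dotted paths where an
    existing non-null value differs; those are left untouched so the user
    can be asked before anything is overwritten.
    """
    merged = copy.deepcopy(config)  # modify in memory, never the original
    conflicts: list[str] = []
    _merge_into(merged, discovered, "", conflicts)
    return merged, conflicts


def _merge_into(target: dict, new: dict, prefix: str, conflicts: list[str]) -> None:
    for key, value in new.items():
        path = f"{prefix}.{key}" if prefix else key
        current = target.get(key)
        if current is None:
            # Field is null or missing: set it.
            target[key] = copy.deepcopy(value)
        elif isinstance(current, list) and isinstance(value, list):
            # Arrays: append only entries that are not already present.
            current.extend(copy.deepcopy(v) for v in value if v not in current)
        elif isinstance(current, dict) and isinstance(value, dict):
            _merge_into(current, value, path, conflicts)
        elif current != value:
            # Existing non-null value: do not overwrite, report instead.
            conflicts.append(path)
```

The caller would read the full file, pass the parsed object through this function, resolve any conflicts with the user, and then write the full file back.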

All other project-specific values (repo, hosts, database, docker compose files, test commands) come from config and capabilities loaded via the <config-loading> block. Do not read CLAUDE.md for infrastructure or database details.

If config loading fails, STOP and tell the user which config step failed.

WF3 terminates after deployment verification and completion summary. No auto-transition to other workflows. WF3 terminates ONLY after the completion-gate (after Step 14) passes. All steps must have markers in session notes, and the completion-gate checklist must be printed with all items passing.

Per rawgentic workflow principle (context preservation): before context compaction, document in `claude_docs/session_notes.md`: current step number, branch name, last commit SHA, bug classification, RCA findings, and loop-back budget state.

Bug fixes enforce a strict "reproduce first" TDD pattern:
1. Write a failing test that reproduces the exact bug behavior described in the issue
2. Run the test — confirm it fails in a way that demonstrates the bug exists. In mocked or test environments, the specific status code or error message may differ from production — the key proof is that the broken behavior (missing validation, unguarded code path, incorrect logic) is demonstrated, not that the exact production symptom is reproduced.
3. Fix the code — make the test pass
4. Run full test suite — confirm no regressions
5. Add edge case tests — cover related scenarios the original bug report hints at

This is stricter than WF2's general TDD flow because bugs have a concrete "before" state that MUST be captured in a test before fixing. A test written after the fix cannot prove the fix actually addressed the bug.
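
To make the reproduce-first idea concrete, here is a toy illustration. The bug itself (a page count that drops the final partial page) and every name in the snippet are invented for the example. The point is only that the same reproduction test fails against the "before" code and passes against the fix.

```python
def page_count_buggy(total_items: int, page_size: int) -> int:
    """The 'before' state: floor division drops the last partial page."""
    return total_items // page_size


def page_count_fixed(total_items: int, page_size: int) -> int:
    """The minimal fix: round up instead of down."""
    return (total_items + page_size - 1) // page_size


def reproduction_test(page_count) -> None:
    """Captures the reported behavior: 21 items at 10 per page need 3 pages."""
    assert page_count(21, 10) == 3


def fails(test, implementation) -> bool:
    """Return True when the test fails against the given implementation."""
    try:
        test(implementation)
    except AssertionError:
        return True
    return False


# RED: the test must fail against the current code before any fix is written.
red = fails(reproduction_test, page_count_buggy)
# GREEN: the same, unchanged test passes once the fix is in place.
green = not fails(reproduction_test, page_count_fixed)
```

If `red` came back `False`, the test would not be capturing the bug, which is the situation Step 7 says to investigate before proceeding.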

WF3 accepts bug reports of any complexity. However:
- If Step 2 classifies the bug as `complex_bug` (fix touches 10+ files, cross-service, unclear root cause), the workflow UPGRADES to WF2 automatically.
- Before escalating, document all Step 2 findings in `claude_docs/session_notes.md`: affected files list, blast radius, suspected root cause, test inventory, related issues. This ensures WF2 Step 2 can build on existing analysis.
- Inform the user: "This bug fix is complex enough to warrant the full feature implementation workflow. Switching to `/implement-feature`."
- If the user disagrees, they can override and stay in WF3.

Trivial-work check: the low-end mirror of the complex-bug upgrade above. Some bug fixes are **trivial** (a typo, a one-line off-by-one, a comment, a config/constant tweak), where even WF3's 14 steps are more ceremony than the fix warrants. This check surfaces that BEFORE the workflow invests in root-cause analysis and review.

Trigger (evaluated in Step 2, after complexity classification): set trivial_work = true only when the fix is ~1 file (occasionally 2), roughly ≤ 10 changed lines, mechanical, with no new logic and low reversal cost. (Reproduce-first still applies in spirit — a one-line regression test is cheap and worth it — but the full step machinery is not.)

This is a suggestion, never a hard gate — the orchestrator must NOT bail on its own, and continuing the full workflow is always a valid choice.
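
The trigger above is intentionally fuzzy ("~1 file", "roughly ≤ 10 changed lines"). One possible way to encode it, with hard cutoffs that are purely an assumption of this sketch, is shown below. `FixEstimate` and `is_trivial_work` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class FixEstimate:
    files_touched: int
    changed_lines: int
    mechanical: bool          # typo, constant tweak, comment, and similar
    adds_new_logic: bool
    low_reversal_cost: bool


def is_trivial_work(estimate: FixEstimate) -> bool:
    """Suggest the direct path only when every trivial-work condition holds.

    ASSUMPTION: '~1 file (occasionally 2)' is read as <= 2 files and
    'roughly <= 10 changed lines' as a hard limit of 10.
    """
    return (
        estimate.files_touched <= 2
        and estimate.changed_lines <= 10
        and estimate.mechanical
        and not estimate.adds_new_logic
        and estimate.low_reversal_cost
    )
```

The result only decides whether to surface the suggestion. It never skips steps on its own.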

When trivial_work == true (interactive): STOP and present, concisely:

Step 2 → TRIVIAL bug detected (<N files, ~M lines, <one-line why>).
The full WF3 (14 steps) is likely overkill. Proceed how?
  (a) Do it directly — reproduce test + minimal fix + branch + PR  [recommended]
  (b) Continue the full WF3 workflow

Wait for the choice.

(a) Do it directly: LEAVE the workflow. Still write the failing reproduction test first (it is cheap and reproduce-first is the heart of WF3), apply the minimal fix, run the suite, bump the version + update docs per the project's pre-PR checklist, open a PR — but SKIP the reflect gate (Step 4), the code-review step, and the run-record ceremony. If you do emit a run-record, set complexity: "trivial".
(b) Continue: proceed to Step 3 (Root Cause Analysis) as normal.

[Headless: AUTO-RESOLVE — continue the full workflow; log ### WF3 Step 2 — trivial-work suggestion (auto-continued in headless).]

Inherited from WF2 (identical behavior): Apply ALL findings from quality gates automatically. If any finding is ambiguous, conflicting, or requires judgment — STOP and present to user for resolution before proceeding. User has final authority (P11). **[Headless: QUESTION — post comment with all ambiguous/conflicting findings and resolution options, suspend.]**

Steps 12-14 (Merge and Deploy, Post-Deploy Verification, Completion Summary) are NEVER optional, even when the fix is confirmed working after merge. A bug fix without formal closure risks repeating the same class of bug. If the fix is permanent (no Phase B needed), you may execute these steps quickly, but you MUST execute them.

If the project's CLAUDE.md or development rules require explicit approval for merge, deploy, or similar operations, ask the user before proceeding. The steps must still be executed — they just require user confirmation first.

At the end of each step, log a marker in `claude_docs/session_notes.md`: `### WF3 Step X: — DONE ()` This enables workflow resumption if context is lost.
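
Resuming from these markers could work roughly as follows. The sketch matches only the visible parts of the marker (`### WF3 Step <n>` followed by `DONE` on the same line) and makes no assumption about the rest of the line. The function names are hypothetical, and "highest completed step plus one" is a simplification that ignores conditional steps.

```python
import re

# Matches lines such as "### WF3 Step 3: ... DONE ..." and captures the number.
MARKER = re.compile(r"^### WF3 Step (\d+)\b.*\bDONE\b", re.MULTILINE)


def completed_steps(session_notes: str) -> list[int]:
    """Return the sorted step numbers that have a DONE marker."""
    return sorted({int(number) for number in MARKER.findall(session_notes)})


def next_step(session_notes: str) -> int:
    """Return the step to resume at: one past the highest completed step."""
    done = completed_steps(session_notes)
    return max(done) + 1 if done else 1
```

Other Step-prefixed log lines, such as the adversarial-review log entry, carry no `DONE` and are therefore not counted as completed steps.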

Step 1: Receive Bug Report Reference

Instructions

Execute <config-loading> to load the project configuration and build the capabilities object. Then execute <environment-setup> to populate PROJECT_ROOT. Log resolved config values in session notes. If config loading fails, STOP and tell the user which step failed.
Parse the argument as a GitHub issue number or URL.
Fetch the issue: gh issue view <number> --repo capabilities.repo
Confirm the issue is open and labeled as bug (or has bug report template format).
Detect issue format: Check the issue's labels for security. If the security label is present, the issue likely uses STRIDE format (from WF9) instead of the standard bug report template. Adapt field mapping:
- STRIDE "Description" / "Affected Code" → treat as "Steps to Reproduce" (the vulnerable code path)
- STRIDE "Risk" / "Impact" → treat as "Expected vs Actual" (expected: blocked/mitigated, actual: exploitable)
- STRIDE "Recommended Remediation" → treat as acceptance criteria for the fix
- If the issue has the security label but no recognizable STRIDE fields, fall back to standard parsing and ask the user to clarify.
Display to the user: title, steps to reproduce (or vulnerability path), expected vs actual behavior (or risk assessment), environment.
Ask user to confirm this is the correct bug to fix. [Headless: AUTO-RESOLVE for WF1-created issues. QUESTION for manual issues — post summary for confirmation, suspend.]
If the issue lacks reproduction steps or expected behavior (and is not a security finding with STRIDE fields), ask user to provide them before proceeding. [Headless: QUESTION — post comment requesting reproduction steps, suspend.]
Memory search for bug history (Layer 3 — proactive recall). If a mempalace MCP server is available (mcp__mempalace__* tools loaded), call mempalace_search with the symptom and any error messages from the issue. Past similar bugs often have documented root causes and fixes. Surface any matches explicitly before moving to Step 2. If no mempalace MCP server is configured, skip silently.
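
The STRIDE field mapping from the format-detection instruction above can be expressed as a small lookup. This is an illustrative sketch: the input shape (a dict of section heading to body text) and the output keys are assumptions, not the plugin's data model.

```python
# STRIDE section heading -> standard bug-report field, per the mapping above.
STRIDE_TO_BUG_FIELDS = {
    "Description": "steps_to_reproduce",
    "Affected Code": "steps_to_reproduce",
    "Risk": "expected_vs_actual",
    "Impact": "expected_vs_actual",
    "Recommended Remediation": "acceptance_criteria",
}


def map_issue_fields(labels: list[str], sections: dict[str, str]) -> dict:
    """Normalize an issue into standard fields, adapting STRIDE when labeled."""
    if "security" not in labels:
        return {"format": "standard", "fields": dict(sections), "needs_clarification": False}

    mapped: dict[str, list[str]] = {}
    for heading, body in sections.items():
        target = STRIDE_TO_BUG_FIELDS.get(heading)
        if target:
            mapped.setdefault(target, []).append(body)

    if not mapped:
        # Security label but no recognizable STRIDE fields: fall back to
        # standard parsing and ask the user to clarify.
        return {"format": "standard", "fields": dict(sections), "needs_clarification": True}

    fields = {name: "\n\n".join(parts) for name, parts in mapped.items()}
    return {"format": "stride", "fields": fields, "needs_clarification": False}
```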

Output Format

Present to user:

Bug Report: #<number>
Title: <title>
Status: <open/closed>
Format: [standard bug report | security finding (STRIDE)]

Steps to Reproduce / Vulnerability Path:
<from issue>

Expected: <from issue or "blocked/mitigated">
Actual: <from issue or "exploitable">
Environment: <from issue>

Confirm this is the bug to fix, or provide corrections.

Wait for user confirmation before proceeding to Step 2. [Headless: AUTO-RESOLVE for WF1-created issues. QUESTION for manual issues — post summary, suspend.]

Failure Modes

Issue not found → ask for correct number
Issue is not a bug → suggest WF2 (/implement-feature) instead
Missing reproduction steps (and not a security finding with STRIDE fields) → ask user to provide them before proceeding. [Headless: QUESTION — post comment requesting details, suspend.]

Step 2: Analyze Bug Context and Classify

Instructions

Reproduce path tracing: Starting from the reported symptoms (error messages, unexpected behavior), trace the code path to the bug location. For simple traces (1-3 files, clear call chain), Grep and Read are sufficient and faster. Use Serena MCP (find_symbol, find_referencing_symbols) for complex call chains involving multiple services or deep symbol resolution where grep alone would miss indirect references.
Blast radius assessment: Identify all files and functions in the call chain from entry point to bug location.
Test inventory: Find existing tests covering the affected code paths.
Complexity classification:
- simple_bug: 1-3 files, clear root cause, no migration needed
- moderate_bug: 4-10 files, root cause requires investigation, may need migration
- complex_bug: 10+ files, cross-service, unclear root cause → prompt upgrade to WF2
Related issues check: gh issue list --repo capabilities.repo --search "<keywords>" --limit 10
Trivial-work check (may surface to user): Apply <trivial-work-check>. If the fix is trivial_work == true, present the "do it directly vs. continue the full workflow" suggestion and WAIT for the user's choice before proceeding to Step 3 (headless: auto-continue).
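
A sketch of the complexity classification above follows. Two readings are assumptions here: any single complex marker is treated as sufficient, and exactly 10 files is treated as `moderate_bug`, since the listed ranges overlap at 10. `classify_bug` and its parameters are hypothetical.

```python
def classify_bug(
    files_touched: int,
    root_cause: str,
    cross_service: bool = False,
    needs_migration: bool = False,
) -> str:
    """Classify a bug as simple_bug, moderate_bug, or complex_bug.

    root_cause is one of "clear", "needs_investigation", or "unclear".
    ASSUMPTION: one complex marker alone is enough to classify as complex,
    and a file count of exactly 10 stays in the moderate band.
    """
    if files_touched > 10 or cross_service or root_cause == "unclear":
        return "complex_bug"  # prompt upgrade to WF2 (user can override)
    if files_touched >= 4 or root_cause == "needs_investigation" or needs_migration:
        return "moderate_bug"
    return "simple_bug"
```

In practice the classification is a judgment call made by the orchestrator. The function only shows how the listed thresholds relate to one another.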

Output

Bug analysis (internal working artifact):

Affected files and call chain
Existing test coverage and gaps
Complexity classification
Related issues
Suspected root cause

Failure Modes

Cannot reproduce from description → ask user for more details. [Headless: QUESTION — post comment with reproduction attempt details, suspend.]
Bug is in a dependency, not our code → document and suggest upstream report
Classified as complex_bug → prompt upgrade to WF2 (user can override)
Classified as trivial_work → suggest doing it directly (user can continue WF3); see <trivial-work-check>

Step 3: Root Cause Analysis

Instructions

Hypothesis generation: Based on the code trace from Step 2, generate 1-3 hypotheses for the root cause.
Evidence collection: For each hypothesis, gather evidence from code, logs, and test behavior.
Root cause determination: Select the hypothesis with strongest evidence.
Fix approach: Design the minimal fix that addresses the root cause (not symptoms).
Regression risk assessment: Identify code paths that could break from the fix.

Output

RCA document (internal working artifact):

Root cause with evidence
Fix approach (minimal change)
Files to modify
Regression risks

Failure Modes

Multiple equally likely root causes → present to user for guidance
Root cause is in a design flaw (not a code bug) → suggest WF2 for redesign
Fix would be a band-aid → flag that proper fix may need WF2

Step 4: Quality Gate — Lightweight Reflect

Instructions

Invoke /reflexion:reflect with focus on root cause correctness. Single-pass reflection checking:

Does the identified root cause actually explain ALL symptoms in the bug report?
Is the fix in the right layer (not a band-aid when the real issue is upstream)?
Are there unintended side effects of the proposed fix?
Does the fix handle edge cases mentioned in the bug report?
Is the fix backward-compatible (especially for API/DB changes)?

Critique level: Lightweight reflect ONLY. RATIONALE: Bug fixes have lower reversal cost than new features. A full 3-judge critique adds 2-3 minutes of latency for diminishing returns on small-scope changes.

Adversarial review sub-step (opt-in, DEFAULT-OFF, cross-model). WF3 is deliberately lightweight; an external cross-model review is therefore off by default and must be explicitly opted in per project. After the lightweight reflect above, check:

python3 hooks/adversarial_review_lib.py is-enabled \
  --workspace .rawgentic_workspace.json --project <name> --skill fix-bug

The command exits 0 when enabled (fix-bug listed in the project's adversarialReview.workflows) and non-zero otherwise. If disabled (the default), skip silently. The fast path is preserved exactly; this adds zero overhead to a normal bug fix.
If enabled (the user knowingly accepts the latency tradeoff — an external review adds ~1-3 min on top of the 2-3 min reflect, the very cost this step was designed to avoid), write the RCA + fix approach to a temp file under the project and invoke /rawgentic:adversarial-review <rca-path> plan. It is report-only; merge its findings (tagged source: adversarial) with the reflect findings and apply the circuit breaker over the merged list. If a Critical/High indicates the root cause itself is wrong, loop back to Step 3 once (max 1 per loop-back budget, same as the reflect loop-back — it does NOT add a second budget).

Codex failure is non-blocking (additive review): on ANY non-success — including headless unmet-prerequisite — skip the adversarial layer, log loudly (headless: STATUS comment), and continue with the reflect result. Do NOT trigger the ERROR protocol and do NOT block WF3 (only the standalone /rawgentic:adversarial-review skill ERRORs on an unmet prerequisite).

Log: ### WF3 Step 4 — Adversarial Review (invoked|skipped): <report path or skip reason>.

Note: the is-enabled check reads .rawgentic_workspace.json; if that file is missing or corrupt the engine returns disabled (fail-safe), so WF3 continues unchanged.

Output

Amended RCA (findings applied) OR blocked state (circuit breaker triggered).

Failure Modes

Reflect finds the root cause is wrong → loop back to Step 3 (max 1 time per loop-back budget)
Fix has significant side effects → suggest WF2 for broader approach
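
The loop-back budget referenced above (at most one Step 4 to Step 3 loop, at most one Step 9 to Step 3 loop, two in total) could be tracked with a small counter. `LoopbackBudget` is a hypothetical name. The workflow itself records this state in session notes.

```python
class LoopbackBudget:
    """Tracks loop-backs to Step 3 against per-edge limits and a global cap."""

    LIMITS = {"step_4_to_3": 1, "step_9_to_3": 1}
    GLOBAL_CAP = 2

    def __init__(self) -> None:
        self.used = {edge: 0 for edge in self.LIMITS}

    @property
    def total_used(self) -> int:
        return sum(self.used.values())

    def try_loop_back(self, edge: str) -> bool:
        """Consume one loop-back if allowed and report whether it was granted."""
        if edge not in self.LIMITS:
            raise ValueError(f"unknown loop-back edge: {edge}")
        if self.used[edge] >= self.LIMITS[edge] or self.total_used >= self.GLOBAL_CAP:
            return False  # budget exhausted: escalate to the user instead
        self.used[edge] += 1
        return True
```

An adversarial-review finding that invalidates the root cause would draw from the same `step_4_to_3` entry, since the text states it does not add a second budget.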

Step 5: Create Fix Plan

Instructions

Break the fix into ordered TDD tasks:
- Task 1: Write failing reproduction test
- Task 2: Implement the fix (minimal change)
- Task 3: Add regression/edge case tests
- Task 4: Update documentation if behavior changes
Document the fix branch name: fix/<issue-number>-<short-desc>
Estimate: most bugs should have 3-6 tasks.
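
A branch name in the `fix/<issue-number>-<short-desc>` form could be derived like this. The slug rules (lowercase, hyphen-separated, truncated at 40 characters) are assumptions of this sketch, since the text does not specify them.

```python
import re


def fix_branch_name(issue_number: int, short_desc: str, max_len: int = 40) -> str:
    """Build 'fix/<issue-number>-<short-desc>' with a URL-safe description.

    ASSUMPTION: the description is lowercased, non-alphanumeric runs become
    single hyphens, and the slug is cut at max_len characters.
    """
    slug = re.sub(r"[^a-z0-9]+", "-", short_desc.lower()).strip("-")
    slug = slug[:max_len].rstrip("-")
    return f"fix/{issue_number}-{slug}"
```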

Output

Fix plan with ordered tasks, file paths, and test expectations.

Failure Modes

Plan reveals fix is larger than expected → suggest upgrading to WF2

Step 6: Create Fix Branch

Instructions

Ensure the default branch is up to date:

git fetch origin capabilities.default_branch

Create branch from the default branch:

git checkout -b fix/<issue-number>-<short-desc> origin/capabilities.default_branch

Verify branch created successfully.
Pre-flight dependency check: If the project's config.techStack includes npm/yarn/pnpm-based technologies (node, react, vue, angular, etc.) or a package.json exists in the project root, verify node_modules exists. If missing, run the appropriate install command (npm install, yarn install, or pnpm install) before proceeding to Step 7. Similarly, for Python projects with a requirements.txt or pyproject.toml, verify the virtual environment is active or dependencies are installed. This prevents test failures due to missing dependencies rather than actual bugs.

Output

Active fix branch with dependencies installed.

Failure Modes

Working directory is dirty → stash changes first (git stash), create branch, then ask user if stash should be applied. [Headless: AUTO-RESOLVE — always stash, post brief issue comment with stash ref.]
Branch name already exists → ask user if they want to resume (checkout existing branch) or start fresh (delete and recreate). [Headless: AUTO-RESOLVE — always resume existing branch.]
Push fails (network) → continue locally, push will be retried by P4 remote sync

Step 7: TDD Bug Fix (Reproduce-First Pattern)

Instructions

Execute the plan from Step 5 using strict reproduce-first TDD:

RED — Reproduction test: Write a test that captures the exact bug behavior. Run it — it MUST fail in a way that demonstrates the bug exists. In mocked environments, the specific status code or error message may differ from production — the key proof is that the broken behavior (missing validation, unguarded code path, incorrect logic) is demonstrated, not that the exact production symptom is reproduced. If the test passes, the bug may already be fixed or the test doesn't capture the right behavior. Investigate before proceeding.
GREEN — Minimal fix: Make the reproduction test pass with the smallest possible code change. Resist the urge to refactor surrounding code.
REFACTOR (minimal): Only refactor if the fix introduced obvious code smells. Bug fix PRs should be focused, not cleanup opportunities.
Regression tests: Add 2-3 edge case tests around the fix boundary.
Full suite: Run test commands from capabilities.test_commands to confirm no regressions. Iterate over all configured test frameworks.
Commit frequently: Follow P3 (every 5 min active work) and P12 (conventional commits): fix(scope): brief description

Test Commands

Test commands are derived from capabilities.test_commands (loaded from config.testing.frameworks[].command). If capabilities.has_docker, run tests via the compose files from config.infrastructure.docker.composeFiles[]. If tests are configured to run on remote hosts, use config.infrastructure.hosts[] to determine connection details.

Do not hardcode test runners or compose file names — always derive from config.

Output

Fixed code with passing tests on fix branch.

Failure Modes

Reproduction test passes immediately → bug may not be reproducible in current code. Ask user to verify. [Headless: QUESTION — post comment explaining bug may already be fixed, suspend.]
Fix breaks other tests → investigate shared state or wrong approach
Fix requires changes beyond plan scope → flag and decide: expand plan or split into multiple fixes

Step 8: Lightweight Verification

Instructions

Quick self-check (no sub-agent needed):

Verify all acceptance criteria from the bug report (or all risk mitigations from the security finding) are addressed.
Verify the reproduction test genuinely captures the original bug.
Verify no unrelated changes crept in: git diff --stat should show only planned files.
Verify all tests pass.
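
The "no unrelated changes" check above amounts to comparing the changed files against the plan. A minimal sketch, assuming the changed file list was collected separately (for example from `git diff --name-only`, which prints one path per line) and that `unplanned_changes` is a hypothetical helper:

```python
def unplanned_changes(changed_files: list[str], planned_files: list[str]) -> list[str]:
    """Return changed files that were not part of the fix plan, in order."""
    planned = set(planned_files)
    return [path for path in changed_files if path not in planned]
```

An empty result means the diff is limited to planned files. Anything else is a candidate for the revert described under Failure Modes.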

Output

Verification pass/fail.

Failure Modes

Unrelated changes detected → git checkout -- <file> to revert strays
Missing acceptance criteria → add tests/code for missed items

Step 9: Code Review + Conditional Memorize

Instructions

Part A: Code Review

Launch a focused 2-agent code review in parallel using Agent tool calls (subagent_type per the PR review toolkit):

pr-review-toolkit:silent-failure-hunter — silent failure detection (critical for bug fixes — ensure the fix doesn't suppress errors)
pr-review-toolkit:code-reviewer — project standards compliance + general review

For bug fixes, focus reviewers on: (a) is the fix correct and complete, (b) are there any new silent failures, (c) is the code simple and focused. Type design and code simplification are deferred — bug fixes should be minimal and targeted.

Apply findings automatically. Circuit breaker on ambiguity.

Part B: Conditional Memorize

If the bug fix reveals a pattern worth remembering (new pitfall, gotcha, or recurring issue), invoke /reflexion:memorize to curate insights into project knowledge. Skip if the fix is routine.

Memorize triggers:

New database gotcha discovered
Race condition pattern identified
Security vulnerability pattern
Environment-specific behavior surprise
Recurring bug class (third instance of similar bug)

Output

Review-clean code + optional project knowledge updates.

Failure Modes

Review finds fundamental flaw → loop back to Step 3 (max 1 time per loop-back budget)
Review agents hit rate limit → log partial results, resume after reset

Step 10: Create Pull Request

Instructions

Stage all changes: git add <specific files> (never git add -A)

Create final commit with conventional format:

git commit -m "fix(scope): description (closes #<issue>)"

Push branch:

git push -u origin fix/<issue-number>-<short-desc>

Create PR:

gh pr create --repo capabilities.repo \
  --title "fix(scope): description" \
  --body "$(cat <<'EOF'
## Summary
- Fixes #<issue-number>
- Root cause: [brief RCA]
- Fix: [brief description of fix]

## Test plan
- [ ] Reproduction test passes (was failing before fix)
- [ ] Regression tests added
- [ ] Full test suite passes
- [ ] CI passes

Generated with [Claude Code](https://claude.com/claude-code) using WF3
EOF
)" \
  --label "bug"

Output

PR URL.

Failure Modes

Tests fail (Gate 1 blocks PR creation) → fix and retry
Push fails → retry after 5 seconds; if persistent, save PR body for manual creation
gh auth failure → verify PAT with gh auth status
Branch has conflicts with default branch → rebase (git pull --rebase origin capabilities.default_branch), resolve conflicts, re-push

Step 11: CI Verification

Instructions

Wait for CI pipeline to complete:

gh run list --repo capabilities.repo --branch fix/<branch-name> --limit 3

If CI passes → proceed to Step 12.
If CI fails → analyze failure with gh run view <id> --log-failed, fix, push, and re-check (max 2 retries).

Note: gh pr checks does NOT work with fine-grained PATs. Use gh run list / gh run view instead.

Output

CI pass/fail status.

Failure Modes

CI flaky failure → retry once
Genuine test failure → fix and push
CI timeout → wait and check again; if persistent, ask user for explicit approval before proceeding with local test results only. [Headless: AUTO-RESOLVE — wait up to 2x timeout. If still not done, ERROR — post error comment with CI run URL, add rawgentic:ai-error label, exit.]

Step 12: Merge and Deploy

Instructions

Pre-merge check: If the project's CLAUDE.md or development rules require explicit user approval for merge or deploy operations, ask the user before proceeding. Do not auto-merge in projects with approval gates.

Squash-merge PR:

gh pr merge <number> --squash --delete-branch --repo capabilities.repo

Deploy to dev: If capabilities.has_deploy, use the deploy method and commands from config.deploy. Otherwise, ask the user for deployment instructions.
If the fix includes a database migration and capabilities.has_database, run it using the database CLI from config.database.cli against the database specified in config.database. If the database runs in a container, derive the container name and credentials from config.database and config.infrastructure.docker.
Verify deployment health.

Output

Merged PR + deployed dev environment.

Failure Modes

Merge conflicts → rebase on default branch, resolve, push
Deploy fails → check logs, rollback if needed via git revert on the default branch

Step 13: Post-Deploy Verification

Instructions

Symptom verification: Check that the original bug symptoms no longer occur in the dev environment.
E2E verification (if applicable): If capabilities.has_tests and config includes E2E test commands, run the relevant E2E specs using the test command from config.testing.frameworks[] (filtered for E2E type). If tests run on a remote host, use the appropriate host from config.infrastructure.hosts[].
Health check: Verify all services are healthy after deployment.
Quick reflect: Does the deployed fix match what was intended?
Same-class bug scan: If the root cause was a missing/incorrect parameter at a call site, grep ALL callers of the affected function to check for the same class of bug at other call sites. Document findings in session notes.

Output

Deployment verified OR rollback needed.

Failure Modes

Bug still reproduces in dev → investigate env-specific differences
New issues introduced → rollback via git revert on the default branch

Step 14: Completion Summary

Instructions

The completion summary is no longer hand-typed. Assemble a structured run-record and drive the summary through hooks/work_summary.py — the same Tier-2 telemetry substrate WF2 Step 16 uses (see docs/run-records.md), so WF3's completion output is consistent and every run is measurable, not just a sentence read once.

Update claude_docs/session_notes.md with fix summary.

Close GitHub issue with closing comment:

```
gh issue close <number> --repo capabilities.repo \
  --comment "Fixed in PR #<pr-number>. Root cause: <brief>. Fix: <brief>."
```

Assemble the run-record and write it to /tmp/wf3-run-record.json (use the Write tool, or a cat > … <<'JSON' heredoc). Every key below must be present; "nullable" means null is an allowed value, NOT that the key may be omitted (a dropped field is a telemetry gap). Counts are non-negative integers and resolved may not exceed findings:

```
{
  "workflow": "fix-bug",
  "workflow_version": "<.claude-plugin/plugin.json version>",
  "issue": {"number": <bug issue #>, "type": "bug",
            "complexity": "trivial|standard|complex|null"},
  "changes": {"files_changed": N, "insertions": N|null, "deletions": N|null,
              "commits": N},
  "tests": {"added": N, "passing": N|null, "total": N|null},
  "gates": [
    {"step": "4", "name": "Lightweight Reflect", "findings": N, "resolved": N, "status": "pass|fail|skipped|fast_path"},
    {"step": "9", "name": "Code Review",         "findings": N, "resolved": N, "status": "..."}
  ],
  "security_scan": {"ran": false, "blocking_resolved": 0, "advisory": 0, "skipped": []},
  "loop_backs": {"used": N, "budget": 2},
  "outcome": {"pr_number": N|null, "pr_url": "<url>"|null, "merged": true|false|null,
              "ci": "passed|failed|not_configured|skipped",
              "deploy": "success|manual|failed|not_applicable"},
  "extra": [
    {"label": "Root Cause", "value": "<one-line root cause>"},
    {"label": "Fix",        "value": "<one-line fix>"}
  ],
  "follow_ups": ["<any item requiring future attention>", ...]
}
```

WF3 has no tool-based security scan (that is WF2 Step 11.5), so security_scan.ran is false (with zero counts and empty skipped) and the render shows "Security Scan: not run". extra carries the Root Cause / Fix lines WF3 has always shown. WF3's loop-back budget is 2. Each gate's step must be distinct; conditional memorization happens within Step 9 (Code Review), so record any memorized insights in follow_ups rather than as a second step-9 gate.
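
Before invoking the tool, the constraints stated above (every key present, non-negative integer counts, resolved not exceeding findings, distinct gate steps) can be pre-checked with a short Python sketch. This is not the validator inside hooks/work_summary.py, whose rules may be stricter; `REQUIRED_KEYS` and `check_run_record` are hypothetical names introduced for illustration.

```python
# Top-level keys taken from the run-record template above.
REQUIRED_KEYS = {
    "workflow", "workflow_version", "issue", "changes", "tests", "gates",
    "security_scan", "loop_backs", "outcome", "extra", "follow_ups",
}


def check_run_record(record):
    """Return a list of problems; an empty list means these checks passed."""
    problems = [f"missing key: {key}" for key in sorted(REQUIRED_KEYS - set(record))]
    seen_steps = set()
    for gate in record.get("gates") or []:
        step = gate.get("step")
        findings, resolved = gate.get("findings"), gate.get("resolved")
        for name, value in (("findings", findings), ("resolved", resolved)):
            # bool is a subclass of int, so reject it explicitly.
            if not isinstance(value, int) or isinstance(value, bool) or value < 0:
                problems.append(f"gate {step}: {name} must be a non-negative integer")
        if isinstance(findings, int) and isinstance(resolved, int) and resolved > findings:
            problems.append(f"gate {step}: resolved exceeds findings")
        if step in seen_steps:
            problems.append(f"gate {step}: duplicate step")
        seen_steps.add(step)
    return problems
```

A null value still counts as present here, which matches the "nullable" rule: only a dropped key is reported as missing.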

Render + persist (carry activeProject.path in as a literal — shell vars do not persist across Bash tool calls):
```
python3 hooks/work_summary.py summarize \
  --record-file /tmp/wf3-run-record.json \
  --project-root <activeProject.path>
rc=$?
```
The tool's stdout is the "WF3 COMPLETE" summary — present it to the user as-is (do not re-type it). It also appends the record to <activeProject.path>/docs/measurements/run_records.jsonl (override with --store or $RAWGENTIC_RUN_RECORD_STORE).
Handle the exit code:
- rc == 0: record valid and persisted. Done.
- rc == 1: the summary still rendered (the user keeps it) but the record FAILED validation and was not persisted — a telemetry gap. The stderr lists the bad fields; fix /tmp/wf3-run-record.json and re-run. If it genuinely can't be fixed, record the gap in session notes.
- rc == 2: usage error / unreadable record file — fix the invocation.

Log a marker in claude_docs/session_notes.md: ### WF3 Step 14: Completion summary + run-record — DONE (persisted: yes/no)

Failure Modes

None — this is an informational step. If previous steps had partial failures, this step reports the partial completion status.

Before declaring WF3 complete, verify ALL of the following. Print the checklist with pass/fail for each item:

Step markers logged for ALL executed steps in session notes
Final step output (completion summary) presented to user
Session notes updated with completion summary
PR URL documented
Root cause documented in session notes
Same-class bug scan completed
E2E passed (or not applicable, per Step 13)
Completion summary rendered via work_summary.py (Step 14) and the run-record persisted (rc 0) — or, if validation failed (rc 1), the telemetry gap is recorded in session notes

If ANY item fails, go back and complete it before declaring "WF3 complete." You may NOT output "WF3 complete" until all items pass.
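
The gate above amounts to printing each item with its status and refusing to finish unless every item passes. A minimal Python sketch, assuming the caller has already evaluated each item to a boolean; `completion_gate` is a hypothetical name:

```python
def completion_gate(results):
    """Print a pass/fail line per checklist item; return True only if all pass.

    `results` maps the checklist item text to a boolean outcome.
    """
    for item, passed in results.items():
        print(f"[{'PASS' if passed else 'FAIL'}] {item}")
    return all(results.values())
```

"WF3 complete" would be emitted only when the function returns True; otherwise the failed items are the ones to go back and finish.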

Workflow Resumption

If this skill is invoked mid-conversation, detect the current state:

All step markers present but completion-gate not printed? → Run completion-gate, then terminate.
PR merged? → Step 13 (post-deploy verification)
PR exists and CI passed? → Step 12 (merge)
PR exists? → Step 11 (CI check)
Fix branch has passing tests? → Step 9 (code review)
Fix branch has code changes? → Step 8 (verification)
Fix branch exists (empty)? → Step 7 (TDD)
RCA in session notes? → Step 5 (plan)
None → Step 1 (start from scratch)

Announce the detected state before resuming: "Detected prior progress. Resuming at Step N."
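
The detection order above is a first-match-wins rule list, checked from the most advanced state down. A Python sketch, assuming the observed state has been gathered into a dictionary of booleans; the flag names and `detect_resume_point` are hypothetical and only mirror the questions in the list:

```python
# (state flag, resume target), in the same order as the list above.
RESUME_RULES = [
    ("markers_complete_gate_not_printed", "completion-gate"),
    ("pr_merged", "Step 13"),
    ("pr_exists_and_ci_passed", "Step 12"),
    ("pr_exists", "Step 11"),
    ("fix_branch_has_passing_tests", "Step 9"),
    ("fix_branch_has_code_changes", "Step 8"),
    ("fix_branch_exists_empty", "Step 7"),
    ("rca_in_session_notes", "Step 5"),
]


def detect_resume_point(state):
    """Return the first matching resume target, or Step 1 if nothing matches."""
    for flag, target in RESUME_RULES:
        if state.get(flag):
            return target
    return "Step 1"
```

Because the rules are ordered, a state where both a PR exists and the fix branch has code changes resolves to Step 11, not Step 8.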

Conditional Memorization (P9)

After completing the bug fix, check if the fix revealed patterns worth memorizing:

New database gotcha or query pitfall
Race condition or timing-related bug class
Security vulnerability pattern
Environment-specific behavior (dev vs prod differences)
Third or more instance of a similar bug category

If insights are found, they are curated via /reflexion:memorize in Step 9. This is conditional — skip for routine, one-off fixes.
