Skill

diagnose

Diagnose a failed recording-rig run (CLI or desktop backend). Use when the user reports a recording failed, the validator said FAILED, the rig hung, the cast/GIF looks wrong, the desktop model skipped its rig tools, or invokes `/recording-rig:diagnose`. Examines the cast (CLI) or bridge transcript + .mov + bridge log (desktop), sentinels under /tmp/${SESSION}.*, rendered hooks, and rig output to identify the failure mode and propose a fix.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/recording-rig:diagnose

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Identify why a recording failed and propose a specific fix. This is targeted forensics on a known-bad run — not a general "review my spec" tool.

SKILL.md

131 lines · ~2.3k tokens

Stats

LanguageJavaScript

Stars0

MaintenanceExcellent

Last CommitMay 27, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

diagnose

Identify why a recording failed and propose a specific fix. This is targeted forensics on a known-bad run — not a general "review my spec" tool.

First: which backend?

The two backends fail differently and have different artifacts. Detect the backend before walking any taxonomy:

backend: "desktop" in the spec, or a /tmp/${SESSION}.bridge-transcript.jsonl exists → desktop. Jump to Desktop backend forensics.
Otherwise (a /tmp/${SESSION}.cast exists, no transcript) → CLI. Walk the CLI taxonomy below.

When to use

Validator reported FAILED (missing must_contain or present must_not_contain)
Rig hung > 3 minutes with no agent-done sentinel
[validate] PASSED but the rendered GIF is empty or wrong
User says "the recording broke" / "my rig output is weird"

Inputs to gather

Ask the user for (or derive from context):

Session name (or cast path → derive ${SESSION} from filename)
Path to the spec JSON that was used

Then read:

/tmp/${SESSION}.cast — the recording itself
/tmp/${SESSION}.hooks.json — what hooks were installed
/tmp/${SESSION}.* — all sentinels (which fired, mtimes, contents)
Spec file — what was supposed to happen

Failure-mode taxonomy

Walk this list. The first match is usually the answer.

"FAILED: missing required" — `must_contain` not satisfied

Run node on the cast to extract the stripped output (use the cleanCast function from bin/validate.mjs). Check whether the missing string appears at all (could be a typo) or appears in modified form (ANSI/cursor-overwrite mangling).
Check whether the missing string is only in the prompt echo. If the agent's reply didn't actually contain it, that's a model behaviour issue — adjust the prompt.
If the assertion is uniquely an agent-output string (e.g. RESULT=42) and absent: did the agent's turn complete? Check for turn-end sentinel mtime versus the cast end-time.

"FAILED: forbidden present" — `must_not_contain` matched

Likely step_aborted / failure_reason from claude — the agent crashed mid-turn. Inspect the cast for the surrounding context.
Could be cursor-overwrite ghost text (the validator strips ANSI but doesn't replay cursor positioning). Document the limitation; if real, switch the assertion to must_contain_in_order with a uniquely-emitted positive marker.

Rig hung; `session-start` sentinel never fired

The consent dialogs blocked claude from starting hooks. The consent-sweep should have caught them, but:

Check if a NEW consent dialog text appeared (claude updates) — the sweep's regex may be stale.
Check if --dangerously-skip-permissions was actually set (agent.bypass_permissions: true in spec).
The user may need to run claude --dangerously-skip-permissions interactively once to accept the legal consent globally.

Rig hung; `session-start` fired but no `prompt-submitted`

The driver pasted but claude didn't ingest it. Common causes:

agent.cwd requires a trust dialog the sweep didn't dismiss. Check tmux capture-pane -t ${SESSION} if the session is still alive.
ATTACH_GAP_SEC too short — driver pasted before asciinema attached. Bump pacing.attach_gap_sec to 8-10.
pre_enter_sec: 0 was specified for a gate — keys lost. The driver now floors at 1, but check the rig version.

Rig hung; `prompt-submitted` fired but no `turn-end`

The agent is stuck mid-turn. Causes:

The model is slow (sonnet on a heavy reasoning prompt) — bump pacing.turn_timeout_sec.
The Stop hook command in the rendered hooks.json failed (rare — it's a trivial touch).
The agent called AskUserQuestion but the spec didn't list it as a gate — the agent is waiting forever. Add a gate entry to the spec.

Companion never produced output (two-pane)

Read /tmp/${SESSION}.companion-env — is it well-formed bash? If line concatenation is visible (e.g. export XYexec ...), there's a rig bug; report it.
Did the PostToolUse hook fire? Check sentinel existence + content. Empty file = jq failed to extract the payload.
The tool_response shape may not match the hook's unwrap pattern. Test the jq manually against a captured tool_response sample.

`must_contain_in_order` failed at an unexpected position

The cleaned cast probably has the strings, but in a different order. Cursor-overwrite or claude printing summaries can reorder visible text. Switch to plain must_contain if order is incidental.

Desktop backend forensics

For a backend: "desktop" run there is no cast — the primary inputs are the bridge transcript, the .mov, and the bridge's per-server log. Gather them all at once:

bin/diagnose-desktop.sh ${SESSION} ${SPEC}    # SPEC optional but enables checkpoint coverage

The helper is read-only and emits four detail sections plus a closing --- summary ---. Read the summary first — it integrates the four sections into one primary: verdict (the most-fundamental failure, via a fixed priority ladder: bridge-never-reached → missing-required-checkpoint → turn-never-closed → no-capture → capture-gap → instruction-drift → soft-miss-note → healthy) and a next: fix pointer. Lead your report with it, then expand using the cited detail section below.

(a) checkpoint coverage

MISSING REQUIRED: <name> — the model never called rig_checkpoint for a required checkpoint. The validator FAILs and no GIF renders. This is a model-cooperation failure, not a rig bug: strengthen the system_prompt_prologue wording (see examples/desktop-chat.json — a plain "please use these tools" framing; a "you are being recorded" framing triggers injection-refusal) and re-record.
turn_end: ABSENT — the turn never closed and record.sh did not synthesize a fallback (the transcript was empty at timeout → bridge never reached; see (d)).
turn_end: present (synthesized) — a soft miss: the model produced output but skipped rig_turn_end, so record.sh synthesized the fallback (the run still produced a GIF). One soft miss is tolerable; a trend is not — see (b).
turn_end: present (genuine) — the model closed the turn itself (healthy).

(b) soft-miss trend

rate: N% over last M desktop runs. Above 20% (with ≥3 samples) the helper prints an instruction-drift verdict: the prologue is not reliably steering the model. Strengthen it, or accept that this surface/Desktop-version combo has hit its determinism floor (RDR-001 Risk "instruction drift"). this-session: shows whether this run was a soft miss.

(c) capture coverage

verdict: NO CAPTURE — no .mov. The ScreenCaptureKit → AVAssetWriter path failed: check Screen Recording permission (System Settings → Privacy & Security) and bin/doctor.sh.
verdict: CAPTURE EMPTY / CAPTURE GAP — the .mov is ~0s or shorter than the tool-call span; the capture stopped before the turn finished (driver crash, early agent-done, or a permission revoked mid-run).
verdict: ok — the .mov brackets the tool-call span (healthy).

(d) bridge-log liveness

status: NOT FOUND — the bridge never connected in the Claude-Rig profile. The most common cause: the .mcpb is installed/enabled in the primary profile, not Claude-Rig (the open *.mcpb → default-handler footgun). Re-install + enable in Claude-Rig (Connectors UI), then re-record.
status: present but the log went quiet long before the session ended → the bridge may have crashed mid-session (001-research-20). The log is a coarse transport-lifecycle signal only, not a turn-end source.

HAR / Playwright-trace forensics do not exist for the desktop backend — there is no CDP transport under the AX pivot. Do not look for them.

Reporting

Lead with the single most-likely root cause, then support it. For a desktop run that means opening with the helper's --- summary --- primary: verdict; for CLI, the first matching taxonomy entry. Then, for the diagnosed failure, state:

Symptom (what the rig produced)
Root cause (which failure mode from the taxonomy)
Fix (specific spec change, env var, or command)
How to verify (re-run command + expected new outcome)

What this skill does NOT do

Apply fixes automatically. Diagnosis is read-only; the user accepts the fix and re-runs.
Diagnose architectural design choices in the spec (route to author-spec for redesigns).
Recover from incomplete artifacts that are already overwritten by a new run.

diagnose

Invocation

Context Preview

SKILL.md

diagnose

Invocation

Context Preview

SKILL.md

diagnose

First: which backend?

When to use

Inputs to gather

Failure-mode taxonomy

"FAILED: missing required" — must_contain not satisfied

"FAILED: forbidden present" — must_not_contain matched

Rig hung; session-start sentinel never fired

Rig hung; session-start fired but no prompt-submitted

Rig hung; prompt-submitted fired but no turn-end

Companion never produced output (two-pane)

must_contain_in_order failed at an unexpected position

Desktop backend forensics

(a) checkpoint coverage

(b) soft-miss trend

(c) capture coverage

(d) bridge-log liveness

Reporting

What this skill does NOT do

Similar Skills

diagnose

First: which backend?

When to use

Inputs to gather

Failure-mode taxonomy

"FAILED: missing required" — must_contain not satisfied

"FAILED: forbidden present" — must_not_contain matched

Rig hung; session-start sentinel never fired

Rig hung; session-start fired but no prompt-submitted

Rig hung; prompt-submitted fired but no turn-end

Companion never produced output (two-pane)

must_contain_in_order failed at an unexpected position

Desktop backend forensics

(a) checkpoint coverage

(b) soft-miss trend

(c) capture coverage

(d) bridge-log liveness

Reporting

What this skill does NOT do

Similar Skills

"FAILED: missing required" — `must_contain` not satisfied

"FAILED: forbidden present" — `must_not_contain` matched

Rig hung; `session-start` sentinel never fired

Rig hung; `session-start` fired but no `prompt-submitted`

Rig hung; `prompt-submitted` fired but no `turn-end`

`must_contain_in_order` failed at an unexpected position

"FAILED: missing required" — `must_contain` not satisfied

"FAILED: forbidden present" — `must_not_contain` matched

Rig hung; `session-start` sentinel never fired

Rig hung; `session-start` fired but no `prompt-submitted`

Rig hung; `prompt-submitted` fired but no `turn-end`

`must_contain_in_order` failed at an unexpected position