Skill

codex-handoff

Use when interpreting state mutations during plan/review/verdict transitions — explains exactly which fields the bridge writes when, so the implementer can debug 'why is current_task suddenly T3' or 'why is base_sha old'.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/autopilot-codex:codex-handoff

User invocable

Model invocable

Inline context

Default effort

When to use

Debugging unexpected state.json values | wondering why current_task advanced | checking what last_review/last_verdict mean | inspecting bridge artefacts

Tool Access

This skill is limited to the following tools:

Bash(autopilot-codex:*)ReadGlob

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

The plugin has two halves: a Node bridge (`scripts/codex-bridge.mjs`)

SKILL.md

128 lines · ~1.2k tokens

Stats

LanguagePython

Stars0

MaintenanceExcellent

Last CommitMay 6, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

AutoPilot Codex — handoff between Claude and Codex

The plugin has two halves: a Node bridge (scripts/codex-bridge.mjs) that calls Codex and a small set of hooks that drive Claude. They synchronize through one file: ${CLAUDE_PLUGIN_DATA}/state.json. This skill documents exactly what each half writes when, so you can debug "why does state look like this".

Bridge writes (all in `cmdPlan` / `cmdReview` / `cmdVerdict`)

`cmdPlan` (called by `/codex-plan`, `/autopilot`, codex-manager subagent)

Resets the entire run:

state.goal                ← user's goal verbatim
state.plan_path           ← docs/autopilot/plans/<date>-<slug>.md
state.plan                ← full plan JSON from Codex
state.conversation_id     ← Codex thread id (or recovered from rollout)
state.current_task        ← plan.tasks[0]
state.base_sha            ← HEAD of cwd at plan time
state.iteration           ← 0
state.no_progress_streak  ← 0
state.status              ← "running"
state.budget_used_usd     ← 0    (resolveBudgetUsd reset)
state.budget_usd          ← from --budget / AUTOPILOT_BUDGET_USD / 5.00
state.token_usage.plan    ← { input, cached, output, calls: 1 }

`cmdReview` (called by post-tool-use hook, `/codex-review`, codex-reviewer subagent)

Records review of current diff against current task acceptance:

state.last_review.{task_id, decision, summary, issues_count,
                   base_sha, head_sha, timestamp, review_path}
state.token_usage.review.{input, cached, output, calls++}
state.budget_used_usd     ← += chargeBudget(MODEL, usage)

It does NOT change current_task, base_sha, status, iteration.

`cmdVerdict` (called by stop hook, drives the loop)

Advances state per Codex's decision:

state.iteration           ← ++
state.last_verdict.{decision, reason, timestamp, verdict_path,
                    progressed}
state.token_usage.verdict.{input, cached, output, calls++}
state.budget_used_usd     ← += chargeBudget(MODEL, usage)

# decision-specific:
if decision == "done":
    state.status                ← "done"
    state.no_progress_streak    ← 0

if decision == "abort":
    state.status                ← "aborted"
    state.stop_reason           ← verdict.reason
    state.no_progress_streak    ← 0

if decision == "continue":
    state.current_task          ← verdict.next_task
    state.base_sha              ← HEAD at verdict time
    if progressed:
        state.no_progress_streak ← 0
    else:
        state.no_progress_streak ← ++
        if streak >= threshold:  status = "aborted" (forced)
    if iteration >= max_iterations:
        state.status            ← "stopped_iteration_limit"

Hook writes (PostToolUse, Stop, UserPromptSubmit)

Hooks themselves do not write state. They CALL the bridge (bridge review from PostToolUse, bridge verdict from Stop) and forward its output back into Claude's context. The exception is budget-guard.mjs check, which flips status to stopped_budget when over cap.

Forward-migration on read

Every loadState() call runs migrateBudgetFields(state, env) on the parsed object. A pre-budget state.json file gains budget_used_usd: 0 and budget_usd: <env or 5.00> automatically without disturbing other fields. This is idempotent — re-reads are safe.

Why `current_task` ≠ what you're implementing

The bridge advances current_task only on verdict (one call per turn). If you implemented T1 and T2 inside the same turn (e.g. dogfood on the plugin itself), current_task is still T1 when you finish — the next verdict will see "iteration 0 → 1" but no commits since base_sha relative to T1, treat it as 'no progress' or auto-advance, depending on how Codex weighs the diff.

The honest fix when you've drifted: do one commit per task, end your turn, let the Stop hook run verdict cleanly. That's the path the loop was built for.

Debugging cheat sheet

# What's the bridge thinking right now?
node ${CLAUDE_PLUGIN_ROOT}/scripts/codex-bridge.mjs state | jq '{
  status, goal, current_task: .current_task.id,
  base_sha, iteration, budget_used_usd, budget_usd
}'

# Last review JSON in full
ls -t ${CLAUDE_PLUGIN_DATA}/review-*.json | head -1 | xargs cat | jq

# Last verdict JSON in full
ls -t ${CLAUDE_PLUGIN_DATA}/verdict-*.json | head -1 | xargs cat | jq

# Reset to idle without losing token_usage
node ${CLAUDE_PLUGIN_ROOT}/scripts/codex-bridge.mjs stop --reason "manual reset"

codex-handoff

Invocation

Tool Access

Context Preview

SKILL.md

codex-handoff

Invocation

Tool Access

Context Preview

SKILL.md

AutoPilot Codex — handoff between Claude and Codex

Bridge writes (all in `cmdPlan` / `cmdReview` / `cmdVerdict`)

`cmdPlan` (called by `/codex-plan`, `/autopilot`, codex-manager subagent)

`cmdReview` (called by post-tool-use hook, `/codex-review`, codex-reviewer subagent)

`cmdVerdict` (called by stop hook, drives the loop)

Hook writes (PostToolUse, Stop, UserPromptSubmit)

Forward-migration on read

Why `current_task` ≠ what you're implementing

Debugging cheat sheet

Similar Skills

AutoPilot Codex — handoff between Claude and Codex

Bridge writes (all in `cmdPlan` / `cmdReview` / `cmdVerdict`)

`cmdPlan` (called by `/codex-plan`, `/autopilot`, codex-manager subagent)

`cmdReview` (called by post-tool-use hook, `/codex-review`, codex-reviewer subagent)

`cmdVerdict` (called by stop hook, drives the loop)

Hook writes (PostToolUse, Stop, UserPromptSubmit)

Forward-migration on read

Why `current_task` ≠ what you're implementing

Debugging cheat sheet

Similar Skills

codex-handoff

Invocation

Tool Access

Context Preview

SKILL.md

codex-handoff

Invocation

Tool Access

Context Preview

SKILL.md

AutoPilot Codex — handoff between Claude and Codex

Bridge writes (all in cmdPlan / cmdReview / cmdVerdict)

cmdPlan (called by /codex-plan, /autopilot, codex-manager subagent)

cmdReview (called by post-tool-use hook, /codex-review, codex-reviewer subagent)

cmdVerdict (called by stop hook, drives the loop)

Hook writes (PostToolUse, Stop, UserPromptSubmit)

Forward-migration on read

Why current_task ≠ what you're implementing

Debugging cheat sheet

Similar Skills

AutoPilot Codex — handoff between Claude and Codex

Bridge writes (all in cmdPlan / cmdReview / cmdVerdict)

cmdPlan (called by /codex-plan, /autopilot, codex-manager subagent)

cmdReview (called by post-tool-use hook, /codex-review, codex-reviewer subagent)

cmdVerdict (called by stop hook, drives the loop)

Hook writes (PostToolUse, Stop, UserPromptSubmit)

Forward-migration on read

Why current_task ≠ what you're implementing

Debugging cheat sheet

Similar Skills

Bridge writes (all in `cmdPlan` / `cmdReview` / `cmdVerdict`)

`cmdPlan` (called by `/codex-plan`, `/autopilot`, codex-manager subagent)

`cmdReview` (called by post-tool-use hook, `/codex-review`, codex-reviewer subagent)

`cmdVerdict` (called by stop hook, drives the loop)

Why `current_task` ≠ what you're implementing

Bridge writes (all in `cmdPlan` / `cmdReview` / `cmdVerdict`)

`cmdPlan` (called by `/codex-plan`, `/autopilot`, codex-manager subagent)

`cmdReview` (called by post-tool-use hook, `/codex-review`, codex-reviewer subagent)

`cmdVerdict` (called by stop hook, drives the loop)

Why `current_task` ≠ what you're implementing