Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
By evgenygurin
Codex-as-Manager autopilot for Claude Code: OpenAI Codex (gpt-5.4/5.5) decomposes goals, reviews diffs, and directs the loop; Claude Code implements. Works with ChatGPT subscription via codex login (no OPENAI_API_KEY required).
npx claudepluginhub evgenygurin/autopilot-codex --plugin autopilot-codexRun the full Codex-as-Manager autopilot — Codex plans, you implement, Codex reviews each diff, the Stop hook drives the loop until done. Safety rails on Bash; iteration cap on the Stop loop. Confirm before risky actions.
Show the current task contract from state.json without spawning a Codex turn. Read-only — useful when you need a quick reminder of what you should be implementing.
Ask OpenAI Codex (manager / architect) to decompose a goal into a structured plan with ordered ~5-30min tasks. Saves a plan markdown to docs/autopilot/plans/. Read-only — no code changes.
Manually trigger Codex review of the current working tree against the active task's acceptance contract. Useful when the autopilot is paused, when re-reviewing a specific task, or when debugging review behavior. Read-only.
Show the current AutoPilot Codex run state — goal, plan path, current task, base SHA, last review, iteration count.
USE PROACTIVELY whenever the user states a fresh engineering goal in this repo and AutoPilot Codex is enabled. Consults Codex (the manager) via the bridge, persists a structured plan, and hands the FIRST task back to the main thread. Read-only — never writes code itself.
Use ONCE for a focused architectural question (data-model trade-off, library choice, API shape, failure-mode analysis) where you want Codex's opinion without committing to a full plan. Returns Codex's plain-text answer; does NOT mutate plugin state. Read-only.
Use AFTER any non-trivial Edit/Write/MultiEdit during an active autopilot run to get a Codex verdict on the change. Reads the current task from state.json, computes the diff vs base_sha, and asks Codex (the manager) whether the diff meets the task's acceptance criteria. Returns the verdict JSON verbatim. Read-only — never edits files.
Use when running an active autopilot-codex session — explains the plan→implement→review→verdict cycle, state machine, and termination conditions. Required reading when status='running' and the implementer needs to act inside the loop.
Use when interpreting state mutations during plan/review/verdict transitions — explains exactly which fields the bridge writes when, so the implementer can debug 'why is current_task suddenly T3' or 'why is base_sha old'.
Use when deciding whether a question warrants spawning a Codex turn or you can answer locally — covers the cost/value matrix and which subagent or command is the right entry point for each kind of consultation.
Use whenever you're about to run a destructive Bash command, push code, or spend Codex tokens — explains every cap and denylist this plugin enforces and how to extend them.
Use whenever the user invokes /autopilot-codex:* in this repo, OR mentions "autopilot", "Codex manager", "Codex architect", or asks Codex to plan/review for them. Establishes the role split — OpenAI Codex is manager/architect, Claude Code is implementer — and the operating contract.
Admin access level
Server config contains admin-level keywords
Executes bash commands
Hook triggers when Bash tool is used
Share bugs, ideas, or general feedback.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Memory compression system for Claude Code - persist context across sessions
Multi-model consensus engine integrating OpenAI Codex CLI, Gemini CLI, and Claude CLI for collaborative code review and problem-solving.
Standalone image generation plugin using Nano Banana MCP server. Generates and edits images, icons, diagrams, patterns, and visual assets via Gemini image models. No Gemini CLI dependency required.
Ultra-compressed communication mode. Cuts ~75% of tokens while keeping full technical accuracy by speaking like a caveman.
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Write feature specs, plan roadmaps, and synthesize user research faster. Keep stakeholders updated and stay ahead of the competitive landscape.
Intelligent project memory management for Claude Code using Mem0. Automatically captures and persists decisions, patterns, and context across sessions.
Bridge to Codegen AI agent platform — delegate tasks, monitor agents, review output, debug failures, and manage PRs via cloud agents
Risk-first crypto market research — 16 MCP tools (market data, analytics, catalog + observability), 4 resources + 1 URI template, 2 prompts, Supabase-backed cache, realtime cache invalidation, provenance tracking, and auditable reasoning.
Intelligent prompt optimization with R2R semantic search integration. Enriches vague prompts using knowledge base search, codebase analysis, and research-based clarifying questions
DJ techno music library management — 20 MCP dispatchers (6 entity CRUD + 3 provider + 2 compute + sync + 2 admin + 6 UI Prefab Apps) + 27 resources + 6 prompts + 6 handlers. FastMCP v3 bounded-contexts architecture.
Modifies files
Hook triggers on file write and edit operations
Modifies files
Hook triggers on file write and edit operations
Uses power tools
Uses Bash, Write, or Edit tools
Uses power tools
Uses Bash, Write, or Edit tools
No model invocation
Executes directly as bash, bypassing the AI model
No model invocation
Executes directly as bash, bypassing the AI model
Share bugs, ideas, or general feedback.
A Claude Code plugin where OpenAI Codex (gpt-5.4 / gpt-5.5) plays senior engineering manager, while Claude Code is the implementer. Codex decomposes the goal, writes acceptance criteria, reviews every diff, and decides "next" / "done"; Claude reads, writes, and runs tests.
Status: v0.1.x — full loop. Plan, automatic per-edit review, and a Stop-hook driven autopilot loop that keeps going until Codex says done. Bash safety rails block a small list of irreversible commands.
npm i -g @openai/codex
# or
brew install --cask codex
codex login — OAuth flow.
This plugin does not require OPENAI_API_KEY.printenv OPENAI_API_KEY | codex login --with-api-key.bash.exe + cygpath).
The hooks ship with a polyglot run-hook.cmd wrapper so the same
.sh files run on macOS / Linux / Windows. Default Git path
C:\Program Files\Git\bin\bash.exe — edit hooks/run-hook.cmd
if Git is installed elsewhere.Verify:
codex --version
codex login status # "Logged in using ChatGPT" or similar
node --version
git clone https://github.com/egurin/autopilot-codex
cd autopilot-codex
npm test # 185 tests, runs in ~2s
claude plugin validate . # manifest + frontmatter check
The top-level package.json is for the test harness only; the plugin
itself is consumed via Claude Code, not npm.
Local development:
git clone https://github.com/egurin/autopilot-codex
claude --plugin-dir ./autopilot-codex
> /reload-plugins
Marketplace (once published):
> /plugin marketplace add egurin/autopilot-codex
> /plugin install autopilot-codex@autopilot-codex-marketplace
> /autopilot-codex:autopilot Add a /healthz endpoint that returns 200 OK with the build SHA
That single command:
fix or reject, the hook returns
decision:"block" so you address the issues before continuing.done?. If
not, it hands you the next task and blocks Stop, so the loop
continues until done, abort, or the iteration cap (default 30).| Command | What it does |
|---|---|
/autopilot-codex:autopilot <goal> | Full loop: plan + run until done. |
/autopilot-codex:codex-plan <goal> | Plan only. Saves a markdown plan; no implementation. |
/autopilot-codex:codex-review [T<n>] | Manually review the current (or specified) task. |
/autopilot-codex:codex-status | Show goal, current task, base SHA, last verdict, iteration. |
/autopilot-codex:codex-next | Show the current task contract from state.json — no Codex turn, no token spend. Quick reminder of what to implement. |
/autopilot-codex:codex-stop [reason] | Disable the autopilot. Hooks become no-ops until codex-plan runs again. |