Skill

design

Performs adversarial pre-coding planning: host and peer agents independently investigate the codebase, diff their specs, and validate the plan with an execution drill. Use for 'plan this', 'design approach', or scoping before coding.

developer-tools

npx claudepluginhub heliohq/ship --plugin ship

Tool Access

This skill is limited to using the following tools:

Bash, Read, Write, Edit, Glob, Grep, Agent, AskUserQuestion, TodoWrite, mcp__codex__codex, mcp__codex__codex-reply

Supporting Assets

execution-drill.md, independent-investigator.md, write-plan.md, write-spec.md

SKILL.md

Stats

Stars: 50
Forks: 3
Last commit: Apr 20, 2026

Preamble (run first)

SHIP_PLUGIN_ROOT="${SHIP_PLUGIN_ROOT:-$(ship-plugin-root 2>/dev/null || echo "$HOME/.codex/ship")}"
SHIP_SKILL_NAME=design source "${SHIP_PLUGIN_ROOT}/scripts/preflight.sh"

Ship: Design

You ARE the planner. You read the code, investigate, and write both the spec and the plan. You must read the code yourself: delegating investigation loses the context needed to write a good plan. A peer agent investigates independently and produces its own spec for adversarial comparison.

Runtime Resolution

See ../shared/runtime-resolution.md for the host/peer concept and dispatch commands. In /ship:design, the peer plays two roles: investigator (Phase 2) and drill agent (Phase 6). Both use the same dispatch pattern from the shared reference.

Scope Mode

The prompt may specify Scope mode: full (default) or Scope mode: refactor. It controls how much adversarial validation runs:

| Phase | `full` | `refactor` |
|-------|--------|------------|
| 1 Init | ✅ | ✅ |
| 2 Investigate (host + peer) | ✅ | ✅ |
| 3 Write spec | ✅ | ✅ (behavior-contract template) |
| 4 Diff & verify | ✅ | ✅ |
| 5 Write plan | ✅ | ✅ |
| 6 Execution drill | ✅ | ⏭ skipped |

Why refactor mode skips Phase 6: for behavior-preserving changes (refactor, simplify, rename, extract, dedupe), the plan steps are usually small, mechanical code movements. The drill's "is every step implementable" check earns little here while adding a full peer round-trip. Peer investigation and diff stay on because they catch the real refactor failure mode — "moved complexity instead of removing it."

If no scope mode is specified (e.g. standalone /ship:design invocation), default to full.
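The scope-mode rule can be sketched as a small shell helper. This is only an illustration: the function names `resolve_scope_mode` and `should_run_drill` are hypothetical, and treating an unrecognized value the same as a missing one (falling back to `full`) is an assumption, not something the skill specifies.

```bash
# Hypothetical helper: extract the scope mode from the prompt text.
# Prints "refactor" only when the prompt says so; everything else yields "full".
resolve_scope_mode() (
  prompt=$1
  mode=$(printf '%s\n' "$prompt" \
    | sed -n 's/.*Scope mode:[[:space:]]*\([a-z]*\).*/\1/p' \
    | head -n 1)
  case "$mode" in
    refactor) echo refactor ;;
    *)        echo full ;;   # default when missing (assumed also for unknown values)
  esac
)

# Phase 6 (execution drill) runs only in full mode.
should_run_drill() {
  [ "$(resolve_scope_mode "$1")" = "full" ]
}
```

For example, `should_run_drill "Rename helpers. Scope mode: refactor"` returns non-zero, so the drill would be skipped.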

Process Flow

Phase 1  Init            resolve task_id, create .ship/tasks/<id>/plan/
Phase 2  Investigate     dispatch peer (parallel) ─┐
                         you read the code         │
                         ↓                         │
Phase 3  Write spec      write host spec.md  ←─── peer writes peer-spec.md
                         vague? ask user → re-investigate
                         ↓
Phase 4  Diff & verify   compare specs → resolve each divergence
                         disagree? → debate peer (max 2 rounds)
                         still open? → escalate to user
                         critical gap? → re-investigate (max 1 loop)
                         ↓
Phase 5  Write plan      write plan.md with executable tasks
                         self-review against spec
                         ↓
Phase 6  Execution drill dispatch peer (fresh session) to validate plan
                         BLOCKED step?  → escalate
                         UNCLEAR step?  → revise plan (max 1 loop)
                         all CLEAR      → ready for execution

Roles

| Phase | Who | Why |
|-------|-----|-----|
| Investigation (read code, trace paths) | Host + peer (parallel) | Independent investigation catches different blind spots |
| Write spec (host version) | You | Investigation context must not be lost |
| Write spec (peer version) | Peer agent | Independence requires separation |
| Diff & verify divergences | You | You have the context + code access to judge |
| Write plan.md | You | Spec context must flow into plan |
| Execution Drill | Peer agent (fresh session) | Fresh eyes test implementability |

Quality Gates

| Gate | Condition | Fail action |
|------|-----------|-------------|
| Investigation → Spec | All claims trace to file:line you read | Re-investigate |
| Spec → Diff | spec.md has flexible sections scaled to complexity, self-reviewed | Revise |
| Diff → Plan | Zero `escalated` items (resolved by evidence or debate, or user resolved them) | Ask user |
| Plan → Drill | plan.md has TDD tasks, checkbox steps, complete code, no placeholders | Revise |
| Drill → Ready | Zero BLOCKED steps, zero UNCLEAR steps | Revise plan (max 1 loop) |

No artifact passes to the next phase without meeting its gate.

Progress Tracking

Use TodoWrite to track your own progress through the design phases. After Phase 1 (init), create todos that reflect the actual work ahead. Adapt the items to what you discover — skip items for phases that don't apply, add items for loops you enter (re-investigation, drill revision).

Principle: one todo per major phase the user would care about. Update activeForm to reflect what's happening within a phase.

Example (full run with peer available):

TodoWrite([
  { content: "Investigate codebase (host + peer)", status: "in_progress", activeForm: "Investigating codebase" },
  { content: "Write spec",                         status: "pending",     activeForm: "Writing spec" },
  { content: "Diff host vs peer specs",            status: "pending",     activeForm: "Diffing specs" },
  { content: "Write implementation plan",          status: "pending",     activeForm: "Writing implementation plan" },
  { content: "Execution drill",                    status: "pending",     activeForm: "Running execution drill" }
])

Adaptations (not exhaustive — use judgment):

Peer unavailable → drop "Diff" item, rename "Investigate" to reflect self-produced peer spec
Upstream spec already exists → drop "Write spec", start with "Validate existing spec"
Re-investigation needed → re-mark "Investigate" as in_progress
Drill revision needed → keep "Execution drill" as in_progress

Red Flag

Never:

- Cite files you haven't opened
- Let the peer see your spec before producing its own
- Resolve divergences by reasoning instead of code evidence (max 2 debate rounds, both cite file:line)
- Trust prior conversation over disk artifacts
- Mark plan ready when drill has BLOCKED or UNCLEAR items
- Skip the drill because "the plan looks solid"
- Delegate investigation to a sub-agent (read the code yourself)
- Claim "function X is not called" without tracing all callers
- Propose a fix without searching for existing defenses
- Propose to create a file without checking if it already exists
- Change a value without grepping tests that assert the old value
- Write plan.md with vague steps or placeholders (TBD, TODO, "similar to Task N")
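The rule about grepping tests before changing a value might look like the following sketch. The helper name `grep_tests_for` is hypothetical, and the file-name patterns (`*test*`, `*spec*`) are an assumption about how a repository names its tests; adjust them to the project at hand.

```bash
# Hypothetical helper: list test-file lines that mention a literal value.
# Usage: grep_tests_for <value> [dir]
grep_tests_for() {
  # -F treats the value as a fixed string, not a regex.
  # /dev/null forces grep to print file names even when it receives one file.
  # ASSUMPTION: test files contain "test" or "spec" in their names.
  find "${2:-.}" -type f \( -name '*test*' -o -name '*spec*' \) \
    -exec grep -nF -- "$1" /dev/null {} +
}
```

Each output line has the form `path:line:text`, which matches the file:line evidence style used throughout this skill.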

Phase 1: Init

Resolve task_id, create .ship/tasks/<task_id>/plan/ directory.
If resuming, read existing artifacts and determine current state.
Collect branch name and HEAD SHA.

Task ID

If invoked by /ship:auto, the task_id is provided.

If invoked standalone, generate task_id using the shared script:

TASK_ID=$(bash "${SHIP_PLUGIN_ROOT:-$(ship-plugin-root 2>/dev/null || echo "$HOME/.codex/ship")}/scripts/task-id.sh" "<description>")

Artifacts go to .ship/tasks/<task_id>/plan/. The Write tool creates directories automatically — no mkdir needed.

Existing spec.md detection

Check if spec.md already exists with content:

[ -s .ship/tasks/<task_id>/plan/spec.md ] && echo 'SPEC_EXISTS' || echo 'NO_SPEC'

If SPEC_EXISTS:

Read spec.md. This was written by an upstream skill (e.g. refactor).
Check if spec records a HEAD SHA. If it does and it differs from current HEAD, treat spec as stale — proceed as NO_SPEC.
Do not overwrite it. Use it as your investigation input.
Your job narrows: investigate to validate the spec's claims, then produce only plan.md. You may append an ## Investigation section to the existing spec if it lacks one, but preserve all existing sections.
Peer investigation and diff still run — the peer validates the upstream spec independently. Execution drill runs per Scope Mode (always in full, skipped in refactor).

If NO_SPEC: proceed to Phase 2.
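The detection and staleness rules above can be combined into one check. This is a minimal sketch under a loud assumption: the skill does not say how an upstream spec records its HEAD SHA, so the `HEAD SHA: <sha>` line format and the function name `spec_state` are hypothetical. It also assumes the recorded SHA is a full SHA, compared by exact match.

```bash
# Sketch: classify spec.md as SPEC_EXISTS or NO_SPEC; a stale spec counts as NO_SPEC.
# ASSUMPTION: the SHA is recorded on a line like "HEAD SHA: <sha>" (format not specified).
# Usage: spec_state <spec_path> <current_head_sha>
spec_state() (
  spec=$1
  current=$2
  # Missing or empty file: nothing to reuse.
  [ -s "$spec" ] || { echo NO_SPEC; exit 0; }
  recorded=$(sed -n 's/^HEAD SHA:[[:space:]]*\([0-9a-f][0-9a-f]*\).*/\1/p' "$spec" \
    | head -n 1)
  if [ -n "$recorded" ] && [ "$recorded" != "$current" ]; then
    echo NO_SPEC      # recorded SHA differs from current HEAD: treat as stale
  else
    echo SPEC_EXISTS  # no SHA recorded, or SHA matches
  fi
)
```

A typical call would be `spec_state .ship/tasks/<task_id>/plan/spec.md "$(git rev-parse HEAD)"`.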

Phase 2: Investigate (Parallel)

This is the most important phase. Do not rush it.

Step A: Dispatch peer investigation

Kick off the peer investigation before you start investigating. The peer works in parallel while you read code.

Read independent-investigator.md for the dispatch pattern and prompt template. Fill in the task description, task_id, and repo root. Dispatch the resolved peer runtime and save the returned thread or session id as INVESTIGATION_THREAD_ID when the runtime provides one — needed for debate in Phase 4.

When the peer agent is unavailable

If peer dispatch fails, self-produce the second spec:

Run a second-pass review of your spec using only: placeholder scan, contradiction scan, coverage scan, ambiguity scan
Search for code paths, callers, or consumers you did not trace
Write peer-spec.md with any changed conclusions or additions
Add a warning: WARNING: Second spec was self-generated, not independent

Step B: Your investigation

Read write-spec.md for investigation methodology and spec authoring.

Investigate the codebase, then write spec.md. The reference covers investigation strategy (bug fixes, new features, all tasks), vagueness checks, spec structure, and self-review.

If investigation reveals hidden dependencies or cross-cutting concerns not apparent from the task description, note them for the spec.

Phase 3: Write Spec

Covered by write-spec.md — follow the spec writing and self-review guidance there.

Phase 4: Diff & Verify

Read peer-spec.md (written by the peer investigation dispatched in Phase 2). Compare it against your spec.md.

For each divergence point:

1. Identify the divergence: what does your spec say, and what does the peer spec say?
2. Verify against code: read the actual code to determine which is correct. Do NOT resolve by reasoning about which "sounds better."
3. If you still disagree, debate with the peer agent. Continue on the same peer thread or session when possible using that runtime's follow-up mechanism. Present your code evidence and ask the peer to present counter-evidence. If the runtime cannot continue the same session, dispatch a fresh peer session with the prior evidence quoted verbatim. Maximum 2 debate rounds. Both sides must cite file:line references.
4. Assign a disposition after debate:
   - patched → Your spec updated based on evidence. Show the diff.
   - proven-false → The peer claim is wrong. Cite the code evidence.
   - conceded → The peer convinced you with code evidence. Update spec.
   - escalated → 2 debate rounds exhausted, still unresolved. Needs user input.

Record in diff-report.md

Only record divergences and their resolutions. If both specs agree on something, there's nothing to record — move on.

For each divergence, write what happened: what each side claimed, what code evidence was cited during debate, and the final disposition (patched / proven-false / conceded / escalated).
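One way to keep diff-report.md entries consistent is to append them through a helper that rejects unknown dispositions. The entry layout and the function name `record_divergence` below are hypothetical; the skill only requires that each entry capture both claims, the evidence, and the final disposition.

```bash
# Sketch: append one divergence entry to diff-report.md.
# ASSUMPTION: this entry layout is illustrative; the skill does not define one.
# Usage: record_divergence <report> <title> <host_claim> <peer_claim> <evidence> <disposition>
record_divergence() (
  report=$1 title=$2 host=$3 peer=$4 evidence=$5 disposition=$6
  # Accept only the dispositions named in this skill.
  case "$disposition" in
    patched|proven-false|conceded|escalated|user-resolved) ;;
    *) echo "unknown disposition: $disposition" >&2; exit 1 ;;
  esac
  {
    printf '%s\n' "## $title"
    printf '%s\n' "- Host claim: $host"
    printf '%s\n' "- Peer claim: $peer"
    printf '%s\n' "- Evidence: $evidence"
    printf '%s\n' "- Disposition: $disposition"
    printf '\n'
  } >> "$report"
)
```

Validating the disposition up front means a typo fails loudly instead of producing an entry that a later gate check would silently miss.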

After diff resolution:

1. Update spec.md with all patched and conceded items.
2. If any escalated items exist:
   - Standalone mode: ask user via AskUserQuestion before proceeding. Record the user's ruling in diff-report.md with disposition user-resolved and what they decided. Update spec.md accordingly.
   - /ship:auto mode: do NOT ask user. Treat escalated items as BLOCKED and return. Auto owns the only user-approval gate.
3. If the diff reveals a critical investigation gap (e.g., the peer found important code you missed entirely), go back to Phase 2 for targeted re-investigation. Maximum 1 re-investigation loop.
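The escalation rule differs by invocation mode, which a short sketch makes explicit. It assumes each entry in diff-report.md carries a line of the form `- Disposition: <value>`; that format, the function name `diff_gate`, and the output tokens are all hypothetical.

```bash
# Sketch: decide the next action after diff resolution.
# ASSUMPTION: each report entry has a line "- Disposition: <value>".
# Usage: diff_gate <report> <standalone|auto>
diff_gate() (
  report=$1
  mode=$2
  # grep -c exits non-zero on zero matches, so guard it with "|| true".
  escalated=$(grep -c '^- Disposition: escalated$' "$report" 2>/dev/null || true)
  if [ "${escalated:-0}" -eq 0 ]; then
    echo PROCEED     # Diff -> Plan gate passes
  elif [ "$mode" = "auto" ]; then
    echo BLOCKED     # /ship:auto owns the only user-approval gate
  else
    echo ASK_USER    # standalone: ask via AskUserQuestion, then record user-resolved
  fi
)
```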

Phase 5: Write Plan

Read write-plan.md for plan structure, task granularity, no-placeholder rules, and self-review.

Translate the validated spec.md into an executable plan.md. The reference covers the plan template, bite-sized steps, code completeness guidance, and the self-review checklist.
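As part of self-review, a quick scan for placeholder text can catch the most obvious violations of the no-placeholder rule. The patterns below are the three this skill names (TBD, TODO, "similar to Task N"); the function name `scan_placeholders` is hypothetical, and a clean scan does not prove the steps are specific enough.

```bash
# Sketch: flag placeholder text in plan.md before the Plan -> Drill gate.
# Usage: scan_placeholders <plan_path>
# Prints matching lines as "line:text". Returns 0 if clean, 1 if placeholders
# were found, 2 if the plan file is missing or empty.
scan_placeholders() {
  [ -s "$1" ] || return 2
  if grep -nE 'TBD|TODO|[Ss]imilar to Task [0-9]+' "$1"; then
    return 1
  fi
  return 0
}
```

The match is case-sensitive for TBD and TODO, so mentions of the TodoWrite tool inside a plan are not flagged.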

Phase 6: Execution Drill

Skip this phase when Scope mode: refactor. Record in the report card that the drill was skipped due to refactor scope, then finish. Behavior-preserving plans rarely contain ambiguous steps, and the peer round-trip doesn't earn its cost here.

For full scope (default), run the drill as below.

The final gate. Give the plan to the peer agent and ask it to validate that every step is implementable.

Read execution-drill.md for the dispatch pattern, role, and prompt template. Use a new peer session, not the investigation thread. Save the returned thread or session id as DRILL_THREAD_ID when the runtime provides one — needed for revision reruns.

When the peer agent is unavailable

If peer dispatch fails, dispatch a fresh fallback Agent to perform the drill instead. The Agent gets the same prompt from execution-drill.md — it reads spec.md and plan.md with no prior context, providing the best available independent review. Add a warning: WARNING: Drill was fallback-Agent-performed, not peer-agent

After the drill:

- All CLEAR → Plan is ready for execution.
- UNCLEAR items → Revise plan.md to make each step unambiguous, then re-run ONLY the unclear steps (maximum 1 revision loop):
  - If the peer runtime supports continuation, continue on DRILL_THREAD_ID with: "Tasks N, M were revised. Re-read plan.md and re-evaluate ONLY those tasks using the same criteria. Report CLEAR/UNCLEAR/BLOCKED."
  - Otherwise re-dispatch the peer agent with the same execution-drill.md prompt scoped to the revised tasks only.
  - If peer dispatch is unavailable, use the same fallback-Agent pattern.
- BLOCKED items → If resolvable by investigation, investigate and fix. If not, escalate to user or mark plan as blocked.
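The post-drill decision can be reduced to a single pass over the verdicts. This sketch assumes the drill output contains one verdict per line with the word CLEAR, UNCLEAR, or BLOCKED in it; the real output format lives in execution-drill.md, which is not reproduced here. Giving BLOCKED precedence over UNCLEAR is also an assumption, and the function name `drill_outcome` is hypothetical.

```bash
# Sketch: reduce per-step drill verdicts (read from stdin) to one next action.
# ASSUMPTION: one verdict per line, containing CLEAR, UNCLEAR, or BLOCKED.
# Usage: drill_outcome < verdicts.txt
drill_outcome() {
  awk '
    /BLOCKED/ { blocked++; next }
    /UNCLEAR/ { unclear++; next }   # checked before CLEAR: "UNCLEAR" contains "CLEAR"
    /CLEAR/   { clear++ }
    END {
      if (blocked)      print "BLOCKED"      # investigate and fix, or escalate
      else if (unclear) print "REVISE"       # revise plan.md, re-run unclear steps
      else if (clear)   print "READY"        # all CLEAR
      else              print "NO_VERDICTS"  # nothing parsed: do not mark ready
    }'
}
```

The `NO_VERDICTS` case exists so that an empty or unparseable drill result is never mistaken for an all-CLEAR result.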

Artifacts

.ship/tasks/<task_id>/
  plan/
    spec.md          — final merged spec (flexible sections, brainstorming style)
    peer-spec.md     — peer agent's independent spec
    plan.md          — how to build it (TDD tasks, writing-plans style)
    diff-report.md   — host spec vs peer spec divergences and resolutions

Error Handling

| Error | Action |
|-------|--------|
| Peer agent unavailable | Self-produce second spec + fallback drill with warning |
| Peer output unparseable | Retry once with format reminder, then use the fallback Agent |
| Timeout | Abort, preserve artifacts, summarize honestly |
| Re-investigation needed | Maximum 1 loop back to Phase 2 |
| Drill revision needed | Maximum 1 revision loop |

Completion

Only stop for

Task too vague to plan → ask user via AskUserQuestion
Execution drill blockers that require user input → blocked
Timeout → preserve artifacts, summarize honestly

Never stop for

Peer unavailable (self-produce second spec with warning)
Peer output parse failure (retry once, then fallback Agent)

Execution Handoff

Verify spec.md and plan.md are non-empty on disk, then output the report card (read skills/shared/report-card.md for the standard format):

## [Design] Report Card

| Field | Value |
|-------|-------|
| Status | DONE |
| Summary | <task title> — <N> stories planned |

### Metrics
| Metric | Value |
|--------|-------|
| Scope mode | <full | refactor> |
| Files traced | <N> |
| Divergences resolved | <N> (<M> by evidence, <K> by debate) |
| Drill steps CLEAR | <N>/<total>  (or `skipped — refactor scope`) |
| Stories | <N> |

### Artifacts
| File | Purpose |
|------|---------|
| .ship/tasks/<task_id>/plan/spec.md | Merged spec |
| .ship/tasks/<task_id>/plan/peer-spec.md | Peer spec |
| .ship/tasks/<task_id>/plan/plan.md | Executable plan |
| .ship/tasks/<task_id>/plan/diff-report.md | Divergence resolutions |

### Next Steps
1. **Full pipeline (recommended)** — /ship:auto to implement, review, QA, and ship
2. **Implement only** — /ship:dev to execute this plan
3. **Review the plan** — read the artifacts and give feedback

Blocked (both modes)

[Design] BLOCKED
REASON: <what failed and why>
ATTEMPTED: <what was tried>
UNRESOLVED: <escalated items from diff or drill>
RECOMMENDATION: <what the user should do next>