/execute

Arguments: $ARGUMENTS

Goal: Execute implementation with full context awareness. Reads all JSON configs, parses tasks, maps to agents, then gives YOU control over execution method (manual, Copilot handoff, or agent spawning).

Modes

Spec-based:

I001, F001 - Execute single infrastructure/feature spec
A001, W001 - Execute single application/website page
phase-0/001-auth - Execute by path

Phase-based:

--phase-0, --phase-1 - Execute all items in that phase
--infrastructure - Execute all infrastructure phases (0 → 1 → 2)
--features - Execute all features
--application - Execute all Next.js application pages
--website - Execute all Astro marketing/content pages

Notes:

Reads all project context (JSON files, tasks.md)
Maps tasks to suggested agents
Gives YOU choice: manual execution, Copilot handoff, or agent spawning per layer
Only spawns agents if you explicitly choose that option

Execution

Phase 1: Parse Arguments and Determine Mode

Goal: Understand scope and execution strategy

Actions:

Create todo list for tracking
Parse $ARGUMENTS:
- I001, F001 → MODE = "single-spec"
- A001, W001 → MODE = "single-page"
- phase-0/XXX path → MODE = "single-spec" (extract spec ID)
- --phase-0, --phase-1, --phase-2 → MODE = "phase"
- --infrastructure → MODE = "all-infrastructure"
- --features → MODE = "all-features"
- --application → MODE = "all-application"
- --website → MODE = "all-website"
- --all → MODE = "full"
- Empty → MODE = "auto-continue"
Display: "Mode: [MODE]"
Note: User will choose execution method per layer (manual/Copilot/agents)

Phase 2: Load Context and Initialize Tracking

Goal: Read project configuration and initialize single execution log

Actions:

Read project context:
- !{cat .claude/project.json 2>/dev/null || echo "{}"}
- !{cat features.json 2>/dev/null || echo "{}"}
- !{cat application-design.json 2>/dev/null || echo "{}"}
- !{cat website-design.json 2>/dev/null || echo "{}"}
Initialize SINGLE execution log file:
- !{mkdir -p .claude/execution}
- If .claude/execution/execution.json exists, read it
- Otherwise create new structure: !{echo '{"session":"'$(date -Iseconds)'","specs":{},"overall":{"total_specs":0,"completed":0,"in_progress":0,"pending":0}}' > .claude/execution/execution.json}
Identify target specs based on MODE:
- single-spec: Find the specific spec directory
- single-page: Find the specific page directory (application or website)
- phase: Find all specs in that phase directory
- all-infrastructure: List all specs in specs/infrastructure/
- all-features: List all specs in specs/features/
- all-application: List all pages in specs/application/
- all-website: List all pages in specs/website/
For each target spec:
- Check current status from execution.json
- Skip if already completed
- Validate dependencies are met
Display: "Found X specs to execute"

Phase 3: Load Tasks

Goal: Read and parse task lists from specs

Actions:

For each target spec:
- Find spec directory: !{find specs -type d -name "$SPEC_ID" | head -1}
- Read tasks.md: @$SPEC_DIR/tasks.md
- Parse tasks - identify:
  - Task descriptions
  - Checkbox status ([ ] vs [x])
  - Layer/dependency info (L0, L1, L2, L3)
  - Domain hints (clerk, supabase, redis, etc.)
- Count: total, completed, remaining
- Update execution.json with spec entry: !{jq '.specs["'$SPEC_ID'"] = {"status":"pending","tasks_total":'$TOTAL',"tasks_completed":'$DONE',"remaining":'$REMAINING'}' .claude/execution/execution.json > tmp && mv tmp .claude/execution/execution.json}
Display task summary for each spec

Phase 4: Map Tasks to Agents

Goal: Determine which agents to spawn for remaining tasks

Actions:

Analyze remaining tasks for domain keywords:
- "clerk" / "auth" / "oauth" → clerk:* agents
- "supabase" / "database" / "schema" → supabase:* agents
- "component" / "UI" / "frontend" → nextjs-frontend:* agents
- "endpoint" / "API" / "backend" → fastapi-backend:* agents
- "redis" / "cache" → redis:* agents
- "rag" / "vector" / "embedding" → rag-pipeline:* agents
- "sentry" / "error" / "monitoring" → deployment:observability-integrator
- "dashboard" / "settings" / "profile" → nextjs-frontend:page-generator-agent (App pages)
- "landing" / "pricing" / "about" / "blog" → website-builder:website-content (Marketing pages)
Group tasks by domain and layer
Create agent mapping and store in execution.json: !{jq '.specs["'$SPEC_ID'"].agents_mapped = ["agent1", "agent2"]' .claude/execution/execution.json > tmp && mv tmp .claude/execution/execution.json}
If DOMAIN_HINT provided, prioritize those agents

Phase 5: Present Execution Plan and Get User Decision

Goal: Show what needs to be done and let user choose execution method

Actions:

Update spec status to in_progress: !{jq '.specs["'$SPEC_ID'"].status = "in_progress" | .specs["'$SPEC_ID'"].started_at = "'$(date -Iseconds)'"' .claude/execution/execution.json > tmp && mv tmp .claude/execution/execution.json}
For each LAYER (L0 → L1 → L2 → L3):
- Get all tasks in this layer
- Get mapped agents for these tasks
- DISPLAY EXECUTION PLAN:
  - Show layer number and task count
  - List each task with description
  - Show which agents were mapped (if any)
  - Display estimated complexity (simple/medium/complex)
- ASK USER VIA AskUserQuestion: Question: "How should I execute Layer [N] ([X] tasks)?" Options: A) "I'll do it manually (Claude uses Bash/Read/Write/Edit tools)" B) "Use Copilot Chat (I'll hand off the task list)" C) "Spawn suggested agents (costs tokens, fully automated)" D) "Skip this layer for now"
- EXECUTE based on user choice:
  - Choice A: I execute each task using available tools, update checkboxes as I go
  - Choice B: Display formatted task list for copy/paste to Copilot, mark layer as "delegated"
  - Choice C: Spawn mapped agents in parallel, wait for completion, validate results
  - Choice D: Skip to next layer, mark these tasks as pending
- UPDATE tasks.md and execution.json after execution:
  - Update checkboxes for completed tasks: - [ ] → - [x]
  - Update execution.json with progress
- Display layer completion status
- Proceed to next layer

Phase 6: Final Validation

Goal: Verify all tasks completed before marking spec done

Actions:

Re-read tasks.md
Count checkboxes:
- CHECKED = count of - [x]
- UNCHECKED = count of - [ ]
If UNCHECKED > 0:
- Display: "Warning: $UNCHECKED tasks not completed"
- List uncompleted tasks
- Update execution.json: status = "partial"
- DO NOT mark spec as completed
If UNCHECKED = 0:
- All tasks verified complete
- Update execution.json: status = "completed", completed_at = timestamp
Update overall counts in execution.json: !{jq '.overall.completed = ([.specs[] | select(.status == "completed")] | length) | .overall.in_progress = ([.specs[] | select(.status == "in_progress")] | length) | .overall.pending = ([.specs[] | select(.status == "pending")] | length) | .overall.total_specs = (.specs | length)' .claude/execution/execution.json > tmp && mv tmp .claude/execution/execution.json}

Phase 7: Update Project Status

Goal: Update project.json and features.json

Actions:

If status = "completed":
- Update infrastructure item in project.json
- Or update feature in features.json
- Record completion timestamp
If status = "partial":
- Update status to "in_progress"
- Note remaining tasks
Commit changes:
- !{git add .claude/execution/execution.json $SPEC_DIR/tasks.md}
- !{git commit -m "implementation: Update progress - $SPEC_ID"}

Phase 8: Summary

Goal: Display comprehensive results from single execution.json

Actions:

Read execution.json and display summary: !{cat .claude/execution/execution.json | jq '.'}
Display per-spec status table
Show overall totals from execution.json
If status = "completed", suggest next steps:
- /quality:validate-code $SPEC_ID
- /testing:test $SPEC_ID
- /deployment:prepare (if all infrastructure done)
If status = "partial", suggest:
- Re-run: /implementation:execute $SPEC_ID
- Run: /implementation:check-tasks to see remaining
Mark todo complete

Important Notes

Single Execution Log: All progress tracked in ONE file: .claude/execution/execution.json This file contains all specs, their status, task counts, and timestamps. Never create per-spec JSON files.

User-Controlled Execution: For each layer, user chooses: manual (Claude tools), Copilot handoff, or agent spawning. Only spawn agents if user explicitly selects that option for a given layer.

Task Checkbox Tracking: After execution (any method), update tasks.md checkboxes and execution.json. Never mark a spec complete without verifying all checkboxes are checked.

Flexibility: User maintains full control over execution method and token budget. Can mix methods: Layer 0 manual, Layer 1 Copilot, Layer 2 agents, etc.

Modes

Execution

Phase 1: Parse Arguments and Determine Mode

Phase 2: Load Context and Initialize Tracking

Phase 3: Load Tasks

Phase 4: Map Tasks to Agents

Phase 5: Present Execution Plan and Get User Decision

Phase 6: Final Validation

Phase 7: Update Project Status

Phase 8: Summary

Important Notes

Other plugins with /execute