htmlgraph: cost-first delegation patterns and decision frameworks for multi-AI coordination.

Install: `npx claudepluginhub shakestzd/htmlgraph` (uses the workspace's default tool permissions).
Use this skill for delegation patterns and decision frameworks in orchestrator mode.
Trigger keywords: orchestrator, delegation, subagent, task coordination, parallel execution, cost-first, spawner
Delegate tactical work to specialized subagents while you focus on strategic decisions. Save Claude Code context (expensive) by using FREE/CHEAP AIs for appropriate tasks.
Basic pattern:
```python
Task(
    subagent_type="gemini",  # FREE - use for exploration
    description="Find auth patterns",
    prompt="Search codebase for authentication patterns...",
)
```
When to use: ALWAYS use for complex tasks requiring research, code generation, git operations, or any work that could fail and require retries.
For complete guidance: See sections below or run /multi-ai-orchestration for model selection details.
Claude Code is EXPENSIVE. You MUST delegate to FREE/CHEAP AIs first.
Ask these questions IN ORDER:
1. Can Gemini do this? → Exploration, research, batch ops, file analysis
   → gemini spawner (FREE - 2M tokens/min)
2. Is this code work? → Implementation, fixes, tests, refactoring
   → codex spawner (70% cheaper than Claude)
3. Is this git/GitHub? → Commits, PRs, issues, branches
   → copilot spawner (60% cheaper, GitHub-native)
4. Does this need deep reasoning? → Architecture, complex planning
5. Is this coordination? → Multi-agent work
6. ONLY if above fail → Haiku (fallback)
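The ordered questions above can be sketched as a routing function. This is a minimal illustration of the cost-first order, not part of the skill's API; `route_task` and the category names are assumptions made for the example.

```python
# Illustrative sketch: cheapest-capable-spawner routing, checked in order.
# Categories and the route_task helper are hypothetical names.
ROUTING_ORDER = [
    ("exploration", "gemini"),    # FREE - 2M tokens/min
    ("code", "codex"),            # ~70% cheaper than Claude
    ("git", "copilot"),           # ~60% cheaper, GitHub-native
    ("reasoning", "opus"),        # expensive, reserved for deep reasoning
    ("coordination", "sonnet"),   # multi-agent coordination
]

def route_task(category: str) -> str:
    """Return the cheapest spawner able to handle `category`."""
    for cat, spawner in ROUTING_ORDER:
        if cat == category:
            return spawner
    return "haiku"  # fallback only when no spawner matches
```

The point of the ordering is that cheaper options are always checked first; Haiku is reached only when every spawner question fails.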
| Task | WRONG (Cost) | CORRECT (Cost) | Savings |
|---|---|---|---|
| Search 100 files | Task() ($15-25) | Gemini spawner (FREE) | 100% |
| Generate code | Task() ($10) | Codex spawner ($3) | 70% |
| Git commit | Task() ($5) | Copilot spawner ($2) | 60% |
| Strategic decision | Direct task ($20) | Claude Opus ($50) | Must pay for quality |
WRONG (wastes Claude quota):
- Code implementation → Task(haiku) # USE Codex spawner
- Git commits → Task(haiku) # USE Copilot spawner
- File search → Task(haiku) # USE Gemini spawner (FREE!)
- Research → Task(haiku) # USE Gemini spawner (FREE!)
CORRECT (cost-optimized):
- Code implementation → Codex spawner # Cheap, sandboxed
- Git commits → Copilot spawner # Cheap, GitHub-native
- File search → Gemini spawner # FREE!
- Research → Gemini spawner # FREE!
- Strategic decisions → Claude Opus # Expensive, but needed
- Haiku → FALLBACK ONLY # When spawners fail
Orchestrator (You):
Executor (Subagent):
Why separation matters:
What looks like "one bash call" becomes many:
Context cost comparison:
Direct execution (fails):
bash call 1 → fails
bash call 2 → fails
bash call 3 → fix code
bash call 4 → bash call 1 retry
bash call 5 → bash call 2 retry
= 5+ tool calls, context consumed
Delegation (cascades isolated):
Task(subagent handles all retries) → 1 tool call
Read result → 1 tool call
= 2 tool calls, clean context
Token savings:
Ask yourself these questions:
Will this likely be ONE tool call?
Does this require error handling?
Could this cascade into multiple operations?
Is this strategic or tactical?
Rule of thumb: When in doubt, ALWAYS DELEGATE. Cascading failures are expensive.
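The four questions above reduce to a simple predicate. This is a hedged sketch of the decision rule, with the bias toward delegation baked in; `should_delegate` and its parameter names are illustrative assumptions, not the skill's API.

```python
def should_delegate(one_tool_call: bool, needs_error_handling: bool,
                    may_cascade: bool, is_strategic: bool) -> bool:
    """Sketch of the delegation decision: only trivial, single-call,
    non-strategic work runs directly in the orchestrator."""
    if is_strategic:
        return False  # strategic decisions stay with the orchestrator
    if needs_error_handling or may_cascade or not one_tool_call:
        return True   # anything that could cascade gets delegated
    return False
```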
Only these can be executed directly by orchestrator:
1. Task() - Delegation itself
   Task(subagent_type="htmlgraph:gemini-spawner", ...)
2. AskUserQuestion() - Clarifying requirements
   AskUserQuestion("Should we use Redis or PostgreSQL?")
3. TodoWrite() - Tracking work items
   TodoWrite(todos=[...])
4. HtmlGraph CLI operations (create features, spikes, bugs):
   htmlgraph feature create "title"
   htmlgraph spike create "title"
   htmlgraph bug create "title"

Everything else MUST be delegated.
Decision tree (check each in order):
Is this exploration/research/analysis?
Is this code implementation/testing?
Is this git/GitHub operation?
Does this need deep reasoning?
Is this multi-agent coordination?
All else fails → Task() with Haiku (fallback)
Spawner Subagent Types:

- gemini - FREE, 2M tokens/min, exploration & research
- codex - Cheap code specialist, implementation & testing
- copilot - Cheap git specialist, GitHub integration
- haiku - Generic Claude Haiku (use as fallback or when spawners fail)

```python
Task(
    subagent_type="gemini",
    description="Analyze authentication patterns",
    prompt="""
    Analyze codebase for:
    - All authentication patterns
    - OAuth implementations
    - Session management
    - JWT usage
    """,
)
```
Best for:
```python
Task(
    subagent_type="codex",
    description="Implement OAuth middleware",
    prompt="""
    Implement OAuth authentication:
    - Sandbox mode: workspace-write
    - Add JWT token generation
    - Include error handling
    - Write unit tests
    """,
)
```
Best for:
```python
Task(
    subagent_type="copilot",
    description="Commit and create PR",
    prompt="""
    Commit changes and create PR:
    - Message: "feat: add OAuth authentication"
    - Files: src/auth/*.py, tests/test_auth.py
    - Create PR with description
    """,
)
```
Best for:
```python
Task(
    prompt="Design authentication architecture...",
    subagent_type="sonnet",  # or "opus" for deep reasoning
)
```
Sonnet (Mid-tier):
Opus (Expensive):
Simple exploration:

```python
Task(
    subagent_type="gemini",
    description="Find all auth patterns",
    prompt="Search codebase for authentication patterns and summarize findings",
)
```

Code implementation:

```python
Task(
    subagent_type="codex",
    description="Implement OAuth endpoint",
    prompt="Implement OAuth authentication endpoint with JWT support",
)
```

Git operations:

```python
Task(
    subagent_type="copilot",
    description="Commit changes",
    prompt="Commit changes with message: 'feat: add OAuth authentication'",
)
```
MANDATORY: All git write operations must be delegated to the copilot-operator agent.
```python
# For commits, pushes, PRs, and code generation
Agent(
    subagent_type="htmlgraph:copilot-operator",
    description="Commit and push changes",
    prompt="Commit all staged files with message: 'feat: add X'. Then push to origin main.",
)
```
The copilot-operator agent:
Never run git commit/push directly from the orchestrator. The PreToolUse hook will deny it in strict mode.
For implementation, refactoring, and structured output tasks:
```python
Agent(
    subagent_type="htmlgraph:codex-operator",
    description="Implement feature X",
    prompt="Add OAuth authentication to the login endpoint. Use gpt-4.1-mini.",
)
```
The codex-operator agent:
- Always passes -m gpt-4.1-mini (never the expensive gpt-5.4 default)

For codebase exploration, documentation research, and large-context analysis:
Agent(
subagent_type="htmlgraph:gemini-operator",
description="Research auth patterns",
prompt="Analyze all authentication patterns in this codebase. Find security gaps.",
)
The gemini-operator agent:
MANDATORY: Always analyze parallelizability when 2+ tasks are identified.
Before presenting recommendations or starting multi-task work, ALWAYS:
Decision matrix:
| Dependency? | File Overlap? | Action |
|---|---|---|
| No | No | Parallel worktrees (DEFAULT) |
| No | Yes | Sequential (same files = merge conflicts) |
| Yes | No | Pipeline (parallel where deps allow) |
| Yes | Yes | Sequential |
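The decision matrix above maps directly to a small function. This is an illustrative sketch; `plan_execution` and the returned strategy labels are assumptions made for the example, not part of the skill.

```python
def plan_execution(has_dependency: bool, files_overlap: bool) -> str:
    """Map the parallelizability decision matrix to a strategy."""
    if not has_dependency and not files_overlap:
        return "parallel-worktrees"  # DEFAULT: isolated, no conflicts
    if has_dependency and not files_overlap:
        return "pipeline"            # parallel where dependencies allow
    return "sequential"              # file overlap risks merge conflicts
```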
Pattern: Spawn all at once in isolated worktrees
```python
# Launch parallel agents in worktrees - one per feature
Agent(
    subagent_type="htmlgraph:sonnet-coder",
    description="Feature A",
    prompt="Implement feature A...",
    isolation="worktree",
    run_in_background=True,
)
Agent(
    subagent_type="htmlgraph:sonnet-coder",
    description="Feature B",
    prompt="Implement feature B...",
    isolation="worktree",
    run_in_background=True,
)
Agent(
    subagent_type="htmlgraph:haiku-coder",
    description="Feature C (simple)",
    prompt="Implement feature C...",
    isolation="worktree",
    run_in_background=True,
)
```
Benefits:
After completion: Merge worktree branches to main, run quality gates, clean up.
Pattern: Chain dependent tasks in sequence
```python
# 1. Research existing patterns
Task(
    subagent_type="gemini",
    description="Research OAuth patterns",
    prompt="Find all OAuth implementations in codebase...",
)

# 2. Wait for research, then implement
# (In next message after reading result)
research_findings = "..."  # Read from previous task result
Task(
    subagent_type="codex",
    description="Implement OAuth based on research",
    prompt=f"""
    Implement OAuth using discovered patterns:
    {research_findings}
    """,
)

# 3. Wait for implementation, then commit
Task(
    subagent_type="copilot",
    description="Commit implementation",
    prompt="Commit OAuth implementation...",
)
```
When to use: When later tasks depend on earlier results
Subagents report findings automatically:
When a Task() completes, findings are available via CLI:
```shell
# Check recent spikes
htmlgraph spike list

# View specific spike
htmlgraph spike show <id>
```
Pattern: Read findings after Task completes
```python
# 1. Delegate exploration
Task(
    subagent_type="htmlgraph:gemini-operator",
    description="Analyze auth patterns",
    prompt="Find all authentication patterns...",
)

# 2. The subagent creates a spike with findings
#    Read findings via: htmlgraph spike list (then spike show <id>)

# 3. Use findings in next delegation
Task(
    subagent_type="htmlgraph:codex-operator",
    description="Implement based on findings",
    prompt="Implement authentication based on auth pattern research findings...",
)
```
Let subagents handle retries:
```python
# WRONG - Don't retry directly as orchestrator
bash_result = Bash(command="git commit -m 'feat: new'")
if failed:
    # Retry directly (context pollution)
    Bash(command="git pull && git commit")  # More context used

# CORRECT - Subagent handles retries
Task(
    subagent_type="copilot",
    description="Commit changes with retry",
    prompt="""
    Commit changes:
    Message: "feat: new feature"

    If commit fails:
    1. Pull latest changes
    2. Resolve conflicts if any
    3. Retry commit
    4. Handle pre-commit hooks

    Report final status: success or failure
    """,
)
```
Benefits:
How it works:
- Sets CLAUDE_ORCHESTRATOR_ACTIVE=true

Why: Preserve orchestration discipline after context compact
What you see:
To manually trigger:
/orchestrator-directives
Environment variable:
CLAUDE_ORCHESTRATOR_ACTIVE=true # Set by SDK
Features preserved across compact:
What's lost:
Re-activation pattern:
Before compact:
- Work on features, track in HtmlGraph
- Delegate with clear prompts
- Use SDK to save progress
After compact:
- Orchestrator Skill auto-activates
- Re-read recent spikes for context
- Continue delegations
- Use Task IDs for parallel coordination
When delegating to ANY coder agent, include these requirements in the prompt:
- Check pyproject.toml before adding new dependencies
- Check src/python/htmlgraph/utils/ for existing utilities before writing new ones
- Run quality gates: uv run ruff check --fix && uv run ruff format && uv run mypy src/ && uv run pytest
Never commit with unresolved type errors, lint warnings, or test failures.
Principle 1: Delegation > Direct Execution
Principle 2: Cost-First > Capability-First
Principle 3: You Don't Know the Outcome
Principle 4: Parallel > Sequential
Principle 5: Track Everything
Delegation > Direct Execution. Cascading failures consume exponentially more context than structured delegation.
Cost-First > Capability-First. Use FREE/cheap AIs before expensive Claude models.
| Operation | MUST Use | Cost | Fallback |
|---|---|---|---|
| Search files | Gemini spawner | FREE | Haiku |
| Pattern analysis | Gemini spawner | FREE | Haiku |
| Documentation research | Gemini spawner | FREE | Haiku |
| Code generation | Codex spawner | $ (70% off) | Sonnet |
| Bug fixes | Codex spawner | $ (70% off) | Haiku |
| Write tests | Codex spawner | $ (70% off) | Haiku |
| Git commits | Copilot spawner | $ (60% off) | Haiku |
| Create PRs | Copilot spawner | $ (60% off) | Haiku |
| Architecture | Claude Opus | $$$$ | Sonnet |
| Strategic decisions | Claude Opus | $$$$ | Task() |
Key: FREE = No cost | $ = Cheap | $$$$ = Expensive (but necessary)
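The operation-to-spawner table above can be read as a lookup with a fallback column. The following is an illustrative sketch only; `SPAWNER_FOR`, `pick_spawner`, and the operation keys are names invented for this example.

```python
# Hypothetical lookup of the MUST-use/fallback table above.
# Fallbacks apply only when the primary spawner fails.
SPAWNER_FOR = {
    "search_files": ("gemini", "haiku"),
    "pattern_analysis": ("gemini", "haiku"),
    "code_generation": ("codex", "sonnet"),
    "bug_fix": ("codex", "haiku"),
    "git_commit": ("copilot", "haiku"),
    "create_pr": ("copilot", "haiku"),
    "architecture": ("opus", "sonnet"),
}

def pick_spawner(operation: str, primary_failed: bool = False) -> str:
    """Return the mandated spawner, or its fallback on failure."""
    primary, fallback = SPAWNER_FOR[operation]
    return fallback if primary_failed else primary
```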
htmlgraph --help

Cost-First Orchestration:
Orchestrator Rule: Only execute: Task(), AskUserQuestion(), TodoWrite(), SDK operations
Everything else → Delegate to appropriate spawner
When in doubt → DELEGATE