Skill

ai-council-orchestration

Multi-model autonomous loop through Think→Plan→Create→Review→Verify — uses the best model per stage from ALL connected providers. Live auto-discovery of available models.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/ponytail:ai-council-orchestration

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

**Every stage uses the best available model for its role.** Model catalog is auto-discovered at runtime — no hardcoded model names that go stale. Works with GitHub Copilot, OpenCode Zen, Nvidia NIM, Ollama, Gemini, OpenAI, Groq, OpenRouter, and official Claude.

SKILL.md

257 lines · ~3.1k tokens

Stats

LanguagePython

Stars0

MaintenanceExcellent

Last CommitJun 22, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

AI Council Orchestration — Multi-Model, All Providers

Every stage uses the best available model for its role. Model catalog is auto-discovered at runtime — no hardcoded model names that go stale. Works with GitHub Copilot, OpenCode Zen, Nvidia NIM, Ollama, Gemini, OpenAI, Groq, OpenRouter, and official Claude.

Quick Start

council-orchestrator models                   # Step 0: discover available models
council-orchestrator init "<your objective>"   # Step 1: start council
council-orchestrator status                    # Step 2: check stage

Then enter the loop below.

Model Discovery (Step 0)

Before entering the main loop, run:

council-orchestrator models

This queries http://127.0.0.1:4001/v1/models live and writes COUNCIL_MODELS.md with:

All available models grouped by provider
Recommended model for each council role
Live indicators (⚡ = connected now)

If the proxy isn't running, fall back to the embedded catalog below.

Role-to-Model Mapping (Live)

Role	Best Picks (in priority order)	Selection Strategy
Thinker (deep reasoning)	copilot/claude-opus-4.6-1m → opencode/qwen3.7-max → opencode/deepseek-v4-pro → opencode/kimi-k2.6	Strongest analytical model available
Planner (task decomposition)	copilot/claude-sonnet-4.6 → opencode/qwen3.6-plus → opencode/minimax-m2.7	Best at structured planning
Creator (code + TDD)	copilot/gpt-5.4 → opencode/deepseek-v4-flash → opencode/minimax-m2.7 → copilot/grok-code-fast-1	Best code generation available
Critic (adversarial review)	copilot/claude-sonnet-4.6 → opencode/deepseek-v4-pro → opencode/kimi-k2.6	Strong at finding flaws
Reviewer (code review)	copilot/claude-sonnet-4.6 → opencode/qwen3.6-plus → opencode/minimax-m2.5	Balanced review quality
Verifier (fast checks)	copilot/claude-haiku-4.5 → copilot/gpt-5-mini → opencode/deepseek-v4-flash-free	Fast & cheap, uses FREE tier if available

Selection rule: Pick the first model from the priority list that is currently connected (⚡ in council-orchestrator models output). If none of the top picks are available, use any connected model — don't stall.

Embedded Model Catalog (Fallback)

If the proxy is unreachable, use this static reference:

GitHub Copilot (connected ✅)

Model ID	Capabilities	Best For
copilot/claude-opus-4.6-1m	Vision, 15x premium	★ Thinker, Critic
copilot/claude-sonnet-4.6	Vision	★ Planner, Reviewer, Critic
copilot/claude-sonnet-4.5	Vision	Reviewer, Planner
copilot/claude-haiku-4.5	Vision, 0.33x cost	★ Verifier
copilot/gpt-5.4	Vision	★ Creator
copilot/gpt-5.2	Vision	Creator
copilot/gpt-5-mini	Vision, FREE	Verifier, Critic, fallback
copilot/grok-code-fast-1	Fast coding	Creator (fast path)

OpenCode Zen (always available ✅)

Model ID	Context	Best For
opencode/minimax-m3	128K	All-rounder
opencode/minimax-m2.7	1M ctx	★ Planner (large codebases)
opencode/minimax-m2.5	1M ctx	Large context tasks
opencode/qwen3.7-max	128K	★ Thinker, Creator
opencode/qwen3.7-plus	128K	Creator
opencode/qwen3.6-plus	131K	★ Planner, Reviewer
opencode/qwen3.5-plus	131K	All-rounder
opencode/kimi-k2.6	131K	★ Thinker, Critic
opencode/kimi-k2.5	131K	Thinker, Critic
opencode/deepseek-v4-pro	65K	★ Thinker, Critic
opencode/deepseek-v4-flash	65K	★ Creator (fast)
opencode/glm-5.1	128K	All-rounder
opencode/glm-5	128K	All-rounder
opencode/mimo-v2.5-pro	262K	Large context
opencode/mimo-v2.5	262K	Large context
opencode/mimo-v2-pro	65K	General
opencode/mimo-v2-omni	65K	General
opencode/hy3-preview	131K	Preview

OpenCode Zen — FREE tier (always available ✅)

Model ID	Best For
opencode/deepseek-v4-flash-free	★ Verifier, fallback Creator
opencode/mimo-v2.5-free	Verifier, fallback
opencode/minimax-m3-free	Verifier, fallback
opencode/nemotron-3-super-free	Verifier, fallback

Nvidia NIM (if connected)

meta/llama-3.3-70b-instruct, meta/llama-3.1-8b-instruct, nvidia/llama-3.1-nemotron-70b-instruct, nvidia/nemotron-3-ultra-550b-a55b, mistralai/mistral-7b-instruct-v0.3

Ollama (local, if running)

qwen3:8b, qwen3:14b, llama3.3:70b

Google Gemini (if connected)

gemini-2.5-pro, gemini-2.5-flash, gemini-2.0-flash, gemini-1.5-pro, gemini-1.5-flash

OpenAI (if connected)

gpt-4o, gpt-4o-mini, o3-mini, o4-mini, gpt-4.1, codex-mini-latest

Groq (if connected)

llama-3.3-70b-versatile, llama-3.1-8b-instant, deepseek-r1-distill-llama-70b, mixtral-8x7b, gemma2-9b-it

OpenRouter (if connected)

google/gemma-3-27b-it:free, meta-llama/llama-3.3-70b-instruct:free, deepseek/deepseek-r1:free, qwen/qwen3-8b:free

Claude (Anthropic, official — if not using proxy)

claude-sonnet-4-6, claude-sonnet-4-5, claude-haiku-4-5, claude-opus-4-7, claude-opus-4-6, claude-opus-4-5

Architecture

STEP 0: council-orchestrator models     ← discover available models (live)

LOOP:
  1. council-orchestrator status        ← check current stage
  2. Select best model for stage         ← pick from connected providers
  3. Execute stage handler               ← uses embedded patterns below
  4. council-orchestrator advance/loopback ← update state
  5. GOTO step 1                        ← UNCONDITIONAL

BREAK ONLY when:
  - __delivery_check__ says done → DELIVER
  - __maxed_out__ safety limit → REPORT

Council Structure

Agent	Role	Model Selection Strategy
Thinker	Deep reasoning, ideation	Pick strongest analytical model connected
Planner	Task decomposition, file mapping	Best at structured breakdown
Creator	Implementation + TDD	Best code generator connected
Critic	Adversarial review	Strong analysis, find flaws
Reviewer	Code review	Balanced, thorough
Verifier	Fast final verification	Fastest/cheapest connected

Stage 1 — THINK

Embedded: Brainstorming Pattern

Model: Strongest analytical model connected (priority: copilot/claude-opus-4.6-1m → opencode/qwen3.7-max → opencode/deepseek-v4-pro → opencode/kimi-k2.6 → any connected)

Model selection: Run council-orchestrator models or check COUNCIL_MODELS.md. Pick the best Thinker model from what's connected.
Explore context & load helper skills — read project files, docs, recent commits. Also read and load all helper skills (skills/ponytail/SKILL.md, skills/ponytail-review/SKILL.md, skills/ponytail-audit/SKILL.md, skills/ponytail-debt/SKILL.md, skills/ponytail-gain/SKILL.md, skills/ponytail-help/SKILL.md, skills/loop/SKILL.md) to integrate their rules and capabilities into the session context.
Clarify & decompose — break objective into independent subsystems.
Propose 2-3 architectures with explicit trade-offs (adhering to Ponytail rules: YAGNI, standard library/native features first, no speculative abstractions).
Stress-test: What assumptions could be false? What could go wrong?
Spawn Thinker sub-agent → THOUGHT_REPORT.md
Spawn Critic sub-agent → CRITIQUE_REPORT.md
If concerns → council-orchestrator loopback think "<reason>" → GOTO LOOP
If clear → council-orchestrator advance think "approved" → GOTO LOOP

Stage 2 — PLAN

Embedded: Writing Plans Pattern

Model: Best planner connected (priority: copilot/claude-sonnet-4.6 → opencode/qwen3.6-plus → opencode/minimax-m2.7 → best connected)

Model selection: Pick the best Planner model.
Map the absolute minimum file structure needed. Avoid speculative helper files or interfaces.
Decompose into bite-sized tasks (2-5 min each).
Write TASK_EXECUTION_PLAN.md with real code in every step. No placeholders or boilerplate.
Self-review: spec coverage? placeholders? type consistency?
Spawn Critic: missing criteria? dependencies correct?
If concerns → loopback → GOTO LOOP
If clear → advance → GOTO LOOP

Stage 3 — CREATE

Embedded: TDD + Subagent-Driven Development + Parallel Dispatch Patterns

Model: Best coder connected (priority: copilot/gpt-5.4 → opencode/deepseek-v4-flash → opencode/minimax-m2.7 → copilot/grok-code-fast-1 → any connected)

Model selection: Pick the best Creator model. Verify current level using /ponytail.
TDD IRON LAW: No production code without a failing test first.
RED → Verify RED → GREEN (minimal implementation following Ponytail ladder: YAGNI → stdlib → native → one-line → minimum code) → Verify GREEN → REFACTOR (staying green, mark simplifications with ponytail: comments).
Dispatch fresh sub-agents per independent task. Ensure they are instructed to follow the Ponytail ladder.
Two-stage review per task: spec compliance → code quality.
Parallel dispatch for independent domains.
If missing capability → write pattern as skill.
If done → advance → GOTO LOOP
If issues → loopback → GOTO LOOP

Stage 4 — REVIEW & TEST

Embedded: Code Review + Systematic Debugging + Verification Patterns

Model: Best reviewer connected (priority: copilot/claude-sonnet-4.6 → opencode/deepseek-v4-pro → opencode/qwen3.6-plus → any connected)

Model selection: Pick the best Reviewer model.
Pre-review: get SHAs, summary of what was built.
Spawn ALL council roles to review simultaneously. Critic must run a ponytail-review for over-engineering (tags: delete, stdlib, native, yagni, shrink) and report net lines removable. Additionally, run the /ponytail-review command (or ponytail-review skill) directly on the current git diff to harvest a concrete delete-list.
If flaws → Systematic Debugging (4-phase: root cause → pattern → hypothesis → fix)
IRON LAW: No fixes without root cause investigation.
Fix → re-verify → loopback review → GOTO LOOP
When clean → advance → GOTO LOOP

Stage 5 — VERIFY & DELIVER

Embedded: Verification Before Completion + Finishing Branch Patterns

Model: Fastest/cheapest connected (priority: copilot/claude-haiku-4.5 → copilot/gpt-5-mini → opencode/deepseek-v4-flash-free → any FREE model)

Model selection: Pick the cheapest available model — verification is simple checks.
IRON LAW: No "it works" without fresh verification output.
Run full test suite, build, integration.
Spawn completeness verifier.
Run the /ponytail-debt command (or ponytail-debt skill) to harvest any deferred shortcuts into PONYTAIL-DEBT.md.
Produce VERIFICATION_SIGN_OFF.md.
If verified → advance → GOTO LOOP
If not → loopback to appropriate stage → GOTO LOOP

Delivery Check

When stage is __delivery_check__:

If objective satisfied → DELIVER. STOP THE LOOP.
If not → council-orchestrator next-iteration → GOTO LOOP

Standing Directives

#	Directive	Rule
1	NEVER STOP	No user input needed. Resolve blockers autonomously.
2	GOTO LOOP step 1	After every action, immediately check status
3	TDD always	No production code without a failing test first
4	Verify before claiming	Run command, check fresh exit code & output
5	Root cause before fix	No symptom fixes without investigation
6	Safety limit: 50 iterations	Loop terminates to prevent runaway tokens
7	Auto-discover models	Refresh list with `council-orchestrator models`
8	Follow Ponytail rules	YAGNI → stdlib → native → one line → minimum. Mark simplifications with `ponytail:` comments

Activation

# Step 0 — Discover models (run once per session)
council-orchestrator models

# Step 1 — Initialize
council-orchestrator init "<full objective>"

# Step 2 — Enter loop
council-orchestrator status

The council reads COUNCIL_MODELS.md, picks the best model per role from what's actually connected, and executes each stage with its embedded pattern. The loop turns until done.

ai-council-orchestration

Invocation

Context Preview

SKILL.md

ai-council-orchestration

Invocation

Context Preview

SKILL.md

AI Council Orchestration — Multi-Model, All Providers

Quick Start

Model Discovery (Step 0)

Role-to-Model Mapping (Live)

Embedded Model Catalog (Fallback)

GitHub Copilot (connected ✅)

OpenCode Zen (always available ✅)

OpenCode Zen — FREE tier (always available ✅)

Nvidia NIM (if connected)

Ollama (local, if running)

Google Gemini (if connected)

OpenAI (if connected)

Groq (if connected)

OpenRouter (if connected)

Claude (Anthropic, official — if not using proxy)

Architecture

Council Structure

Stage 1 — THINK

Stage 2 — PLAN

Stage 3 — CREATE

Stage 4 — REVIEW & TEST

Stage 5 — VERIFY & DELIVER

Delivery Check

Standing Directives

Activation

Similar Skills

AI Council Orchestration — Multi-Model, All Providers

Quick Start

Model Discovery (Step 0)

Role-to-Model Mapping (Live)

Embedded Model Catalog (Fallback)

GitHub Copilot (connected ✅)

OpenCode Zen (always available ✅)

OpenCode Zen — FREE tier (always available ✅)

Nvidia NIM (if connected)

Ollama (local, if running)

Google Gemini (if connected)

OpenAI (if connected)

Groq (if connected)

OpenRouter (if connected)

Claude (Anthropic, official — if not using proxy)

Architecture

Council Structure

Stage 1 — THINK

Stage 2 — PLAN

Stage 3 — CREATE

Stage 4 — REVIEW & TEST

Stage 5 — VERIFY & DELIVER

Delivery Check

Standing Directives

Activation

Similar Skills