From ponytail
Multi-model autonomous loop through Think→Plan→Create→Review→Verify — uses the best model per stage from ALL connected providers. Live auto-discovery of available models.
How this skill is triggered — by the user, by Claude, or both
Slash command
/ponytail:ai-council-orchestrationThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
**Every stage uses the best available model for its role.** Model catalog is auto-discovered at runtime — no hardcoded model names that go stale. Works with GitHub Copilot, OpenCode Zen, Nvidia NIM, Ollama, Gemini, OpenAI, Groq, OpenRouter, and official Claude.
Every stage uses the best available model for its role. Model catalog is auto-discovered at runtime — no hardcoded model names that go stale. Works with GitHub Copilot, OpenCode Zen, Nvidia NIM, Ollama, Gemini, OpenAI, Groq, OpenRouter, and official Claude.
council-orchestrator models # Step 0: discover available models
council-orchestrator init "<your objective>" # Step 1: start council
council-orchestrator status # Step 2: check stage
Then enter the loop below.
Before entering the main loop, run:
council-orchestrator models
This queries http://127.0.0.1:4001/v1/models live and writes COUNCIL_MODELS.md with:
If the proxy isn't running, fall back to the embedded catalog below.
| Role | Best Picks (in priority order) | Selection Strategy |
|---|---|---|
| Thinker (deep reasoning) | copilot/claude-opus-4.6-1m → opencode/qwen3.7-max → opencode/deepseek-v4-pro → opencode/kimi-k2.6 | Strongest analytical model available |
| Planner (task decomposition) | copilot/claude-sonnet-4.6 → opencode/qwen3.6-plus → opencode/minimax-m2.7 | Best at structured planning |
| Creator (code + TDD) | copilot/gpt-5.4 → opencode/deepseek-v4-flash → opencode/minimax-m2.7 → copilot/grok-code-fast-1 | Best code generation available |
| Critic (adversarial review) | copilot/claude-sonnet-4.6 → opencode/deepseek-v4-pro → opencode/kimi-k2.6 | Strong at finding flaws |
| Reviewer (code review) | copilot/claude-sonnet-4.6 → opencode/qwen3.6-plus → opencode/minimax-m2.5 | Balanced review quality |
| Verifier (fast checks) | copilot/claude-haiku-4.5 → copilot/gpt-5-mini → opencode/deepseek-v4-flash-free | Fast & cheap, uses FREE tier if available |
Selection rule: Pick the first model from the priority list that is currently connected (⚡ in council-orchestrator models output). If none of the top picks are available, use any connected model — don't stall.
If the proxy is unreachable, use this static reference:
| Model ID | Capabilities | Best For |
|---|---|---|
| copilot/claude-opus-4.6-1m | Vision, 15x premium | ★ Thinker, Critic |
| copilot/claude-sonnet-4.6 | Vision | ★ Planner, Reviewer, Critic |
| copilot/claude-sonnet-4.5 | Vision | Reviewer, Planner |
| copilot/claude-haiku-4.5 | Vision, 0.33x cost | ★ Verifier |
| copilot/gpt-5.4 | Vision | ★ Creator |
| copilot/gpt-5.2 | Vision | Creator |
| copilot/gpt-5-mini | Vision, FREE | Verifier, Critic, fallback |
| copilot/grok-code-fast-1 | Fast coding | Creator (fast path) |
| Model ID | Context | Best For |
|---|---|---|
| opencode/minimax-m3 | 128K | All-rounder |
| opencode/minimax-m2.7 | 1M ctx | ★ Planner (large codebases) |
| opencode/minimax-m2.5 | 1M ctx | Large context tasks |
| opencode/qwen3.7-max | 128K | ★ Thinker, Creator |
| opencode/qwen3.7-plus | 128K | Creator |
| opencode/qwen3.6-plus | 131K | ★ Planner, Reviewer |
| opencode/qwen3.5-plus | 131K | All-rounder |
| opencode/kimi-k2.6 | 131K | ★ Thinker, Critic |
| opencode/kimi-k2.5 | 131K | Thinker, Critic |
| opencode/deepseek-v4-pro | 65K | ★ Thinker, Critic |
| opencode/deepseek-v4-flash | 65K | ★ Creator (fast) |
| opencode/glm-5.1 | 128K | All-rounder |
| opencode/glm-5 | 128K | All-rounder |
| opencode/mimo-v2.5-pro | 262K | Large context |
| opencode/mimo-v2.5 | 262K | Large context |
| opencode/mimo-v2-pro | 65K | General |
| opencode/mimo-v2-omni | 65K | General |
| opencode/hy3-preview | 131K | Preview |
| Model ID | Best For |
|---|---|
| opencode/deepseek-v4-flash-free | ★ Verifier, fallback Creator |
| opencode/mimo-v2.5-free | Verifier, fallback |
| opencode/minimax-m3-free | Verifier, fallback |
| opencode/nemotron-3-super-free | Verifier, fallback |
meta/llama-3.3-70b-instruct, meta/llama-3.1-8b-instruct, nvidia/llama-3.1-nemotron-70b-instruct, nvidia/nemotron-3-ultra-550b-a55b, mistralai/mistral-7b-instruct-v0.3
qwen3:8b, qwen3:14b, llama3.3:70b
gemini-2.5-pro, gemini-2.5-flash, gemini-2.0-flash, gemini-1.5-pro, gemini-1.5-flash
gpt-4o, gpt-4o-mini, o3-mini, o4-mini, gpt-4.1, codex-mini-latest
llama-3.3-70b-versatile, llama-3.1-8b-instant, deepseek-r1-distill-llama-70b, mixtral-8x7b, gemma2-9b-it
google/gemma-3-27b-it:free, meta-llama/llama-3.3-70b-instruct:free, deepseek/deepseek-r1:free, qwen/qwen3-8b:free
claude-sonnet-4-6, claude-sonnet-4-5, claude-haiku-4-5, claude-opus-4-7, claude-opus-4-6, claude-opus-4-5
STEP 0: council-orchestrator models ← discover available models (live)
LOOP:
1. council-orchestrator status ← check current stage
2. Select best model for stage ← pick from connected providers
3. Execute stage handler ← uses embedded patterns below
4. council-orchestrator advance/loopback ← update state
5. GOTO step 1 ← UNCONDITIONAL
BREAK ONLY when:
- __delivery_check__ says done → DELIVER
- __maxed_out__ safety limit → REPORT
| Agent | Role | Model Selection Strategy |
|---|---|---|
| Thinker | Deep reasoning, ideation | Pick strongest analytical model connected |
| Planner | Task decomposition, file mapping | Best at structured breakdown |
| Creator | Implementation + TDD | Best code generator connected |
| Critic | Adversarial review | Strong analysis, find flaws |
| Reviewer | Code review | Balanced, thorough |
| Verifier | Fast final verification | Fastest/cheapest connected |
Embedded: Brainstorming Pattern
Model: Strongest analytical model connected (priority: copilot/claude-opus-4.6-1m → opencode/qwen3.7-max → opencode/deepseek-v4-pro → opencode/kimi-k2.6 → any connected)
council-orchestrator models or check COUNCIL_MODELS.md. Pick the best Thinker model from what's connected.skills/ponytail/SKILL.md, skills/ponytail-review/SKILL.md, skills/ponytail-audit/SKILL.md, skills/ponytail-debt/SKILL.md, skills/ponytail-gain/SKILL.md, skills/ponytail-help/SKILL.md, skills/loop/SKILL.md) to integrate their rules and capabilities into the session context.THOUGHT_REPORT.mdCRITIQUE_REPORT.mdcouncil-orchestrator loopback think "<reason>" → GOTO LOOPcouncil-orchestrator advance think "approved" → GOTO LOOPEmbedded: Writing Plans Pattern
Model: Best planner connected (priority: copilot/claude-sonnet-4.6 → opencode/qwen3.6-plus → opencode/minimax-m2.7 → best connected)
TASK_EXECUTION_PLAN.md with real code in every step. No placeholders or boilerplate.Embedded: TDD + Subagent-Driven Development + Parallel Dispatch Patterns
Model: Best coder connected (priority: copilot/gpt-5.4 → opencode/deepseek-v4-flash → opencode/minimax-m2.7 → copilot/grok-code-fast-1 → any connected)
/ponytail.ponytail: comments).Embedded: Code Review + Systematic Debugging + Verification Patterns
Model: Best reviewer connected (priority: copilot/claude-sonnet-4.6 → opencode/deepseek-v4-pro → opencode/qwen3.6-plus → any connected)
/ponytail-review command (or ponytail-review skill) directly on the current git diff to harvest a concrete delete-list.Embedded: Verification Before Completion + Finishing Branch Patterns
Model: Fastest/cheapest connected (priority: copilot/claude-haiku-4.5 → copilot/gpt-5-mini → opencode/deepseek-v4-flash-free → any FREE model)
/ponytail-debt command (or ponytail-debt skill) to harvest any deferred shortcuts into PONYTAIL-DEBT.md.VERIFICATION_SIGN_OFF.md.When stage is __delivery_check__:
council-orchestrator next-iteration → GOTO LOOP| # | Directive | Rule |
|---|---|---|
| 1 | NEVER STOP | No user input needed. Resolve blockers autonomously. |
| 2 | GOTO LOOP step 1 | After every action, immediately check status |
| 3 | TDD always | No production code without a failing test first |
| 4 | Verify before claiming | Run command, check fresh exit code & output |
| 5 | Root cause before fix | No symptom fixes without investigation |
| 6 | Safety limit: 50 iterations | Loop terminates to prevent runaway tokens |
| 7 | Auto-discover models | Refresh list with council-orchestrator models |
| 8 | Follow Ponytail rules | YAGNI → stdlib → native → one line → minimum. Mark simplifications with ponytail: comments |
# Step 0 — Discover models (run once per session)
council-orchestrator models
# Step 1 — Initialize
council-orchestrator init "<full objective>"
# Step 2 — Enter loop
council-orchestrator status
The council reads COUNCIL_MODELS.md, picks the best model per role from what's actually connected, and executes each stage with its embedded pattern. The loop turns until done.
npx claudepluginhub armaan-hub/ai-coincil --plugin ponytailProvides behavioral guidelines to reduce common LLM coding mistakes, focusing on simplicity, surgical changes, assumption surfacing, and verifiable success criteria.
Searches, retrieves, and installs Agent Skills from prompts.chat registry using MCP tools like search_skills and get_skill. Activates for finding skills, browsing catalogs, or extending Claude.
Creates, edits, and optimizes skills for Claude Code, including drafting, evaluating with test prompts, iterating on performance, and improving skill descriptions for better triggering accuracy.