Multi-agent development pipeline for Claude Code — 12 specialist agents, 13 slash commands, 18 skills, adversarial critic loop, and automated sync distribution. Drop-in plugin that adds /plan, /build, /review workflows with explorer, architect, builder, critic, tester, and reviewer agents.
npx claudepluginhub timgranlundmarsden/claude-agent-flow**Skills:** agent-flow-init-check
Display a live backlog status dashboard — active tasks table with PR state and branch-to-test. Optional flag: --qa also generates a QA test guide for all Ready-for-Review tasks.
Build a feature using the adversarial review loop. Runs builder agents, then critic, then fixes, then critic again — repeating until critic passes and tests are green. Use when quality assurance matters more than speed.
Check the current branch's PR for external review feedback, fix errors and warnings, run a final external review, and push.
Report the health of the agent-flow installation
Quick read-only exploration of the codebase. Invokes explorer agent to map files, dependencies, and patterns relevant to a topic. No code changes. Use before starting any significant task to understand the landscape.
Run a single external code review via an external LLM API on the full branch diff vs main. Standalone usage — no aggregation with internal review.
Show comprehensive help guide for the agent team system
Initialize or update the agent-flow system in this repository
Guide a collaborative planning session to refine an idea into a structured build brief. Conducts Socratic dialogue with the user via the brainstorming skill, leverages architect and researcher for design and technical validation, and saves the result as a plan file and backlog task that /build can consume.
Guided rebase workflow. Stashes uncommitted changes, fetches latest from origin, rebases onto the remote branch (or a specified branch), resolves conflicts if possible, and pops the stash on success.
Run the adversarial critic against existing code without rebuilding. Use when you want to harden code that already exists, or run a quality pass on a completed feature before shipping.
Sync plugin skills into the repo for web environment compatibility
View, create, or update TECHSTACK.md for the current project. Runs the Technology Discovery Protocol to detect the project's stack from source files, package manifests, and CI/CD config. Safe to run at any time — existing user-added content is always preserved. Use --force to bypass the 72-hour freshness check and trigger a full rescan.
Analyse Claude Code session token costs inline
Design authority. Read-only — produces design decisions and file-level plans for builder agents. Invoke before features touching 3+ files or new patterns.
Documentation specialist. Invoked last. Updates README, docstrings, CHANGELOG, and CLAUDE.md. Also for standalone documentation tasks.
Backend specialist. Owns API routes, business logic, DB queries, auth, and server infrastructure.
Adversarial code critic. Tries to break code with edge cases and failure scenarios. Returns FAIL/PASS. Used in /build and /review loops.
Read-only codebase navigator. Maps files, traces dependencies, surfaces existing patterns. Cheap — use constantly before implementation tasks.
Frontend specialist. Owns UI, components, styling, and client-side state. Accessibility, performance, and bundle concerns.
Lateral thinking and solution space exploration. Output goes to human only — never into orchestrator or builders. Not for implementation.
Master orchestrator for multi-domain tasks. Breaks work into subtasks, sequences specialist agents, and delegates. Never writes code itself.
Research specialist. Web search for current docs, library comparisons, and best practices. Use when decisions need up-to-date information.
Code review specialist. Read-only. Reviews for security, correctness, performance, style. Returns BLOCKER / WARNING / SUGGESTION findings.
Storage specialist. Owns all data persistence — databases, cloud/object/local storage, and access control policies.
Test specialist — frontend and backend. Writes tests, runs suites, verifies. Never marks done if tests are red.
This skill should be used when the user asks to "create a plugin", "scaffold a plugin", "understand plugin structure", "organize plugin components", "set up plugin.json", "use ${CLAUDE_PLUGIN_ROOT}", "add commands/agents/skills/hooks", "configure auto-discovery", or needs guidance on plugin directory layout, manifest configuration, component organization, file naming conventions, or Claude Code plugin architecture best practices.
This skill should be used when the user wants to "create a skill", "add a skill to plugin", "write a new skill", "improve skill description", "organize skill content", or needs guidance on skill structure, progressive disclosure, or skill development best practices for Claude Code plugins.
This skill should be used when the user asks to "create an agent", "add an agent", "write a subagent", "agent frontmatter", "when to use description", "agent examples", "agent tools", "agent colors", "autonomous agent", or needs guidance on agent structure, system prompts, triggering conditions, or agent development best practices for Claude Code plugins.
Run code review via an external LLM API using external-review.sh. Outputs structured JSON with verdict, summary, and per-file concerns.
Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, artifacts, posters, or applications (examples include websites, landing pages, dashboards, React components, HTML/CSS layouts, or when styling/beautifying any web UI). Generates creative, polished code and UI design that avoids generic AI aesthetics.
Guard skill that halts command execution if the agent-flow system is not initialized. Checks for .claude-agent-flow/sync-state.json (installed repos) or .claude-agent-flow/publish-plugin-manifest.yml (master source repo) and stops with a clear message to run /install if neither is present.
Create visually appealing tables and diagrams for display. Use when the user asks to create tables, diagrams, status displays, dashboards, or structured text layouts that should render nicely in monospace fonts. Triggers include requests for tables, terminal-style output, box diagrams, or recreating visual layouts in text form.
Task and document management with Backlog.md. Use this skill whenever working with tasks, tracking work, creating subtasks, searching or viewing existing tasks, or any time work should be preserved across sessions or handed off. Triggers include: "create a task", "track this", "add to backlog", "what tasks do I have", "mark as done", "continue on task", "break this into tasks", or any request where the outcome has a concrete deliverable worth tracking. Always use this skill when a project includes a Backlog.md setup and work needs to be logged. Prefer CLI commands unless MCP tools are confirmed working in the current environment.
Technical project management for Backlog.md workstreams using coordinated sub-agents. Use only when the user explicitly asks Codex to take over one or more existing Backlog.md tasks (for example: "act as TPM", "coordinate these tasks", "delegate to sub-agents"). Orchestrate planning, implementation, and finalization; manage dependencies and overlapping code areas; review and approve sub-agent plans before implementation.
You MUST use this before any creative work - creating features, building components, adding functionality, or modifying behavior. Explores user intent, requirements and design before implementation.
This skill should be used when the user asks to "create a slash command", "add a command", "write a custom command", "define command arguments", "use command frontmatter", "organize commands", "create command with file references", "interactive command", "use AskUserQuestion in command", or needs guidance on slash command structure, YAML frontmatter fields, dynamic arguments, bash execution in commands, user interaction patterns, or command development best practices for Claude Code.
This skill should be used when the user asks to "create a hook", "add a PreToolUse/PostToolUse/Stop hook", "validate tool use", "implement prompt-based hooks", "use ${CLAUDE_PLUGIN_ROOT}", "set up event-driven automation", "block dangerous commands", or mentions hook events (PreToolUse, PostToolUse, Stop, SubagentStop, SessionStart, SessionEnd, UserPromptSubmit, PreCompact, Notification). Provides comprehensive guidance for creating and implementing Claude Code plugin hooks with focus on advanced prompt-based hooks API.
Convert a Claude Code session JSONL log into an enriched visualiser JSON file for docs/public/logs/. Runs log_converter.py for the base structure, then enriches with correct agent models, brainstorming Q&A (question events), adversarial loop transitions, parallel timing, and a story-focused title/subtitle.
This skill should be used when the user asks to "add MCP server", "integrate MCP", "configure MCP in plugin", "use .mcp.json", "set up Model Context Protocol", "connect external service", mentions "${CLAUDE_PLUGIN_ROOT} with MCP", or discusses MCP server types (SSE, stdio, HTTP, WebSocket). Provides comprehensive guidance for integrating Model Context Protocol servers into Claude Code plugins for external tool and service integration.
Default verification tool for local HTML files. Use visual-check.sh as the FIRST choice whenever you need to verify, test, or check any local HTML page — whether confirming elements render correctly, checking layout, detecting overflow, or asserting uniform widths. Prefer this over raw playwright-cli commands. Also the authoritative reference for playwright-cli setup and gotchas in this project — always load this skill before using playwright-cli directly.
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.
This skill should be used when the user asks about "plugin settings", "store plugin configuration", "user-configurable plugin", ".local.md files", "plugin state files", "read YAML frontmatter", "per-project plugin settings", or wants to make plugin behavior configurable. Documents the .claude/plugin-name.local.md pattern for storing plugin-specific configuration with YAML frontmatter and markdown content.
Sync installed Claude Code plugin skills into the repo so they work in web environments. Run this from a local CLI session where plugins are installed.
Analyse Claude Code session token costs inline
Agent team best practices — philosophy, routing, execution model, adversarial loop, cost management, failure modes, and brief writing.
Optional web search skill using a LiteLLM-compatible grounded-search endpoint (e.g. Gemini via LiteLLM). When AGENT_FLOW_WEB_SEARCH_ENABLED=true, agents can shell out to this skill for real-time search results as an alternative to the native WebSearch tool. Outputs synthesised answer with citations (answer mode), raw citation list (search mode), or full API JSON (raw mode).
HelloAGENTS — The orchestration kernel that makes any AI CLI smarter. Adds intelligent routing, quality verification (Ralph Loop), safety guards, and notifications.
The development-workflow plugin for Claude Code — 35 skills organized around a 6-phase workflow (Think → Review → Build → Ship → Maintain → Setup), 24 agents, and an interactive setup wizard for rules, modes, hooks, and MCP servers.
Core planning and workflow infrastructure for the Claudikins ecosystem
Use Claude Code As Is - native plugin leveraging built-in architecture
HarnessFlow — From idea to shipped product: high-quality engineering workflows for AI agents. Spec-anchored SDD, gated TDD, evidence-based routing, independent reviews, and formal closeout.
Persona-driven AI development team: orchestrator, team agents, review agents, skills, slash commands, and advisory hooks for Claude Code
Executes bash commands
Hook triggers when Bash tool is used
Uses power tools
Uses Bash, Write, or Edit tools
Share bugs, ideas, or general feedback.
A Claude Code plugin that brings a multi-agent development pipeline to your repository — adversarial review loops, 12 specialist agents, 14 slash commands, and 18 skills.
Full documentation → · Jump to installation ↓
Note: Install this into your existing project using the
install.shscript: Jump to installation ↓ For a fresh start or sandbox/workspace use, you can also clone this repo directly and open it in Claude Code — run/installonce to initialise sync state.

I started this project because if I am honest I was a bit disappointed with the current crop of AI tooling. As amazing as their results often are, when the model says "I am done", it was always far from done. Maybe you asked for a mobile friendly web page and it turned out that the result wasnt in fact mobile friendly. Or maybe you asked it to fix some code and then on running the tests, you saw that it had made introduced another bug. Either way I was spending too much time hand holding and guiding the models.
I am a big Claude Code user and it supports so many great things so I thought why don't I try and build a workflow that would help me formulate my idea, research it for me, sanity check it and then build it for me all without my intervention. Obviously I would need to discuss with the AI what I wanted but once we had a good plan, I wanted my work to be turned into a "ticket" and then have the AI build it for me and not come back to me until it was "Done Done"!
But the big part was once it thought it was done, it would need to get the approval of multiple separate AI agents that would be very thorough and check it's work. "Works on mobile?", then prove it with screenshots! "Code looks good", then lets run all the tests. If anything breaks, send it back to the agent that did the work for correction. Only when the entire flow if complete and every agent is satisfied, is the job "Ready for Review"
Today it's 12 specialised agents, a skill library, and a full CI/CD integration — built entirely by running itself. What began as a simple automation became a full on factory line capable of solving multiple tickets at once. When you couple it with Claude Code for Web you can literally have AI build your ideas whilst you are out and about. It gets rather addictive :-)
Two ideas drive everything in this project:
Quality emerges from adversarial loops. A dedicated adversary that actively tries to break the code finds what a reviewer never sees. The critic's job is not to approve — it is to fail the work. The loop runs unattended until nothing breaks, compressing days of real-team code review into minutes. This is how you catch race conditions, silent failures, and injection vectors that slip past conventional review.
Specialisation beats generalism. A single agent that must design, implement, test, and review the same work carries compounding bias. Separation of roles removes the tension between building and verifying. Twelve agents each own exactly one part of the problem — clean context, no role confusion. The frontend agent doesn't second-guess the API design; the critic doesn't have to stomach the implementation weight.
The full reasoning behind these choices is on the Why Agent Flow page.
You describe what you want. The pipeline handles the rest.
/plan — Socratic conversation that asks questions, explores approaches, produces a structured brief with acceptance criteria saved as a plan file in your repo.
/build — Takes a plan file and runs the full agent sequence: explorer maps codebase, architect designs approach, builders implement, then the adversarial critic loop kicks in (critic tries to break it, builder fixes, repeat until PASS), then tester runs full test suite, reviewer does final pass, author updates docs. Runs unattended.
/review — For code you wrote yourself without using /build. Runs the adversarial critic loop to stress-test your diff before merging.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claim