Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
By krzemienski
3-agent unanimous consensus validation with hard gates for Claude Code. Maps Lead/Alpha/Bravo roles to CC subagents, enforces unanimity at phase transitions, persists evidence per phase+role.
npx claudepluginhub krzemienski/multi-agent-consensus --plugin multi-agent-consensusGenerate a report from the last consensus pipeline run (reads state.json + evidence manifest).
Show the three agent role definitions (Lead, Alpha, Bravo) and what each is calibrated to catch.
Run the full 3-agent consensus validation pipeline with unanimous gates.
Show the merged consensus configuration (.claude/consensus.local.md overlaid on defaults).
Run a single consensus gate check for one phase (no pipeline state advancement).
Use this agent when running a consensus gate check for the Alpha role — the code and logic specialist that reads implementation line-by-line for incorrect accumulation patterns, off-by-one errors, state machine bugs, race conditions, and API contract violations. Alpha is the detail-oriented auditor calibrated to catch the ChatViewModel += vs = bug and its siblings. Invoked exclusively by the consensus-pipeline skill in parallel with Lead and Bravo. <example> Context: The user ran `/consensus-run` and the pipeline reached the audit gate. user: "/consensus-run --target . --phases audit" assistant: "I'll invoke the Alpha agent via the Task tool for independent line-by-line logic validation of the audit phase, in parallel with Lead and Bravo." <commentary> Alpha reads every changed line. Its calibration is the += vs = class of bugs — correct-looking in isolation, broken at the interaction boundary. </commentary> </example> <example> Context: The user wants Alpha on a single-phase spot check. user: "/consensus-validate --target . --phase audit" assistant: "Invoking Alpha alongside Lead and Bravo via parallel Task calls for a one-shot audit gate check." <commentary> Same invocation pattern as /consensus-run but no fix cycle. </commentary> </example>
Use this agent when running a consensus gate check for the Bravo role — the systems and functional specialist that exercises the running system, verifies behavior under real conditions, catches regressions, and confirms fixes work in practice. Bravo is the only role with Bash because its mandate is to RUN the system (build, execute, curl, inspect output). Invoked exclusively by the consensus-pipeline skill in parallel with Lead and Alpha. <example> Context: The user ran `/consensus-run` and the pipeline reached the audit gate. user: "/consensus-run --target . --phases audit" assistant: "I'll invoke the Bravo agent via the Task tool for independent functional validation — it will run the target system and observe actual behavior, in parallel with Lead and Alpha." <commentary> Bravo does not just read code — it runs it. Use it when a FAIL should be a systems- level observation ("Four.Four." renders instead of "Four.") rather than a code-review finding. </commentary> </example> <example> Context: The user wants Bravo on a single-phase functional check. user: "/consensus-validate --target . --phase verify" assistant: "Invoking Bravo alongside Lead and Alpha via parallel Task calls — Bravo will execute the verify-phase runbook against the target." <commentary> Only Bravo has Bash. Security note: do not run Bravo against untrusted repos — its test commands could execute hostile code. </commentary> </example>
Use this agent when running a consensus gate check for the Lead role — the architecture and consistency specialist that validates cross-component coherence, pattern compliance, and regression detection at phase boundaries. Lead is invoked exclusively by the consensus-pipeline skill at gate checkpoints, in parallel with Alpha and Bravo, against a specific phase + target path. <example> Context: The user ran `/consensus-run` and the pipeline reached the audit gate. user: "/consensus-run --target . --phases audit" assistant: "I'll invoke the Lead agent via the Task tool for independent architectural validation of the audit phase, in parallel with Alpha and Bravo." <commentary> Lead is invoked exclusively at gate checkpoints inside the consensus-pipeline skill. Do not invoke Lead outside a gate context — it has no mandate for generic code review. </commentary> </example> <example> Context: The user wants to sanity-check a single phase without the full pipeline. user: "/consensus-validate --target . --phase audit" assistant: "Invoking Lead, Alpha, and Bravo in parallel via Task calls for a single audit-phase gate check." <commentary> Same invocation pattern as /consensus-run but one-shot, no fix cycle. </commentary> </example>
Uses power tools
Uses Bash, Write, or Edit tools
Share bugs, ideas, or general feedback.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Complete collection of battle-tested Claude Code configs from an Anthropic hackathon winner - agents, skills, hooks, and rules evolved over 10+ months of intensive daily use
Unity Development Toolkit - Expert agents for scripting/refactoring/optimization, script templates, and Agent Skills for Unity C# development
Complete creative writing suite with 10 specialized agents covering the full writing process: research gathering, character development, story architecture, world-building, dialogue coaching, editing/review, outlining, content strategy, believability auditing, and prose style/voice analysis. Includes genre-specific guides, templates, and quality checklists.
Comprehensive .NET development skills for modern C#, ASP.NET, MAUI, Blazor, Aspire, EF Core, Native AOT, testing, security, performance optimization, CI/CD, and cloud-native applications
20 SEO/GEO skills and 5 commands on one shared contract for keyword research, content creation, technical audits, schema markup, monitoring, quality gates, entity truth, and campaign memory.
Shannon Framework v6 — single-plugin consolidation replacing OMC + VF + Crucible + 13 others. 26 commands, 22 skills, 10 agents, 14 hooks across 7 domain modules + 4 enforcement layers.
Sharp-eyed visual-audit suite for Claude Code. Two coupled skills catch real UI defects — contrast failures, false affordances, modal opacity, contract mismatches — via real-system probes, zero mocks, evidence-cited verdicts.
Deepest-mode planning — consensus + gates + phase hierarchy + multi-plan tournament synthesis. Self-contained. Works with Claude Code and OpenCode.
Fixed-Point Deepen architecture of the Anneal plugin family. One plan, heated and cooled repeatedly — inline red team at every depth, Momus 0-100 scoring, convergence by variance/delta/cap.
Evidence-gated task planning, execution, and validation for Claude Code. Refuses completion without quorum-approved proof. No mocks. No stubs. No silent retries past the gate.
3-agent unanimous validation gate for Claude Code — Lead, Alpha, Bravo all vote PASS or the gate stays closed.
Live site: consensus.withagents.dev Field journal entry: withagents.dev/posts/post-02-multi-agent-consensus
Featured in the Agentic Development Blog series — Post #2: A Single AI Agent Said 'Looks Correct.' Three Agents Found the P2 Bug.
3-agent consensus validation with hard gates for Claude Code.
A framework where three specialized agents (Lead, Alpha, Bravo) independently validate work at phase gates. All three must vote PASS unanimously for the gate to open — no exceptions.
Built from catching a P2 streaming bug that a single-agent review explicitly approved.
┌─────────┐ ┌──────────────────────────────┐ ┌──────────┐
│ Phase │───▶│ Gate Checkpoint │───▶│ Phase │
│ N │ │ │ │ N+1 │
│ │ │ ┌──────┐ ┌───────┐ ┌──────┐│ │ │
│ (work) │ │ │ Lead │ │ Alpha │ │Bravo ││ │ (work) │
│ │ │ │ │ │ │ │ ││ │ │
│ │ │ │ PASS │ │ PASS │ │ PASS ││ │ │
│ │ │ └──────┘ └───────┘ └──────┘│ │ │
│ │ │ │ │ │
│ │ │ ALL 3 PASS → Gate Opens │ │ │
│ │ │ ANY FAIL → Fix Cycle │ │ │
└─────────┘ └──────────────────────────────┘ └──────────┘
Key properties:
Two distribution channels. Pick one — they produce identical evidence on the same target.
| Channel | Command | Use when |
|---|---|---|
| Claude Code plugin (primary) | claude plugin marketplace add krzemienski/multi-agent-consensus then claude plugin install multi-agent-consensus@multi-agent-consensus | You run Claude Code interactively. Native slash commands (/consensus-run, /consensus-validate, /consensus-roles, /consensus-report, /consensus-show-config), inherited auth, Task-tool agents. |
| Python CLI (pipx) | pipx install git+https://github.com/krzemienski/multi-agent-consensus.git | You need non-interactive CI runs, or you don't want to depend on Claude Code. Same model prompts, same evidence schema. |
Both require Claude Code CLI
installed and authenticated. The plugin uses CC's Task tool natively; the
Python CLI shells out to claude --print subprocesses.
Dev-mode (local testing, no install):
git clone https://github.com/krzemienski/multi-agent-consensus.git
claude -p --plugin-dir ./multi-agent-consensus "/consensus-run --target ./my-project --phases audit"
claude plugin uninstall multi-agent-consensuspipx uninstall multi-agent-consensus# Run the full consensus pipeline
consensus run --target ./my-project --phases "explore,audit,fix,verify"
# Run a single gate check
consensus validate --target ./my-project --phase audit
# View results from the last run
consensus report --target ./my-project
# Display role definitions
consensus roles
# Show current configuration
consensus show-config
Validates the whole. Cross-component consistency, pattern compliance, regression detection. Breaks ties when interpretation differs.
Catches: Contract mismatches between layers, pattern violations, fixes that break other components.
Reads implementation line by line. Incorrect accumulation patterns, off-by-one errors, state machine bugs, API contract violations.
Catches: The += vs = bug, state index resets, type mismatches at boundaries, missing error handlers.
Exercises the running system. UI behavior under real conditions, edge cases with actual data, regressions in previously working flows.
Catches: Runtime-only bugs, visual duplication, edge case failures, performance degradation.