Plugin

multi-agent-consensus

3-agent unanimous consensus validation with hard gates for Claude Code. Maps Lead/Alpha/Bravo roles to CC subagents, enforces unanimity at phase transitions, persists evidence per phase+role.

What's Inside

Slash Commands5

Arguments

/consensus-report

Generate a report from the last consensus pipeline run (reads state.json + evidence manifest).

Execution

/consensus-roles

Show the three agent role definitions (Lead, Alpha, Bravo) and what each is calibrated to catch.

Arguments

/consensus-run

Run the full 3-agent consensus validation pipeline with unanimous gates.

Execution

/consensus-show-config

Show the merged consensus configuration (.claude/consensus.local.md overlaid on defaults).

Arguments

/consensus-validate

Run a single consensus gate check for one phase (no pipeline state advancement).

Agents3

alpha

/alpha

Use this agent when running a consensus gate check for the Alpha role — the code and logic specialist that reads implementation line-by-line for incorrect accumulation patterns, off-by-one errors, state machine bugs, race conditions, and API contract violations. Alpha is the detail-oriented auditor calibrated to catch the ChatViewModel += vs = bug and its siblings. Invoked exclusively by the consensus-pipeline skill in parallel with Lead and Bravo. <example> Context: The user ran `/consensus-run` and the pipeline reached the audit gate. user: "/consensus-run --target . --phases audit" assistant: "I'll invoke the Alpha agent via the Task tool for independent line-by-line logic validation of the audit phase, in parallel with Lead and Bravo." <commentary> Alpha reads every changed line. Its calibration is the += vs = class of bugs — correct-looking in isolation, broken at the interaction boundary. </commentary> </example> <example> Context: The user wants Alpha on a single-phase spot check. user: "/consensus-validate --target . --phase audit" assistant: "Invoking Alpha alongside Lead and Bravo via parallel Task calls for a one-shot audit gate check." <commentary> Same invocation pattern as /consensus-run but no fix cycle. </commentary> </example>

bravo

/bravo

Use this agent when running a consensus gate check for the Bravo role — the systems and functional specialist that exercises the running system, verifies behavior under real conditions, catches regressions, and confirms fixes work in practice. Bravo is the only role with Bash because its mandate is to RUN the system (build, execute, curl, inspect output). Invoked exclusively by the consensus-pipeline skill in parallel with Lead and Alpha. <example> Context: The user ran `/consensus-run` and the pipeline reached the audit gate. user: "/consensus-run --target . --phases audit" assistant: "I'll invoke the Bravo agent via the Task tool for independent functional validation — it will run the target system and observe actual behavior, in parallel with Lead and Alpha." <commentary> Bravo does not just read code — it runs it. Use it when a FAIL should be a systems- level observation ("Four.Four." renders instead of "Four.") rather than a code-review finding. </commentary> </example> <example> Context: The user wants Bravo on a single-phase functional check. user: "/consensus-validate --target . --phase verify" assistant: "Invoking Bravo alongside Lead and Alpha via parallel Task calls — Bravo will execute the verify-phase runbook against the target." <commentary> Only Bravo has Bash. Security note: do not run Bravo against untrusted repos — its test commands could execute hostile code. </commentary> </example>

lead

/lead

Use this agent when running a consensus gate check for the Lead role — the architecture and consistency specialist that validates cross-component coherence, pattern compliance, and regression detection at phase boundaries. Lead is invoked exclusively by the consensus-pipeline skill at gate checkpoints, in parallel with Alpha and Bravo, against a specific phase + target path. <example> Context: The user ran `/consensus-run` and the pipeline reached the audit gate. user: "/consensus-run --target . --phases audit" assistant: "I'll invoke the Lead agent via the Task tool for independent architectural validation of the audit phase, in parallel with Alpha and Bravo." <commentary> Lead is invoked exclusively at gate checkpoints inside the consensus-pipeline skill. Do not invoke Lead outside a gate context — it has no mandate for generic code review. </commentary> </example> <example> Context: The user wants to sanity-check a single phase without the full pipeline. user: "/consensus-validate --target . --phase audit" assistant: "Invoking Lead, Alpha, and Bravo in parallel via Task calls for a single audit-phase gate check." <commentary> Same invocation pattern as /consensus-run but one-shot, no fix cycle. </commentary> </example>

Skills1

consensus-pipeline

/consensus-pipeline

This skill should be invoked when the user asks to "run a consensus pipeline", "validate a phase with consensus", "run consensus gates", "check with three agents", "unanimous validation", "audit with Lead Alpha Bravo", or invokes /consensus-run or /consensus-validate. Orchestrates the three-agent unanimous-gate phase pipeline: for each configured phase, invoke Lead, Alpha, and Bravo in parallel, collect independent PASS or FAIL votes, enforce unanimity, trigger fix cycles on any FAIL with re-validation by all three agents, and persist evidence per phase and role. Works for CLI, web, API, and iOS targets — the triad validates the target system independent of its platform.

Hooks1

Event Hooks

1 hook across 1 event

Stats

Version0.2.0

LanguagePython

Stars1

MaintenanceExcellent

LicenseMIT

Last CommitMay 2, 2026

AddedApr 21, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

Available In

multi-agent-consensus1

Safety Signals

Caution

Uses power tools

Uses Bash, Write, or Edit tools

Multi-Agent Consensus

3-agent unanimous validation gate for Claude Code — Lead, Alpha, Bravo all vote PASS or the gate stays closed.

Live site: consensus.withagents.dev Field journal entry: withagents.dev/posts/post-02-multi-agent-consensus

Featured in the Agentic Development Blog series — Post #2: A Single AI Agent Said 'Looks Correct.' Three Agents Found the P2 Bug.

Send date: Mon May 18, 2026
LinkedIn: link added on send day
Canonical blog post: https://ai.hack.ski/blog/
Series hub: agentic-development-guide

3-agent consensus validation with hard gates for Claude Code.

A framework where three specialized agents (Lead, Alpha, Bravo) independently validate work at phase gates. All three must vote PASS unanimously for the gate to open — no exceptions.

Built from catching a P2 streaming bug that a single-agent review explicitly approved.

The Pattern

┌─────────┐    ┌──────────────────────────────┐    ┌──────────┐
│  Phase   │───▶│       Gate Checkpoint         │───▶│  Phase   │
│    N     │    │                              │    │   N+1    │
│          │    │  ┌──────┐ ┌───────┐ ┌──────┐│    │          │
│  (work)  │    │  │ Lead │ │ Alpha │ │Bravo ││    │  (work)  │
│          │    │  │      │ │       │ │      ││    │          │
│          │    │  │ PASS │ │ PASS  │ │ PASS ││    │          │
│          │    │  └──────┘ └───────┘ └──────┘│    │          │
│          │    │                              │    │          │
│          │    │  ALL 3 PASS → Gate Opens     │    │          │
│          │    │  ANY FAIL  → Fix Cycle       │    │          │
└─────────┘    └──────────────────────────────┘    └──────────┘

Key properties:

Independent verification: Each agent starts fresh — no shared findings until vote
Hard gates: 2/3 does not pass. Unanimity or nothing.
Re-validation after fixes: ALL agents re-validate, not just the failing one

Install

Two distribution channels. Pick one — they produce identical evidence on the same target.

Channel	Command	Use when
Claude Code plugin (primary)	`claude plugin marketplace add krzemienski/multi-agent-consensus` then `claude plugin install multi-agent-consensus@multi-agent-consensus`	You run Claude Code interactively. Native slash commands (`/consensus-run`, `/consensus-validate`, `/consensus-roles`, `/consensus-report`, `/consensus-show-config`), inherited auth, Task-tool agents.
Python CLI (pipx)	`pipx install git+https://github.com/krzemienski/multi-agent-consensus.git`	You need non-interactive CI runs, or you don't want to depend on Claude Code. Same model prompts, same evidence schema.

Both require Claude Code CLI installed and authenticated. The plugin uses CC's Task tool natively; the Python CLI shells out to claude --print subprocesses.

Dev-mode (local testing, no install):

git clone https://github.com/krzemienski/multi-agent-consensus.git
claude -p --plugin-dir ./multi-agent-consensus "/consensus-run --target ./my-project --phases audit"

Uninstall

Plugin: claude plugin uninstall multi-agent-consensus
Python CLI: pipx uninstall multi-agent-consensus

Quick Start

# Run the full consensus pipeline
consensus run --target ./my-project --phases "explore,audit,fix,verify"

# Run a single gate check
consensus validate --target ./my-project --phase audit

# View results from the last run
consensus report --target ./my-project

# Display role definitions
consensus roles

# Show current configuration
consensus show-config

The Three Roles

Lead — Architecture & Consistency Specialist

Validates the whole. Cross-component consistency, pattern compliance, regression detection. Breaks ties when interpretation differs.

Catches: Contract mismatches between layers, pattern violations, fixes that break other components.

Alpha — Code & Logic Specialist

Reads implementation line by line. Incorrect accumulation patterns, off-by-one errors, state machine bugs, API contract violations.

Catches: The += vs = bug, state index resets, type mismatches at boundaries, missing error handlers.

Bravo — Systems & Functional Specialist

Exercises the running system. UI behavior under real conditions, edge cases with actual data, regressions in previously working flows.

Catches: Runtime-only bugs, visual duplication, edge case failures, performance degradation.

multi-agent-consensus

What's Inside

multi-agent-consensus

Popularity

What's Inside

Confidence

README

Multi-Agent Consensus

Related Post

The Pattern

Install

Uninstall

Quick Start

The Three Roles

Lead — Architecture & Consistency Specialist

Alpha — Code & Logic Specialist

Bravo — Systems & Functional Specialist

The Bug That Proved the Pattern

Similar Plugins

everything-claude-code

supergraph

creative-writing

unity-dev-toolkit

dotnet-skills

claude-seo

More by krzemienski

shannon

lynx

deepest-plan

crucible

anneal-temper

Multi-Agent Consensus

Related Post

The Pattern

Install

Uninstall

Quick Start

The Three Roles

Lead — Architecture & Consistency Specialist

Alpha — Code & Logic Specialist

Bravo — Systems & Functional Specialist

The Bug That Proved the Pattern

Popularity

Health & Quality

More by krzemienski

shannon

lynx

deepest-plan

crucible

anneal-temper

Similar Plugins

everything-claude-code

supergraph

creative-writing

unity-dev-toolkit

dotnet-skills

claude-seo