By alexsds
Agent-Driven Engineering: Anthropic's 3-agent harness with pluggable rubrics and testing tools for long-running app development
npx claudepluginhub alexsds/ade-workflow --plugin ade

/ade:done — Archive the current plan after completion
/ade:execute — Launch Generator + Evaluator agent team to execute the approved plan
/ade:plan — Research context, ask questions, and create a plan scaled to the scope of the work
/ade:status — Show current ADE build progress and evaluator scores
Use this agent as a team member during /ade:execute to test and score features implemented by the generator. The evaluator uses pluggable rubrics and testing tools to grade work with hard thresholds.

<example>
Context: The execute command is launching the agent team.
user: "Build the approved plan"
assistant: "Spawning the ade-evaluator as a team member to test and score features as the generator implements them."
<commentary>
The evaluator is spawned alongside the generator. It waits for features to review, loads rubrics, tests them, and sends scored feedback.
</commentary>
</example>

<example>
Context: Generator messaged that a feature is ready for review.
user: "Feature: User Registration. Status: Ready for review."
assistant: "The evaluator loads relevant rubrics, tests the feature, scores each criterion, and sends detailed feedback."
<commentary>
Evaluator receives handoff from generator, runs adversarial testing, scores against rubric thresholds, and reports back.
</commentary>
</example>
Use this agent as a team member during /ade:execute to implement features from an approved plan. The generator builds the app feature-by-feature, commits to git, and hands off to the evaluator for scoring.

<example>
Context: The execute command is launching the agent team.
user: "Build the approved plan"
assistant: "Spawning the ade-generator as a team member to implement features from the plan."
<commentary>
The generator is spawned as part of an agent team alongside the evaluator. It implements features and messages the evaluator for review.
</commentary>
</example>

<example>
Context: Generator received feedback from evaluator and needs to iterate.
user: "Evaluator scored originality 5/10 — needs distinctive color palette"
assistant: "The generator iterates on the feature based on evaluator feedback until all rubric scores pass."
<commentary>
Generator accepts evaluator feedback without argument and iterates until all criteria pass threshold.
</commentary>
</example>
This skill should be used when the user asks about evaluation methodology, scoring rubrics, testing tools, "how does scoring work", "evaluation criteria", "rubric format", "add a rubric", "create testing tool", "evaluate my feature", "run evaluation", "why did evaluation fail", or needs guidance on adversarial evaluation, graded scoring, or hard thresholds in the ADE workflow. Make sure to use this skill whenever the user mentions quality assessment, code review scoring, feature validation, or wants to understand why a feature passed or failed evaluation.
Use when the user asks about the ADE build process, how the generator works, iteration strategy, "why is the generator doing X", "how does building work", "commit conventions", "pivot vs refine", or understanding the implementation phase of the ADE workflow. This skill covers the Generator's methodology for implementing features from approved plans, including the 4-phase build cycle and strategic iteration.
Use when the user wants to plan any work — building an app, adding a feature, fixing a bug, solving a problem, refactoring, or any task that benefits from thinking before doing. Triggers on "plan", "build me", "I want to", "fix this", "add a", "we need to", "how should we", or when the user describes something they want done. This skill guides interactive discovery, research, and planning scaled to the scope of the work — from a quick task plan to a full product spec.
A Claude Code plugin implementing Anthropic's recommended 3-agent harness for long-running application development.
Based on: Harness Design for Long-Running Apps
/ade:plan "Build a task management app"
→ Planner researches context, asks questions with suggested answers
→ Creates a plan scaled to scope (app, feature, task, bug)
→ You review and approve
/ade:execute
→ Generator implements deliverables one by one
→ Evaluator tests and scores each against rubrics
→ They iterate until all criteria pass
/ade:done
→ Archives the completed plan
| Agent | Role | Key Behavior |
|---|---|---|
| Planner | Interactive planning | Researches → asks questions → writes plan scaled to scope |
| Generator | Implementation | Builds deliverable-by-deliverable, commits to git |
| Evaluator | Adversarial QA | Scores against rubrics with hard thresholds, can't modify code |
The planner adapts to the scope of the work:
| Scope | Examples | Questions | Plan Structure |
|---|---|---|---|
| Large | Full app, new product | 3-5+ | Phased features + user stories |
| Medium | New feature, integration | 1-3 | Deliverables + acceptance criteria |
| Small | Bug fix, task, refactor | 0-1 | Goal + what to change + done when |
The planner always researches before asking questions — exploring the codebase for existing projects or searching for similar products for greenfield work.
Skills are the source of truth for methodology. Agents are thin execution shells that reference skills for guidance.
skills/ → methodology, knowledge, the "why" and "how"
agents/ → execution shells that read skills
commands/ → user-facing entry points that invoke agents
rubrics/ → evaluation criteria with scored thresholds
testing-tools/ → testing configurations for the evaluator
Default rubrics in rubrics/:
frontend-design.md — UI quality, originality, craft, functionality
code-architecture.md — separation of concerns, clarity, error handling, testability
api-quality.md — API design, responses, validation, security
ux-flows.md — flow coherence, edge cases, information architecture, feedback

Add custom rubrics by dropping .md files in .ade/rubrics/ in your project. Project rubrics override plugin defaults with the same filename.
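As a sketch of what a custom rubric could look like — the file name, criteria, and threshold below are hypothetical, not shipped with the plugin; the default rubrics in rubrics/ are the authoritative reference for the format the evaluator expects:

```markdown
<!-- .ade/rubrics/accessibility.md (hypothetical example) -->
# Accessibility

<!-- Assumed convention: each criterion is scored 1-10 with a hard pass threshold -->
Threshold: 7/10 per criterion

## Criteria
- Keyboard navigation: every interactive element is reachable and operable without a mouse
- Contrast: text meets WCAG AA contrast ratios
- Semantics: headings, landmarks, and form labels are used correctly
- Feedback: focus states and validation errors are clearly perceivable
```

Because project rubrics override plugin defaults by filename, a project copy of frontend-design.md would replace the built-in one entirely rather than merge with it.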
Default tools in testing-tools/:
playwright.md — browser testing via Playwright MCP (auto-configured, falls back to curl)
api-tester.md — HTTP endpoint testing via curl
unit-test-runner.md — test suite execution (auto-detects framework)

Add custom tools by dropping .md files in .ade/testing-tools/ in your project.
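A custom testing tool follows the same pattern: a markdown file telling the evaluator how to exercise a feature. A minimal sketch, with the file name, structure, and pass criteria all assumed rather than taken from the plugin:

```markdown
<!-- .ade/testing-tools/load-smoke.md (hypothetical example) -->
# Load Smoke Test

Fire 50 sequential requests at the feature's main endpoint with curl,
recording status codes and response times.

- Pass: every response is 2xx and the slowest response is under 500 ms
- Fail: any 5xx response, or any response slower than 500 ms
```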
Project settings in .claude/ade.local.md:
---
commits_style: conventional # conventional | jira
---
| Command | Description |
|---|---|
/ade:plan [anything] | Research, ask questions, create plan scaled to scope |
/ade:execute | Launch Generator + Evaluator team |
/ade:done | Archive completed plan |
/ade:status | Show build progress |
Inside a Claude Code session, add the marketplace:
/plugin marketplace add alexsds/ade-workflow
Then install the plugin:
/plugin install ade@alexsds-ade-workflow
Or run /plugin to open the interactive plugin manager.
The three-agent structure follows Anthropic's research on harness design for long-running agents (linked above).
License: MIT