Marketplace

harness

SDD (Spec-Driven Development) harness for high-quality AI engineering

npx claudepluginhub zxdxjtu/harness

README

1 Plugin

harness

7·

SDD (Spec-Driven Development) harness for high-quality AI engineering. Guides spec → test → implement → verify workflow with auto-loop sprint execution.

1mo

v0.2.0

zxdxjtu

Related Marketplaces

antigravity-awesome-skills

40.4K

38plugins

Claude Code marketplace entries for the plugin-safe Antigravity Awesome Skills library and its compatible editorial bundles.

claude-code-workflows

36.4K

22plugins

Production-ready workflow orchestration with 84 marketplace plugins, 192 local specialized agents, and 156 local skills - optimized for granular installation and minimal token usage

claude-plugins-official

29.9K

186plugins

Directory of popular Claude Code extensions including development tools, productivity plugins, and MCP integrations

Stats

Plugins1

Stars7

UpdatedApr 24, 2026

Links

View on GitHub View Marketplace JSON

Help us improve

Share bugs, ideas, or general feedback.

Stats

Links

Help us improve

Share bugs, ideas, or general feedback.

Harness — Spec-Driven Development Plugin for Claude Code

A Claude Code marketplace plugin that enforces a structured Spec → Test → Implement → Verify pipeline for high-quality AI-assisted software engineering.

Install

/plugin marketplace add zxdxjtu/harness

Quick Start

/proposal "Add user authentication with JWT"   # Generate spec + design
/tdd-align F001                                  # Generate tests (all RED)
/decompose F001                                  # Break into atomic tasks
/sprint F001                                     # Auto-execute until done

Pipeline

Standard:
/proposal → /tdd-align → /decompose → /sprint → /verify
   Spec       Tests        Tasks       Execute    Verify
  (human)    (human)      (human)      (auto)    (auto)

Clone/Replicate (with adversarial evaluation):
/baseline → /proposal → ... → /sprint → /evaluate → /eval-fix
  Capture     Spec              Execute   Evaluate    Fix Loop
  (auto)     (human)            (auto)    (Evaluator) (GAN loop)

Phase 1: Spec (`/proposal`)

Interactive dialogue to produce a complete, unambiguous spec with zero-decision-point checklist. Human approves before proceeding.

Phase 2: Test Alignment (`/tdd-align`)

Generate three-layer tests from spec:

V1: Unit tests (logic validation)
V2: Integration tests (module interaction)
V3: E2E tests (user perspective)

All tests start RED. Human approves the test contract.

Phase 3: Decompose (`/decompose`)

Break tests into atomic tasks (≤2h each), analyze dependencies, assign parallel execution waves. Human approves the task DAG.

Phase 4: Sprint (`/sprint`)

Automatic loop execution — powered by a Stop Hook that keeps Claude running until all tasks complete:

Parallel execution via worktree-isolated agents
Regression testing after each wave
Background code quality guardians
Doom loop detection (stops on repeated failures)
Context compression between waves

Phase 5: Verify (`/verify`)

V1 → V2 → V3 staged verification with evidence package.

Adversarial Evaluation (Clone scenarios)

Inspired by GAN-style adversarial design — separate Generator and Evaluator agents to prevent self-assessment bias.

/baseline <url> — Evaluator Agent explores the reference product via Playwright MCP, captures screenshots, interaction flows, and visual specs into .harness/baseline/.

/evaluate <id> --ref-url <url> --dev-url <url> — Independent Evaluator compares reference vs development product across 4 dimensions:

Dimension	Weight
Functional Completeness	40%
Interaction Consistency	25%
Visual Fidelity	20%
Technical Quality	15%

/eval-fix <id> — GAN-style fix loop: Generator fixes gaps → Evaluator re-scores → repeat until convergence or stagnation.

Commands

Command	Description
`/proposal [desc]`	Start new feature (interactive spec generation)
`/tdd-align <id>`	Generate three-layer tests from spec
`/decompose <id>`	Break into atomic task DAG
`/sprint <id>`	Auto-loop execute all tasks
`/verify <id>`	Three-stage verification
`/harness-status`	Check current state and next step
`/baseline <url>`	Capture reference product baseline (Playwright MCP)
`/evaluate <id>`	Adversarial comparison: reference vs dev product
`/eval-fix <id>`	GAN-style fix-evaluate loop until convergence
`/cancel-sprint`	Stop active sprint loop
`/help`	Show documentation

Project State

Harness stores state in .harness/ (auto-created):

.harness/
├── specs/          # Feature specifications
├── designs/        # Design documents
├── baseline/       # Reference product baseline (clone scenarios)
│   ├── baseline-report.md
│   ├── screenshots/
│   └── features/
├── tasks.md        # Task DAG
├── progress.md     # Progress log
├── evidence/       # Verification + evaluation evidence
│   └── FXXX/
│       ├── eval-report.md
│       ├── eval-screenshots/
│       └── eval-loop-state.md
└── sprint-loop.md  # Sprint state (runtime)

Add .harness/ to .gitignore or commit it — your choice.

Use Cases

New projects: Full pipeline from idea to delivery
Cloning products: Baseline capture → spec → pipeline → adversarial evaluation
Incremental features: Add features to existing codebases

Core Principles

Spec → Test → Code — Tests are the alignment contract
Atomic Tasks — Each independently implementable and testable
Closed-Loop Verification — Evidence-driven delivery
Context Discipline — File system as lossless memory

License

MIT

harness

README

1 Plugin

harness

Related Marketplaces

antigravity-awesome-skills

claude-code-workflows

claude-plugins-official

Help us improve

Help us improve

Find plugins for your project

harness

README

Harness — Spec-Driven Development Plugin for Claude Code

Install

Quick Start

Pipeline

Phase 1: Spec (/proposal)

Phase 2: Test Alignment (/tdd-align)

Phase 3: Decompose (/decompose)

Phase 4: Sprint (/sprint)

Phase 5: Verify (/verify)

Adversarial Evaluation (Clone scenarios)

Commands

Project State

Use Cases

Core Principles

License

1 Plugin

harness

Related Marketplaces

antigravity-awesome-skills

claude-code-workflows

claude-plugins-official

Help us improve

Harness — Spec-Driven Development Plugin for Claude Code

Install

Quick Start

Pipeline

Phase 1: Spec (/proposal)

Phase 2: Test Alignment (/tdd-align)

Phase 3: Decompose (/decompose)

Phase 4: Sprint (/sprint)

Phase 5: Verify (/verify)

Adversarial Evaluation (Clone scenarios)

Commands

Project State

Use Cases

Core Principles

License

Phase 1: Spec (`/proposal`)

Phase 2: Test Alignment (`/tdd-align`)

Phase 3: Decompose (`/decompose`)

Phase 4: Sprint (`/sprint`)

Phase 5: Verify (`/verify`)

Phase 1: Spec (`/proposal`)

Phase 2: Test Alignment (`/tdd-align`)

Phase 3: Decompose (`/decompose`)

Phase 4: Sprint (`/sprint`)

Phase 5: Verify (`/verify`)