Plugin

agentic-tdd

Name: agentic-tdd
Author: narailabs

Enforced Test-Driven Development with Claude Code agent teams. Fullstack unit pipeline (backend+frontend in one unit), TypeScript verification scripts enforce anti-cheat via Bash with tsc compilation checks built-in. Script output visible in conversation — cannot be fabricated.

npx claudepluginhub narailabs/narai-claude-plugins --plugin agentic-tdd

Component Overview

Skills

Component Details

Skills (1)

tdd

/tdd

Build features and full-stack apps using strict Test-Driven Development with agent teams, anti-cheat verification, and E2E browser testing. Always use this skill when the user wants to: build or implement something with tests, use TDD or test-driven development, implement a feature with "tests first" or "write tests before code", add test coverage to existing code, implement code against a failing test file, execute a multi-task implementation plan, build a full-stack app (backend + frontend), or invoke /tdd. Also use when the user mentions "red-green-refactor", "test-first", wants "no shortcuts" or "no cheating" in tests, asks to "resume" a TDD session, or wants comprehensive QA testing of their app. This skill handles everything from simple utilities to complex full-stack applications with React frontends and Express backends. Requires CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1.

README

agentic-tdd

Enforced Test-Driven Development for Claude Code using agent teams with anti-cheat guardrails.

What It Does

agentic-tdd decomposes feature specifications into work units, then builds each unit using a strict TDD agent pair:

Test Writer — writes failing tests from a spec contract (never sees implementation)
Code Writer — implements to make tests pass (never sees Test Writer's reasoning)
Spec Compliance Reviewer — verifies implementation matches the spec contract
Adversarial Reviewer — tries to break the tests and find cheating

Anti-cheat guardrails verify at each step:

RED: tests must fail before implementation exists
GREEN: tests must pass after implementation, and test files must be unchanged (checksum verified)
Assertion density, behavior-over-implementation checks, skip marker detection
5 documented testing anti-patterns are flagged automatically

Installation

# From NarAI marketplace
claude plugin install agentic-tdd@narai

# Direct from GitHub
claude plugin install narailabs/claude-agentic-tdd

Prerequisites

Enable agent teams in .claude/settings.json:

{
  "env": {
    "CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS": "1"
  }
}

Usage

Implement a new feature

/tdd implement a calculator with add, subtract, multiply, and divide operations

Add tests to existing code

/tdd add test coverage for src/services/

Implement against existing tests

/tdd implement against src/__tests__/auth.test.ts

Complex spec with design review

/tdd "build a payment system with Stripe integration" --design

Simple utility, skip design

/tdd "URL parsing helper" --skip-design

Flags

Flag	Description
`--skip-failed`	Skip work units that fail after max retries (default: escalate to user)
`--config <path>`	Use a custom `.tdd.config.json`
`--design`	Force the design gate even for simple specs
`--skip-design`	Skip the design gate entirely

How It Works

Phase 0: Design Gate (optional)

For complex or ambiguous specs, agentic-tdd runs a design refinement step — clarifying questions, trade-off analysis, and a design summary — before any code is written. Triggers automatically for multi-component specs or when --design is passed.

Phase 1: Framework Detection

Auto-detects your test framework from project files (package.json, pyproject.toml, go.mod, Cargo.toml, etc.). Supports 10+ languages. Falls back to asking you if detection fails.

Phase 2: Work Decomposition

Analyzes the spec and breaks it into independent work units with dependency tracking. Presents the plan for your confirmation before proceeding.

Phase 3: State Persistence

Creates .tdd-state.json for session resume. If interrupted, the next /tdd invocation detects the state file and offers to resume.

Phase 4: Agent Team Orchestration

For each work unit (parallel where dependencies allow):

Test Writer writes failing tests from the spec contract
RED verification confirms tests fail correctly (not syntax errors)
Code Writer implements to make tests pass (information barrier enforced)
GREEN verification confirms tests pass and test files are unchanged
Spec Compliance Review verifies the implementation matches the spec
Adversarial Review tries to break tests and catch cheating

Phase 5: Final Verification

Runs the full test suite across all units to catch integration issues. No completion claim without fresh test output as evidence.

Phase 6: Report

Generates tdd-report.md (human-readable summary) and tdd-session.jsonl (structured event log).

Phase 7: Cleanup

Shuts down agents, removes intermediate artifacts (spec-contract-*.md files), updates state.

Model Cost Optimization

The execution.modelStrategy config key controls agent model assignment:

Strategy	Behavior
`"auto"` (default)	Assess complexity per work unit and assign models accordingly
`"standard"`	Default model for all agents
`"fast"`	Cheapest capable model for all agents
`"capable"`	Most capable model for all agents

Configuration

`.tdd.config.json` (optional, project root)

{
  "framework": {
    "testRunner": "vitest",
    "testCommand": "npx vitest run"
  },
  "antiCheat": {
    "minAssertionsPerTest": 2,
    "maxRetries": 3,
    "maxMockDepth": 2,
    "flagPrivateMethodTests": true
  },
  "execution": {
    "maxParallelPairs": 3,
    "modelStrategy": "auto"
  },
  "reporting": {
    "generateReport": true,
    "generateSessionLog": true
  }
}

CLAUDE.md section (optional)

Add a ## TDD Configuration section to your project's CLAUDE.md with test conventions.

Entry Point Modes

View full README on GitHub

Similar Plugins

fullstack-dev-skills

8.6k

221

Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.

Stats

Version5.2.1

Stars1

MaintenanceExcellent

Last CommitApr 2, 2026

AddedMar 17, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

Available In

narai

agentic-tdd

Enforced Test-Driven Development for Claude Code using agent teams with anti-cheat guardrails.

What It Does

agentic-tdd decomposes feature specifications into work units, then builds each unit using a strict TDD agent pair:

Test Writer — writes failing tests from a spec contract (never sees implementation)
Code Writer — implements to make tests pass (never sees Test Writer's reasoning)
Spec Compliance Reviewer — verifies implementation matches the spec contract
Adversarial Reviewer — tries to break the tests and find cheating

Anti-cheat guardrails verify at each step:

RED: tests must fail before implementation exists
GREEN: tests must pass after implementation, and test files must be unchanged (checksum verified)
Assertion density, behavior-over-implementation checks, skip marker detection
5 documented testing anti-patterns are flagged automatically

Installation

# From NarAI marketplace
claude plugin install agentic-tdd@narai

# Direct from GitHub
claude plugin install narailabs/claude-agentic-tdd

Prerequisites

Enable agent teams in .claude/settings.json:

{
  "env": {
    "CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS": "1"
  }
}

Usage

Implement a new feature

/tdd implement a calculator with add, subtract, multiply, and divide operations

Add tests to existing code

/tdd add test coverage for src/services/

Implement against existing tests

/tdd implement against src/__tests__/auth.test.ts

Complex spec with design review

/tdd "build a payment system with Stripe integration" --design

Simple utility, skip design

/tdd "URL parsing helper" --skip-design

Flags

Flag	Description
`--skip-failed`	Skip work units that fail after max retries (default: escalate to user)
`--config <path>`	Use a custom `.tdd.config.json`
`--design`	Force the design gate even for simple specs
`--skip-design`	Skip the design gate entirely

How It Works

Phase 0: Design Gate (optional)

Phase 1: Framework Detection

Auto-detects your test framework from project files (package.json, pyproject.toml, go.mod, Cargo.toml, etc.). Supports 10+ languages. Falls back to asking you if detection fails.

Phase 2: Work Decomposition

Analyzes the spec and breaks it into independent work units with dependency tracking. Presents the plan for your confirmation before proceeding.

Phase 3: State Persistence

Creates .tdd-state.json for session resume. If interrupted, the next /tdd invocation detects the state file and offers to resume.

Phase 4: Agent Team Orchestration

For each work unit (parallel where dependencies allow):

Test Writer writes failing tests from the spec contract
RED verification confirms tests fail correctly (not syntax errors)
Code Writer implements to make tests pass (information barrier enforced)
GREEN verification confirms tests pass and test files are unchanged
Spec Compliance Review verifies the implementation matches the spec
Adversarial Review tries to break tests and catch cheating

Phase 5: Final Verification

Runs the full test suite across all units to catch integration issues. No completion claim without fresh test output as evidence.

Phase 6: Report

Generates tdd-report.md (human-readable summary) and tdd-session.jsonl (structured event log).

Phase 7: Cleanup

Shuts down agents, removes intermediate artifacts (spec-contract-*.md files), updates state.

Model Cost Optimization

The execution.modelStrategy config key controls agent model assignment:

Strategy	Behavior
`"auto"` (default)	Assess complexity per work unit and assign models accordingly
`"standard"`	Default model for all agents
`"fast"`	Cheapest capable model for all agents
`"capable"`	Most capable model for all agents

Configuration

`.tdd.config.json` (optional, project root)

{
  "framework": {
    "testRunner": "vitest",
    "testCommand": "npx vitest run"
  },
  "antiCheat": {
    "minAssertionsPerTest": 2,
    "maxRetries": 3,
    "maxMockDepth": 2,
    "flagPrivateMethodTests": true
  },
  "execution": {
    "maxParallelPairs": 3,
    "modelStrategy": "auto"
  },
  "reporting": {
    "generateReport": true,
    "generateSessionLog": true
  }
}

CLAUDE.md section (optional)

Add a ## TDD Configuration section to your project's CLAUDE.md with test conventions.

agentic-tdd

Component Overview

Component Details

Skills (1)

README

agentic-tdd

What It Does

Installation

Prerequisites

Usage

Implement a new feature

Add tests to existing code

Implement against existing tests

Complex spec with design review

Simple utility, skip design

Flags

How It Works

Phase 0: Design Gate (optional)

Phase 1: Framework Detection

Phase 2: Work Decomposition

Phase 3: State Persistence

Phase 4: Agent Team Orchestration

Phase 5: Final Verification

Phase 6: Report

Phase 7: Cleanup

Model Cost Optimization

Configuration

.tdd.config.json (optional, project root)

CLAUDE.md section (optional)

Entry Point Modes

Similar Plugins

fullstack-dev-skills

agentic-tdd

Component Overview

Component Details

Skills (1)

README

agentic-tdd

What It Does

Installation

Prerequisites

Usage

Implement a new feature

Add tests to existing code

Implement against existing tests

Complex spec with design review

Simple utility, skip design

Flags

How It Works

Phase 0: Design Gate (optional)

Phase 1: Framework Detection

Phase 2: Work Decomposition

Phase 3: State Persistence

Phase 4: Agent Team Orchestration

Phase 5: Final Verification

Phase 6: Report

Phase 7: Cleanup

Model Cost Optimization

Configuration

.tdd.config.json (optional, project root)

CLAUDE.md section (optional)

Entry Point Modes

Similar Plugins

fullstack-dev-skills

dotnet-skills

godot-skills

prompts.chat

agent-browser

impeccable

`.tdd.config.json` (optional, project root)

`.tdd.config.json` (optional, project root)