Skill

tdd

Enforces strict test-driven development for behavior changes: requires verified RED before production code, minimal GREEN implementation, and refactor only after passing tests.

testing

Popularity

Parent stars

Parent forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/supergraph:tdd

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Strict TDD for features, bug fixes, refactors.

SKILL.md

161 lines · ~1.3k tokens

Stats

LanguageHTML

Parent stars11

Parent forks1

MaintenanceExcellent

Last CommitJun 19, 2026

Actions

View Source View Plugin View on GitHub View README

/supergraph:tdd

Strict TDD for features, bug fixes, refactors.

Iron Law: NO PRODUCTION CODE WITHOUT A FAILING TEST FIRST

Delete means delete: Production code written before verified failing test → delete and restart from RED.

When

Full TDD: new features, behavior changes, refactoring. Fast TDD: bug fixes (add regression test → RED verify → minimal fix). Ask to skip: config-only, generated code, throwaway prototype, docs-only.

State Machine (in order, never skip)

needs_test → red_verified → green_verified → refactor_allowed → complete

Steps

0. Announce

"🔴 /supergraph:tdd — TDD: [full|fast] for [behavior]..."

1. Identify One Behavior

Behavior: [single externally visible behavior]
Test file: [path] | Test name: [name]
Command: [focused test command]
Expected RED: [why it should fail before implementation]

One behavior per test. Public behavior, not internals. Real code over mocks.

2. RED — Write + Verify in One Round

Write one failing test, then run immediately:

<write test> && $TEST_CMD <focused command>

Valid RED: fails for the expected missing behavior, not syntax/import/typo.

Record evidence:

## TDD Evidence — RED
- RED: `[command]` → FAIL ([expected missing behavior])

Serena diagnostics (optional): mcp__serena__get_diagnostics_for_file(file=<test_file>) — confirm no type errors mask the real missing behavior. Skip if Serena unavailable.

Invalid RED → fix test setup, don't write production code yet.

3. GREEN — Minimal Implementation

Write only enough code to pass the test. No abstractions, no cleanup, no extra features.

Delete any production code written before RED.

Serena surgery (optional): Prefer mcp__serena__replace_symbol_body(symbol=<fn>) over raw file edits for targeted function-body implementation — exact, no risk of touching surrounding code. Skip if Serena unavailable.

4. GREEN Verify

Run focused test → broader suite:

## TDD Evidence — GREEN
- GREEN: `[command]` → PASS
- Suite: PASS

Serena diagnostics (optional): mcp__serena__get_diagnostics_for_file(file=<source_file>) — confirm no new type errors introduced by implementation. Skip if Serena unavailable.

Failing test → fix code, not test. Other tests fail → fix now.

5. REFACTOR — Only After GREEN

Rename, deduplicate, extract. No behavior changes. Re-run tests.

Before any rename (optional): Enumerate all callers first to confirm scope:

mcp__serena__find_referencing_symbols(symbol=<symbol_to_rename>)

Then use mcp__serena__rename_symbol(old=<name>, new=<name>) for safe codebase-wide rename instead of grep/replace. Skip if Serena unavailable — fall back to manual grep + Edit.

6. Complete

Before marking complete:

## TDD Complete
- Behavior: [behavior]
- Mode: full|fast
- RED verified: yes | GREEN verified: yes | Refactor: yes|none
- Tests: PASS

7. Report

Per behavior: ✅ /supergraph:tdd — behavior N: [brief]

Final:

✅ /supergraph:tdd complete
- Behaviors: N | Mode: full|fast | Tests: PASS | Lint: PASS
- Next: /supergraph:fix → /supergraph:verify → /supergraph:review

Fast TDD Path (Bug Fixes)

For bugs, skip full TDD ceremony:

Write one regression test that reproduces the bug
Verify it fails (RED) — the bug is proved
Write minimal fix only (GREEN)
Verify test passes — bug is fixed
No REFACTOR needed unless specified

Plan Integration

Plans must include TDD metadata per behavior task:

TDD:
- Behavior: [single behavior] | Test file: [path] | Test name: [name]
- RED command: `[focused test command]` | Expected RED failure: [missing behavior]
- Minimal GREEN change: [smallest implementation] | Mocking: none | [why unavoidable]

Executor Enforcement

No production edits before red_verified
Stop if RED passes immediately or fails for wrong reason
Allow implementation only after valid RED, refactor only after GREEN

Review Triggers — Reject When

Tests after implementation | No RED evidence | RED passed immediately | RED failed for wrong reason
Implementation exceeds tested behavior | Bug fix lacks regression test
Tests assert internals | Mocks hide integration risk

Anti-Patterns — Stop & Return to RED

Symptom	Fix
Production code before failing test	Delete, start RED
Tests after implementation	Next behavior test-first, remove untested code
"Too small to test"	Write one-liner test
Immediate pass accepted as RED	Test is wrong — revise
Pre-written implementation kept as reference	Delete, start fresh
Over-building before tests need it	YAGNI — remove

Mock Gate — Answer Before Mocking

What behavior is under test?
Is this mock isolating an external, slow, or flaky boundary?
What side effects does the real dependency provide?
Would this test fail if real behavior broke?

Reject tests that only prove mocks exist or were called.

tdd

Popularity

Invocation

Context Preview

SKILL.md

tdd

Popularity

Invocation

Context Preview

SKILL.md

/supergraph:tdd

When

State Machine (in order, never skip)

Steps

0. Announce

1. Identify One Behavior

2. RED — Write + Verify in One Round

3. GREEN — Minimal Implementation

4. GREEN Verify

5. REFACTOR — Only After GREEN

6. Complete

7. Report

Fast TDD Path (Bug Fixes)

Plan Integration

Executor Enforcement

Review Triggers — Reject When

Anti-Patterns — Stop & Return to RED

Mock Gate — Answer Before Mocking

Similar Skills

/supergraph:tdd

When

State Machine (in order, never skip)

Steps

0. Announce

1. Identify One Behavior

2. RED — Write + Verify in One Round

3. GREEN — Minimal Implementation

4. GREEN Verify

5. REFACTOR — Only After GREEN

6. Complete

7. Report

Fast TDD Path (Bug Fixes)

Plan Integration

Executor Enforcement

Review Triggers — Reject When

Anti-Patterns — Stop & Return to RED

Mock Gate — Answer Before Mocking

Similar Skills