Use when you want to write tests first, enforce test-driven development, or add test coverage — 'write tests for this', 'TDD this feature', 'add test coverage'. Strict RED-GREEN-REFACTOR discipline.
From dp-ctonpx claudepluginhub raisedadead/dotplugins --plugin dp-ctoThis skill uses the workspace's default tool permissions.
references/anti-rationalization.mdreferences/good-tests.mdreferences/implementation-examples.mdreferences/red-flags.mdreferences/test-examples.mdreferences/verification-checklist.mdDesigns and optimizes AI agent action spaces, tool definitions, observation formats, error recovery, and context for higher task completion rates.
Implements structured self-debugging workflow for AI agent failures: capture errors, diagnose patterns like loops or context overflow, apply contained recoveries, and generate introspection reports.
Compares coding agents like Claude Code and Aider on custom YAML-defined codebase tasks using git worktrees, measuring pass rate, cost, time, and consistency.
Read ${CLAUDE_PLUGIN_ROOT}/shared/cto-boundaries.md
NO PRODUCTION CODE WITHOUT A FAILING TEST FIRST
Wrote code before the test? Delete it. Not "set aside." Not "use as reference." Delete. Implement fresh from the failing test.
Collect from the user:
If the user's description is vague, ask clarifying questions before dispatch. One question per message.
Dispatch tester via the Agent tool with subagent_type: "dp-cto:tester".
The agent prompt must include:
The tester agent follows strict RED-GREEN-REFACTOR discipline: write one failing test, verify RED, write minimal code to pass, verify GREEN, refactor. The agent definition enforces this cycle.
On agent completion, extract the structured receipt from <!-- RECEIPT:START/END --> markers. Validate:
tests_written — list of tests createdtests_watched_fail — every test must have been watched failing (RED verified)tests_passing — all tests green after implementationverification — command, exit code, and output previewIf tests_watched_fail is empty or does not match tests_written, reject the receipt. A test never seen failing is a test you do not trust.
Present the tester's results to the user: tests written, RED/GREEN verification evidence, and any unresolved issues from the receipt.
Read ${CLAUDE_SKILL_DIR}/references/test-examples.md before proceeding.
Read ${CLAUDE_SKILL_DIR}/references/implementation-examples.md before proceeding.
Read ${CLAUDE_SKILL_DIR}/references/anti-rationalization.md before proceeding.
Read ${CLAUDE_SKILL_DIR}/references/red-flags.md before proceeding.
Read ${CLAUDE_SKILL_DIR}/references/good-tests.md before proceeding.
Read ${CLAUDE_SKILL_DIR}/references/verification-checklist.md before proceeding.