Executes strict TDD cycles: parses tasks from tasks.md, writes failing tests (red), implements minimal code to pass (green), refactors, generates cycle-report.md. Handles retries and [EXTEND]/[MODIFY] tasks.
From humaninloopnpx claudepluginhub deepeshbodh/human-in-loop --plugin humaninloopThis skill uses the workspace's default tool permissions.
references/CYCLE-REPORT-FORMAT.mdreferences/TASK-PARSING.mdreferences/TDD-ANTI-RATIONALIZATION.mdSearches, retrieves, and installs Agent Skills from prompts.chat registry using MCP tools like search_skills and get_skill. Activates for finding skills, browsing catalogs, or extending Claude.
Searches prompts.chat for AI prompt templates by keyword or category, retrieves by ID with variable handling, and improves prompts via AI. Use for discovering or enhancing prompts.
Executes pre-written implementation plans: critically reviews, follows bite-sized steps exactly, runs verifications, tracks progress with checkpoints, uses git worktrees, stops on blockers.
Transform a cycle task list into implemented code via strict red/green/refactor discipline. Parse tasks from tasks.md, write failing tests first, implement code to pass them, refactor, mark tasks complete, and produce a structured cycle-report.md. This skill governs both normal cycle execution and retry/fix modes.
Violating the letter of the rules is violating the spirit of the rules.
TDD discipline exists to catch failures before they compound. Every shortcut in this process is a regression waiting to happen.
tasks.md[EXTEND] or [MODIFY] markers (invoke brownfield-integration skill alongside)Execute in strict order. No skipping steps. No reordering.
1. Parse Cycle Tasks
Extract the task list for the current cycle from tasks.md. See references/TASK-PARSING.md for parsing rules.
For each task, extract:
T{N}.{X})[EXTEND] or [MODIFY] markers2. Red Phase — Write Failing Tests
For each task that specifies a test:
3. Green Phase — Implement Code
For each implementation task:
[EXTEND] tasks: read the existing file first, follow existing patterns (invoke brownfield-integration skill)[MODIFY] tasks: read the existing file first, change only what the task specifies (invoke brownfield-integration skill)4. Refactor Phase
After tests pass:
5. Mark Tasks Complete
Update tasks.md: change - [ ] to - [x] for each completed task in this cycle.
6. Write Cycle Report
Produce cycle-report.md following the format in references/CYCLE-REPORT-FORMAT.md.
[x] in tasks.md immediately after completing itcycle-report.md with YAML frontmatter and prose sectionsWhen dispatched after a checkpoint failure:
- [x] back to - [ ] in tasks.mdcycle-report.md with updated attempt numberWhen dispatched after final-validation failure:
cycle-report.md with cycle: fix in frontmatterIf any of these thoughts arise, STOP immediately:
All of these mean: Rationalization in progress. Return to the execution sequence. Follow every step.
No exceptions:
See references/TDD-ANTI-RATIONALIZATION.md for the full rationalization table.
What goes wrong: Tests become retroactive justification. They pass because they were written to match the code, not because the code satisfies the requirement.
Fix: Always write the test first. Run it. Verify it fails for the right reason. Then implement.
What goes wrong: Working code gets rewritten, introducing new bugs. Token budget wasted on already-complete tasks.
Fix: Trace failures to specific tasks. Re-open only those tasks. Leave passing code untouched.
What goes wrong: "While I'm here" changes accumulate. New bugs appear in code that was working. Cycle report doesn't reflect actual changes.
Fix: Refactor phase is limited to code introduced in this cycle. Note improvement opportunities in the cycle report's "Notes for Next Cycle" section.
What goes wrong: Test fails due to syntax error or wrong import, not because the assertion caught a missing implementation. Green phase "passes" the test by fixing the syntax, not by implementing the feature.
Fix: After writing a failing test, verify the failure message matches your expectation. A ModuleNotFoundError is not a meaningful test failure.