Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
By jagreehal
Ralph Wiggum-style AI coding loop with specs, skills, and observability. Run Claude Code autonomously until all specs pass.
npx claudepluginhub jagreehal/baldrickExecute Baldrick iterations on existing specs. Use when: run baldrick, execute specs, implement features, start working. Triggers on: baldrick run, run specs, implement, execute features.
Iteratively improve test coverage until target is reached. Use when: improve coverage, test coverage, coverage target, baldrick coverage. Triggers on: coverage, test coverage, coverage loop.
Iteratively reduce code entropy by fixing smells and improving quality. Use when: cleanup, code smells, entropy, refactor loop, baldrick entropy. Triggers on: entropy, cleanup, code smells, reduce entropy.
Iteratively fix all linting errors until clean. Use when: fix lint, lint errors, lint loop, baldrick lint-fix. Triggers on: lint, fix lint, lint errors, lint loop.
Generate PR documentation from completed spec and git diff. Use when: pr docs, generate docs, pull request docs, baldrick docs. Triggers on: pr docs, generate docs, document changes.
Share bugs, ideas, or general feedback.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Spec-driven development with task-by-task execution. Research, requirements, design, tasks, autonomous implementation, and epic triage for multi-spec feature decomposition.
SPEC-First development workflow with TDD, Ralph Loop, and autonomous agent coordination for Claude Code
Autonomous end-to-end development system - from requirement to production-ready code with zero manual intervention
Describe your goal, approve the spec, then step away — Claude and Codex loop together until it's right.
Use Claude Code like a Director, not a Programmer. Complete toolkit with 26 commands, 14 agents, 31 skills, and TDD-based Auto-Loop.
Skills-first specification-driven development framework with 7 agent skills for planning, implementation, review, and shipping. Natural language activation with intelligent agent orchestration. Includes /plan, /implement, /research commands plus managing-specifications, implementing-features, and reviewing-and-shipping skills.
Opinionated TypeScript patterns for AI-assisted development. Enforces fn(args, deps), Result types, Zod validation, and production-grade architecture.
Capture mistakes and improvement opportunities with 5 whys root cause analysis. Learn from patterns to prevent future issues.
Autonomous AI coding loops that actually finish.

"I have a cunning plan!" - Baldrick, Blackadder
curl -sO https://raw.githubusercontent.com/jagreehal/baldrick/main/baldrick && chmod +x baldrick && ./baldrick init
Create a spec, run ./baldrick run 5, watch Claude implement it.
Based on Geoffrey Huntley's Ralph pattern — define what "done" looks like, run Claude repeatedly until it gets there. Baldrick adds structure:
See design.md for the full philosophy.
Every AI coding session accumulates context: files read, commands run, wrong turns taken, half-baked plans. You can add but can't delete. Eventually it starts repeating itself, "fixing" the same bug different ways, confidently undoing its own work. That's context pollution.
Baldrick doesn't try to clean the context. It throws it away and starts fresh. Progress persists in files (specs, progress.txt, git). Failures evaporate with the session. Each iteration reconstructs reality from the filesystem, not chat history.
After running ./baldrick init, you'll have:
specs/ # Your feature specs go here
templates/ # Spec templates (minimal, standard, detailed, example)
progress.txt # Session progress log
baldrick-learnings.md # Permanent patterns
tradeoffs.md # Decision log (auto-created if missing)
logs/ # Iteration logs (auto-created if missing)
Requires: Claude Code CLI (configured with appropriate permissions)
Now create a spec. This tells baldrick what "done" looks like:
Quick way:
./baldrick new hello --title "Add Hello Function"
Or manually:
cat > specs/hello.md << 'EOF'
---
title: "Add Hello Function"
passes: false
---
# Add Hello Function
## Done When
- [ ] Function `hello(name)` returns "Hello, {name}!"
- [ ] Tests pass
EOF
For better results, see Spec Templates or use Claude Code skills (spec-create, spec-vibe, spec-detailed)
passes: true means the spec's Done When criteria are met and quality gates (build/test/lint) are green.
Run it:
./baldrick run 1 # Run one iteration
./baldrick run 10 --verbose # Run with streaming output
./baldrick status # Check progress
That's it. Claude implements the feature, runs quality gates (build/test/lint), and commits when all checks pass.
Run options:
--dry-run - Show what would run without executing--stream-json - Use Claude stream-json + jq (requires jq)--verbose or -v - Stream Claude output in real-time (instead of spinner)--stream=PATH - Enable TUI event streaming (advanced)--control=PATH - Enable TUI control socket (advanced)Don't want to write specs manually? Use any AI assistant (ChatGPT, Gemini, Claude):
templates/spec-generator.md into your chatspecs/your-feature.md (Baldrick auto-discovers specs here)./baldrick run or ./baldrick once your-feature to run a specific specThis bridges conversational AI (great for brainstorming) with Baldrick (great for execution).
Tip: If your feature is large, the AI will generate multiple specs. Save each one as a separate file.