By iVintik
Makes autonomous coding agents produce software that actually works — real-world verification, observability, and mechanical enforcement via Claude Code hooks.
npx claudepluginhub ivintik/private-claude-marketplace --plugin codeharness
Scan an existing project and generate an onboarding plan to bring it to full harness compliance.
Generate or update project documentation — docs/ tree + README.md — using the BMAD tech-writer with codeharness post-processing.
Initialize the codeharness harness in the current project — detect stack, configure enforcement, install dependencies, set up hooks.
Show harness health, sprint progress, and verification state at a glance.
Remove all harness artifacts without touching project source code.
Scans project documentation for staleness, missing AGENTS.md files, and stale exec-plans. Use during retrospectives or on-demand to keep docs fresh. Must complete within 60 seconds (NFR23).
Runs verification pipeline for a story — reads acceptance criteria, produces Showboat proof document with real-world evidence. Use when a story needs verification after implementation and tests pass.
Integrates codeharness with BMAD methodology — reads sprint plans, maps stories to verification tasks, enforces harness requirements in all BMAD workflows. Triggers when working with BMAD artifacts, sprint plans, or story files.
Enforces that the agent queries observability tools (VictoriaLogs, VictoriaMetrics, VictoriaTraces) during development instead of guessing at runtime behavior. Triggers when the agent is debugging, investigating errors, or verifying runtime behavior.
Uses power tools
Uses Bash, Write, or Edit tools
Makes autonomous coding agents produce software that actually works — not software that passes tests.
codeharness is an npm CLI + Claude Code plugin that packages verification-driven development as an installable tool: black-box verification via Docker, agent-first observability via VictoriaMetrics, and mechanical enforcement via hooks that make skipping verification architecturally impossible.
Two components — install both:
```bash
# CLI (npm package)
npm install -g codeharness

# Claude Code plugin (slash commands, hooks, skills)
claude plugin install github:iVintik/codeharness

# Initialize in your project
codeharness init

# Start autonomous sprint execution (inside Claude Code)
/harness-run
```
CLI (codeharness)

The CLI handles all mechanical work: stack detection, Docker management, verification, coverage, and retry state.
| Command | Purpose |
|---|---|
| `codeharness init` | Detect stack, install dependencies, start observability, scaffold docs |
| `codeharness run` | Execute the autonomous coding loop (Ralph) |
| `codeharness verify --story <key>` | Run verification pipeline for a story |
| `codeharness status` | Show harness health, sprint progress, Docker stack |
| `codeharness coverage` | Run tests with coverage and evaluate against targets |
| `codeharness onboard epic` | Scan codebase for gaps, generate onboarding stories |
| `codeharness retry --status` | Show retry counts and flagged stories |
| `codeharness retry --reset` | Clear retry state for re-verification |
| `codeharness verify-env build` | Build Docker image for black-box verification |
| `codeharness stack start` | Start the shared observability stack |
| `codeharness teardown` | Remove harness from project |
All commands support --json for machine-readable output.
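
As a rough illustration of consuming that machine-readable output from a script, the sketch below wraps a CLI call and parses whatever JSON it prints. The helper name `run_codeharness_json` and the injectable `runner` parameter are hypothetical, and the shape of the JSON document is not specified here, so the function returns the parsed value without assuming any fields.

```python
import json
import subprocess
from typing import Any, Callable, Sequence


def run_codeharness_json(
    args: Sequence[str],
    runner: Callable[..., subprocess.CompletedProcess] = subprocess.run,
) -> Any:
    """Run `codeharness <args> --json` and return the parsed JSON.

    `runner` defaults to subprocess.run; it is injectable only so the
    helper can be exercised without the CLI installed (an assumption made
    for this sketch, not part of codeharness itself).
    """
    cmd = ["codeharness", *args, "--json"]
    # check=True raises CalledProcessError on a non-zero exit code.
    result = runner(cmd, capture_output=True, text=True, check=True)
    # The output schema is not documented here, so no keys are assumed.
    return json.loads(result.stdout)
```

With the CLI installed, `run_codeharness_json(["status"])` would run `codeharness status --json`; inspect the returned value before relying on any particular key.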
Plugin (/harness-*)

The plugin provides slash commands that orchestrate the CLI within Claude Code sessions:
| Command | Purpose |
|---|---|
| `/harness-run` | Autonomous sprint execution: picks stories by priority, runs the create → implement → check → verify loop |
| `/harness-init` | Interactive project initialization |
| `/harness-status` | Quick overview of sprint progress and harness health |
| `/harness-onboard` | Scan project and generate onboarding plan |
| `/harness-verify` | Verify a story with real-world evidence |
codeharness integrates with BMAD Method for structured sprint planning:
| Phase | Commands |
|---|---|
| Analysis | /create-brief, /brainstorm-project, /market-research |
| Planning | /create-prd, /create-ux |
| Solutioning | /create-architecture, /create-epics-stories |
| Implementation | /sprint-planning, /create-story, then /harness-run |
```
┌─────────────────────────────────────────────┐
│ Claude Code Session                         │
│ /harness-run picks next story               │
│ → create-story → implement → check → verify │
└──────────────────────┬──────────────────────┘
                       │ verify
                       ▼
┌─────────────────────────────────────────────┐
│ Docker Container (no source code)           │
│ - codeharness CLI installed from tarball    │
│ - claude CLI for nested verification        │
│ - curl/jq for observability queries         │
│ Exercises CLI as a real user would          │
└──────────────────────┬──────────────────────┘
                       │ queries
                       ▼
┌─────────────────────────────────────────────┐
│ Observability Stack (VictoriaMetrics)       │
│ - VictoriaLogs    :9428 (LogsQL)            │
│ - VictoriaMetrics :8428 (PromQL)            │
│ - OTEL Collector  :4318                     │
└─────────────────────────────────────────────┘
```
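
To make the "queries" arrow concrete, here is a minimal sketch of how a verification step might build query URLs for the two stores. The ports come from the diagram; the `localhost` host, the helper names, and the default `limit` are assumptions for illustration. The paths are the standard VictoriaLogs LogsQL endpoint and the Prometheus-compatible instant-query endpoint that VictoriaMetrics exposes, not anything specific to codeharness.

```python
from urllib.parse import urlencode

# Ports are taken from the diagram; the host is an assumption.
VICTORIA_LOGS = "http://localhost:9428"
VICTORIA_METRICS = "http://localhost:8428"


def logs_query_url(query: str, limit: int = 10, base: str = VICTORIA_LOGS) -> str:
    """Build a VictoriaLogs query URL (hypothetical helper)."""
    params = urlencode({"query": query, "limit": limit})
    return f"{base}/select/logsql/query?{params}"


def metrics_query_url(promql: str, base: str = VICTORIA_METRICS) -> str:
    """Build a Prometheus-compatible instant-query URL (hypothetical helper)."""
    return f"{base}/api/v1/query?{urlencode({'query': promql})}"
```

The resulting URLs can be passed to `curl` and piped through `jq`, the tools the diagram lists inside the container.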
When verification finds code bugs, the story returns to dev with the findings, dev fixes them, and the story is re-verified. This loop runs up to 10 times per story. Infrastructure failures (timeouts, Docker errors) are retried 3 times and then skipped.
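
The retry policy can be sketched as a small loop. This is an illustration under stated assumptions, not the codeharness implementation: the names `Outcome` and `run_story` and the returned status strings are hypothetical, infrastructure failures are assumed not to count toward the 10 verification runs, and "retry 3 times" is read as one initial attempt plus three retries.

```python
from enum import Enum
from typing import Callable

MAX_VERIFY_RUNS = 10    # "up to 10 times per story"
MAX_INFRA_RETRIES = 3   # "retry 3 times then skip"


class Outcome(Enum):
    PASSED = "passed"
    CODE_BUG = "code_bug"
    INFRA_FAILURE = "infra_failure"  # timeouts, Docker errors


def run_story(verify: Callable[[], Outcome], fix: Callable[[], None]) -> str:
    """Drive one story through the verify/fix loop (hypothetical sketch)."""
    failed_runs = 0
    infra_retries = 0
    while True:
        outcome = verify()
        if outcome is Outcome.PASSED:
            return "verified"
        if outcome is Outcome.INFRA_FAILURE:
            # Assumption: infra failures have their own budget and do not
            # consume verification runs.
            if infra_retries == MAX_INFRA_RETRIES:
                return "skipped"
            infra_retries += 1
            continue
        # Code bug: the story goes back to dev unless the budget is spent.
        failed_runs += 1
        if failed_runs == MAX_VERIFY_RUNS:
            return "flagged"
        fix()
```

Under these assumptions, a story that never passes is verified 10 times and fixed 9 times before being flagged, and a persistently failing Docker environment leads to a skip after the fourth attempt.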