codeharness - Claude Code Plugin | ClaudePluginHub

Name: codeharness
Author: ivintik

Makes autonomous coding agents produce software that actually works — real-world verification, observability, and mechanical enforcement via Claude Code hooks.

npx claudepluginhub ivintik/private-claude-marketplace --plugin codeharness

What's Inside

Slash Commands (5)

Harness Onboard

/harness-onboard

Scan an existing project and generate an onboarding plan to bring it to full harness compliance.

Harness Docs

/harness-docs

Generate or update project documentation — docs/ tree + README.md — using the BMAD tech-writer with codeharness post-processing.

Harness Init

/harness-init

Initialize the codeharness harness in the current project — detect stack, configure enforcement, install dependencies, set up hooks.

Harness Status

/harness-status

Show harness health, sprint progress, and verification state at a glance.

Harness Teardown

/harness-teardown

Remove all harness artifacts without touching project source code.

Agents (2)

doc-gardener

/doc-gardener

Scans project documentation for staleness, missing AGENTS.md files, and stale exec-plans. Use during retrospectives or on-demand to keep docs fresh. Must complete within 60 seconds (NFR23).

verifier

/verifier

Runs the verification pipeline for a story: reads acceptance criteria and produces a Showboat proof document with real-world evidence. Use when a story needs verification after implementation and tests pass.

Skills (2)

bmad-integration

/bmad-integration

Integrates codeharness with BMAD methodology — reads sprint plans, maps stories to verification tasks, enforces harness requirements in all BMAD workflows. Triggers when working with BMAD artifacts, sprint plans, or story files.

visibility-enforcement

/visibility-enforcement

Enforces that the agent queries observability tools (VictoriaLogs, VictoriaMetrics, VictoriaTraces) during development instead of guessing at runtime behavior. Triggers when the agent is debugging, investigating errors, or verifying runtime behavior.

Hooks (1)

Event Hooks

1 hook across 1 event

Stats

Version: 0.47.0
Released: Apr 14, 2026
Language: TypeScript
Stars: 1
Maintenance: Excellent
License: MIT
Last Commit: Apr 14, 2026
Added: Mar 24, 2026

Available In

ivintik

Safety Signals

Caution

Uses power tools

Uses Bash, Write, or Edit tools

README

codeharness

Makes autonomous coding agents produce software that actually works — not software that passes tests.

codeharness is an npm CLI + Claude Code plugin that packages verification-driven development as an installable tool: black-box verification via Docker, agent-first observability via VictoriaMetrics, and mechanical enforcement via hooks that make skipping verification architecturally impossible.

What it does

- Verifies features work, not just that tests pass. Black-box verification runs the built CLI inside a Docker container with no source code access. If the feature doesn't work from a user's perspective, verification fails.
- Fixes what it finds. Verification failures caused by code bugs automatically return to development with specific findings, so the dev agent is told exactly what is broken and why.
- Runs sprints autonomously. It reads your sprint plan, picks the highest-priority story, implements it, checks it (tests + lint), verifies it (agent evaluation), and moves to the next one. Cross-epic prioritization, retry management, and session handoff are built in.
- Makes agents see runtime. An ephemeral VictoriaMetrics stack (logs, metrics, traces) lets agents query runtime behavior programmatically during development instead of guessing at what the code does.

Installation

Two components — install both:

# CLI (npm package)
npm install -g codeharness

# Claude Code plugin (slash commands, hooks, skills)
claude plugin install github:iVintik/codeharness

Quick Start

# Initialize in your project
codeharness init

# Start autonomous sprint execution (inside Claude Code)
/harness-run

How it works

As a CLI (`codeharness`)

The CLI handles all mechanical work — stack detection, Docker management, verification, coverage, retry state.

| Command | Purpose |
| --- | --- |
| `codeharness init` | Detect stack, install dependencies, start observability, scaffold docs |
| `codeharness run` | Execute the autonomous coding loop (Ralph) |
| `codeharness verify --story <key>` | Run verification pipeline for a story |
| `codeharness status` | Show harness health, sprint progress, Docker stack |
| `codeharness coverage` | Run tests with coverage and evaluate against targets |
| `codeharness onboard epic` | Scan codebase for gaps, generate onboarding stories |
| `codeharness retry --status` | Show retry counts and flagged stories |
| `codeharness retry --reset` | Clear retry state for re-verification |
| `codeharness verify-env build` | Build Docker image for black-box verification |
| `codeharness stack start` | Start the shared observability stack |
| `codeharness teardown` | Remove harness from project |

All commands support `--json` for machine-readable output.
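A minimal sketch of how a script might consume that machine-readable output. The README does not document the JSON schema, so the parsed value is treated as `unknown`. The `Runner` type and `runJson` helper are hypothetical names for illustration and are not part of codeharness.

```typescript
// A runner executes the CLI with the given arguments and returns its stdout.
// In Node this could wrap child_process.execFileSync("codeharness", args);
// it is injected here so the sketch does not depend on a specific runtime.
type Runner = (args: string[]) => string;

// Append --json and parse the result.
// The JSON shape is an unknown: this README does not describe it.
function runJson(run: Runner, args: string[]): unknown {
  const stdout = run([...args, "--json"]);
  try {
    return JSON.parse(stdout);
  } catch {
    throw new Error(`codeharness ${args.join(" ")} did not return valid JSON`);
  }
}

// Usage with a stand-in runner; this payload is made up for illustration
// and simply echoes back the arguments it received.
const fakeRun: Runner = (args) => JSON.stringify({ receivedArgs: args });
const statusResult = runJson(fakeRun, ["status"]);
console.log(statusResult);
```

Injecting the runner keeps the parsing logic testable without the CLI installed.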

As a Claude Code plugin (`/harness-*`)

The plugin provides slash commands that orchestrate the CLI within Claude Code sessions:

| Command | Purpose |
| --- | --- |
| `/harness-run` | Autonomous sprint execution: picks stories by priority, runs the create → implement → check → verify loop |
| `/harness-init` | Interactive project initialization |
| `/harness-status` | Quick overview of sprint progress and harness health |
| `/harness-onboard` | Scan project and generate onboarding plan |
| `/harness-verify` | Verify a story with real-world evidence |
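The loop that `/harness-run` is described as running can be sketched roughly as follows. This is an illustration of the create → implement → check → verify ordering only, not the plugin's actual implementation. The `Story` shape, the "lower number means higher priority" rule, and the `StageHandler` callback are all assumptions.

```typescript
// Hypothetical story shape; the real sprint-plan format is not shown in this README.
interface Story {
  key: string;
  priority: number; // assumption: lower number = higher priority
  done: boolean;
}

const STAGES = ["create", "implement", "check", "verify"] as const;
type Stage = (typeof STAGES)[number];

// A stage handler returns true on success. Handlers stand in for the real work.
type StageHandler = (story: Story, stage: Stage) => boolean;

// Pick the unfinished story with the best (lowest) priority value.
function pickNextStory(stories: Story[]): Story | undefined {
  return stories
    .filter((s) => !s.done)
    .sort((a, b) => a.priority - b.priority)[0];
}

// Run stories in priority order; a story stops at its first failing stage
// and is set aside so the loop can move on to the next one.
function runSprint(stories: Story[], handle: StageHandler): string[] {
  const log: string[] = [];
  const setAside = new Set<string>();
  for (;;) {
    const story = pickNextStory(stories.filter((s) => !setAside.has(s.key)));
    if (!story) break;
    const ok = STAGES.every((stage) => {
      log.push(`${story.key}:${stage}`);
      return handle(story, stage);
    });
    if (ok) story.done = true;
    else setAside.add(story.key);
  }
  return log;
}

// Usage: two made-up stories, every stage succeeds.
const backlog: Story[] = [
  { key: "story-b", priority: 2, done: false },
  { key: "story-a", priority: 1, done: false },
];
console.log(runSprint(backlog, () => true).join(" "));
```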

BMAD Method integration

codeharness integrates with BMAD Method for structured sprint planning:

| Phase | Commands |
| --- | --- |
| Analysis | `/create-brief`, `/brainstorm-project`, `/market-research` |
| Planning | `/create-prd`, `/create-ux` |
| Solutioning | `/create-architecture`, `/create-epics-stories` |
| Implementation | `/sprint-planning`, `/create-story`, then `/harness-run` |

Verification architecture

┌─────────────────────────────────────────┐
│  Claude Code Session                     │
│  /harness-run picks next story           │
│  → create-story → implement → check → verify │
└────────────────────┬────────────────────┘
                     │ verify
                     ▼
┌─────────────────────────────────────────┐
│  Docker Container (no source code)       │
│  - codeharness CLI installed from tarball│
│  - claude CLI for nested verification    │
│  - curl/jq for observability queries     │
│  Exercises CLI as a real user would      │
└────────────────────┬────────────────────┘
                     │ queries
                     ▼
┌─────────────────────────────────────────┐
│  Observability Stack (VictoriaMetrics)   │
│  - VictoriaLogs  :9428 (LogsQL)         │
│  - VictoriaMetrics :8428 (PromQL)       │
│  - OTEL Collector :4318                  │
└─────────────────────────────────────────┘
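
The observability layer in the diagram can be queried over plain HTTP, which is what the `curl/jq` entry suggests. The sketch below builds query URLs for the two query endpoints. It assumes the stack is reachable on `localhost`; the ports come from the diagram. VictoriaLogs serves LogsQL queries at `/select/logsql/query`, and VictoriaMetrics serves the Prometheus-compatible instant query API at `/api/v1/query`. The helper names and example queries are hypothetical.

```typescript
// Assumption: the stack runs on localhost. Ports are taken from the diagram.
const VICTORIA_LOGS = "http://localhost:9428";
const VICTORIA_METRICS = "http://localhost:8428";

// Build a VictoriaLogs query URL (LogsQL, capped at `limit` results).
function logsQueryUrl(query: string, limit = 20): string {
  return `${VICTORIA_LOGS}/select/logsql/query?query=${encodeURIComponent(query)}&limit=${limit}`;
}

// Build a VictoriaMetrics instant-query URL (PromQL).
function metricsQueryUrl(promql: string): string {
  return `${VICTORIA_METRICS}/api/v1/query?query=${encodeURIComponent(promql)}`;
}

// Example queries; "error" and "up" are placeholders, not codeharness defaults.
console.log(logsQueryUrl("error"));
console.log(metricsQueryUrl("up"));
```

The resulting URLs can be passed to `curl` or `fetch`. Keeping URL construction separate from the request makes the query encoding easy to check in isolation.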

When verification finds code bugs, the story returns to development with the findings, the dev agent fixes them, and verification runs again. This loop runs up to 10 times per story. Infrastructure failures (timeouts, Docker errors) are retried 3 times, then the story is skipped.
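
The retry rules above can be sketched as a small policy function. This is an illustration under stated assumptions, not codeharness's actual code: both limits are counted as total attempts, the two counters are independent, and the `"exhausted"` result name is hypothetical, since the README does not say what happens after the tenth failed verification.

```typescript
type Outcome = "pass" | "code-bug" | "infra-failure";
type Result = "verified" | "exhausted" | "skipped";

const MAX_CODE_BUG_ATTEMPTS = 10; // "up to 10 times per story"
const MAX_INFRA_ATTEMPTS = 3; // "retry 3 times then skip"

// `attempt` stands in for one verification run, including any dev fix
// applied since the previous run.
function verifyWithRetries(attempt: () => Outcome): Result {
  let codeBugs = 0;
  let infraFailures = 0;
  for (;;) {
    const outcome = attempt();
    if (outcome === "pass") return "verified";
    if (outcome === "code-bug") {
      codeBugs += 1;
      // The story goes back to dev with findings, then is verified again.
      if (codeBugs >= MAX_CODE_BUG_ATTEMPTS) return "exhausted";
    } else {
      infraFailures += 1;
      // Timeouts and Docker errors are retried, then the story is skipped.
      if (infraFailures >= MAX_INFRA_ATTEMPTS) return "skipped";
    }
  }
}

// Usage: one infrastructure failure, one code bug, then a pass.
const scripted: Outcome[] = ["infra-failure", "code-bug", "pass"];
let step = 0;
console.log(verifyWithRetries(() => scripted[step++] ?? "pass"));
```

Counting code bugs and infrastructure failures separately means a flaky Docker run does not use up a story's fix attempts.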

Requirements

View full README on GitHub
