Search everything...

Stats

Actions

Available In

harness-kit

Name: harness-kit
Author: romabeckman

By romabeckman

Orchestrates autonomous TDD/DDD software engineering workflows with phase-isolated agents for adversarial QA, critical code review, and persistent project memory. Coordinates backend and frontend development, automated testing, documentation generation, and trace-driven skill optimization.

npx claudepluginhub romabeckman/harness-kit --plugin harness-kit

Popularity

Stars

Top 25%

Med: 0·Avg: 282

Installs

Med: 0·Avg: 1

What's Inside

Agents8

software-architect

/software-architect

Senior Software Architect specialized in DDD, system design, technical refinement, and technical decision-making. Use for architecture decisions, scope refinement, design refinement, and technical quality gates.

cto

/cto

Chief Technology Officer for the autonomous-orchestrator. Governs execution strategies, evaluates metrics, and triggers the autonomous loop.

developer-backend

/developer-backend

Senior Backend Developer specialized in TDD, API design, database modeling, security, and performance. Use for writing backend code (APIs, services, workers), fixing server bugs, implementing business logic, and backend testing.

developer-debugging

/developer-debugging

Systematic Investigation and Debugging Specialist. Uses the systematic-debugging skill and the "5 Whys" to identify the root cause of bugs before implementation.

developer-frontend

/developer-frontend

Senior Frontend Developer specialized in TDD, UI/UX implementation, accessibility, and performance. Use for writing frontend code (React, Vue, CSS, HTML), fixing UI bugs, implementing designs, and frontend testing.

Skills9

adversarial-qa

/adversarial-qa

Autonomous Adversarial QA agent. Reads machine-readable specs and code to execute edge-case and security testing, returning a JSON verdict.

autonomous-orchestrator

/autonomous-orchestrator

Sovereign loop manager. Handles file initialization, feature lifecycle tracking, and recursive TDD-Validation-Optimization cycles. Strictly delegates all technical tasks to sub-agents.

harness-evaluator

/harness-evaluator

Harness performance evaluator. Reads all execution traces in docs/harness-history/traces/, computes composite scores per skill_chain, identifies the Pareto frontier of best harness configurations, and recommends the optimal chain for the next session. Run periodically or on demand to guide harness optimization.

harness-tracer

/harness-tracer

Execution trace recorder. Captures what happened during a skill session and persists structured logs to docs/harness-history/traces/. Enables retrospective analysis and harness optimization via harness-evaluator and meta-harness.

meta-harness

/meta-harness

Autonomous Meta-Harness proposer. Reads the full harness history filesystem, diagnoses failure patterns, proposes a targeted improvement, stores the candidate, and outputs a JSON decision.

Stats

Version1.2.16

ReleasedJun 20, 2026

Stars22

Forks4

MaintenanceExcellent

LicenseMIT

Last CommitJun 20, 2026

AddedJun 20, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

Own this plugin?

Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).

Available In

harness-kit22

Safety Signals

Caution

Uses power tools

Uses Bash, Write, or Edit tools

README

🔧 HarnessKit

Harness Engineering: A reliable AI agent is not just a raw model. It is defined as: $$\text{Reliable Agent} = \text{Model (AI)} + \text{Harness (Controls)} + \text{Human Auditor}$$

HarnessKit is a complete AI-assisted software engineering methodology built on Harness Engineering—the principle that true reliability comes from enclosing generative models inside structured execution scaffolds and human-driven governance loops.

👑 The Core Engine: Autonomous Orchestration & Live Auditing

At the heart of HarnessKit is the autonomous-orchestrator skill. Once provided with the initial task scope, it runs an atomic, continuous execution cycle without stopping, pausing, or asking redundant questions—fully automating domain planning, TDD execution, and multi-agent code reviews.

However, the human engineer is never replaced: your role evolves into Live Auditing.

While the orchestrator executes continuously, you act as the Human Auditor in the cockpit, tracking the live stream through your coding workspace (Claude Code, Cursor, OpenCode, Gemini, Copilot, etc.). The AI moves with sovereignty, but you maintain continuous telemetry and oversight.

⚡ Hot-Interception: Absolute Human Command

Because the engine runs seamlessly without waiting for permissions at every step, you use this live observability to dynamically intercept the loop when necessary:

Pull the Emergency Brake: Forcefully kill the execution (Ctrl+C) the moment you notice the AI has adopted an incorrect architectural premise.
Live In-Flight Injections: Hot-patch the active backlog, append newly uncovered constraints, or update domain specifications while the loop is running.
Dynamic Parameter Tweaking: Modify configuration thresholds on the fly—lower the validation score target (default 0.70) to accept a minor style debt, or increase maxReworks directly inside the configuration files.

🔍 Socratic Code Review in Action

To prevent systemic risks (such as N+1 queries, memory leaks, security vulnerabilities, or database connection exhaustion), HarnessKit employs a Socratic Code Review model. The orchestrator invokes the the-grumpy-tech-lead to validate the code inferentially by asking deep architectural questions rather than providing copy-paste solutions.

Below is a visual example of how this interactive code review occurs under your watch:

Socratic Code Review Example

🤖 Autonomous State Machine

The loop is driven by a robust Product State Machine. It tracks feature backlogs and dynamic transition gates, ensuring that a feature only progresses when quality criteria are fully satisfied.

Autonomous State Machine

Based on gate scores, the orchestrator updates the project state machine into four terminal statuses:

COMPLETED: Approved and ready for your final PR review.
RETRY: Scores fell short; the engine compiles a REWORK-LOG.md and loops back to code automatically.
BLOCKED: Critical crash/break—the engine triggers a circuit breaker and halts for immediate human intervention.
FAILED: Non-blocking tech debt—the pipeline logs the issue and moves to the next feature, leaving the debt for you to audit later.

Installation & Quick Commands

HarnessKit is distributed as a command-line plugin compatible with major AI developer ecosystems.

⚠️ IMPORTANT! This project requires the Superpowers skill. Install it before initializing HarnessKit:

/plugin install superpowers@claude-plugins-official

Claude Code

/plugin marketplace add romabeckman/harness-kit
/plugin install harness-kit@harness-kit
/harness-kit:project-memory --help

GitHub Copilot CLI

copilot plugin marketplace add romabeckman/harness-kit
copilot plugin install harness-kit@harness-kit

Gemini CLI

# Install the extension
agy plugin install https://github.com/romabeckman/harness-kit

What's Inside

To prevent role contamination, the orchestrator isolates operational contexts by dispatching highly specialized agent personas equipped with dedicated skills.

🛠️ Skills (`/skills`)

View full README on GitHub

harness-kit

Popularity

What's Inside

Confidence

README

🔧 HarnessKit

👑 The Core Engine: Autonomous Orchestration & Live Auditing

⚡ Hot-Interception: Absolute Human Command

🔍 Socratic Code Review in Action

🤖 Autonomous State Machine

Installation & Quick Commands

Claude Code

GitHub Copilot CLI

Gemini CLI

What's Inside

🛠️ Skills (/skills)

Similar Plugins

harness-engineering

harness-flow

metaswarm

crucible

claudekit

ship

🔧 HarnessKit

👑 The Core Engine: Autonomous Orchestration & Live Auditing

⚡ Hot-Interception: Absolute Human Command

🔍 Socratic Code Review in Action

🤖 Autonomous State Machine

Installation & Quick Commands

Claude Code

GitHub Copilot CLI

Gemini CLI

What's Inside

🛠️ Skills (/skills)

Popularity

Health & Quality

Similar Plugins

harness-engineering

harness-flow

metaswarm

crucible

claudekit

ship

🛠️ Skills (`/skills`)

🛠️ Skills (`/skills`)