Plugin

harness-engineering

Name: harness-engineering
Author: toru-oizumi

Personal Harness Engineering plugin for Claude Code. Deterministic guardrails, fast Write/Edit feedback loops, Proto-first architecture, and multi-model PR review for TypeScript / Go / Protocol Buffers projects.

Component Overview

/architecture-enforcement, /flow-parallel +7

Commands

architect-reviewer

Agents

proto-stack, quality-gate +23

Skills

PreToolUse, PostToolUse +1

Hooks

MCP Servers

LSP Servers

Output Styles

Install

npx claudepluginhub toru-oizumi/claude-harness-engineering --plugin harness-engineering

Component Details

Commands (9)

architecture-enforcement

/architecture-enforcement

Run architecture enforcement checks: go-arch-lint YAML rules for Go, dependency-cruiser or @softarc/sheriff for TypeScript, depguard, and Factory.ai 4-category framework (grep-ability, glob-ability, architecture boundaries, security). Use when you want to verify layer constraints are not violated, after adding new dependencies, or before opening a PR. Do NOT use for runtime design pattern advice (use architecture-advisor for that).

flow-parallel

/flow-parallel

Implement a feature using Explorer+Workers parallel agents. Use when implementing a non-trivial feature in a single session.

harness-add-proto

/harness-add-proto

Add Proto harness support to an existing repository. Installs buf.yaml, buf.gen.yaml, review-profile.proto.yaml, and the first .proto file skeleton if none exists.

harness-check

/harness-check

Diagnose the current repository's Harness Engineering state. Reports which configs are present, which are missing, which linters are installed on PATH, and whether hooks are active.

harness-init

/harness-init

Bootstrap the current repository with Harness Engineering tooling. Detects TS/Go/Proto stack, installs config templates, CLAUDE.md, lefthook.yml, and .claude/review-profile.yaml.

harness-lint-all

/harness-lint-all

Run all harness linters and formatters across the entire repository. Use as a manual full-repo sweep when CI fails or before opening a large PR.

harness-review-pr

/harness-review-pr

Run a multi-perspective PR review using the pr-reviewer-3a skill. Supports standard, mob, deep, and multi modes.

investigate

/investigate

Systematically investigate a bug to identify root cause. Use when root cause is unclear. Hands off to bug-fix-tdd when cause is found.

quality-gate

/quality-gate

Pre-PR quality gate — stage files, review comments, run code review, then execute lint/test. Use before creating a PR or committing a logical chunk of work.

Agents (1)

architect-reviewer

/architect-reviewer

Specialized subagent for deep architecture review. Use when analyzing layer boundaries, dependency direction, abstraction levels, and design principle adherence. Model: sonnet (for careful reasoning).

Skills (25)

proto-stack

/proto-stack

Protocol Buffers harness configuration guidance. Covers buf lint, buf format, buf breaking (wire compatibility detection), protolint (fallback), field number conventions, reserved tags discipline, and Connect RPC/gRPC integration. Use when adding a new .proto service, troubleshooting buf hook failures, designing a breaking-change-safe schema evolution, or setting up buf.yaml. THIS IS THE GAP that most dev playbooks don't fill.

quality-gate

/quality-gate

Pre-PR quality gate: stage files, review code comments, run code review, then execute lint/test. Use when ready to create a PR, before committing a logical chunk of work, or before requesting team review. Do NOT use for reviewing someone else's PR (use pr-reviewer-3a).

review-comments

/review-comments

Review and update code comments in staged files — add missing doc comments, remove obvious/stale comments, and add LLM-friendly context. Use when asked to 'review comments', 'clean up comments', or 'add doc comments' before committing. Do NOT use for logic review (use pr-reviewer-3a).

ts-stack

/ts-stack

TypeScript/JavaScript harness configuration guidance. Covers Biome (format), Oxlint (lint), tsc (type check), dependency-cruiser or @softarc/sheriff (architecture boundaries), and Knip (dead code). Use when setting up TS harness, troubleshooting TS lint hook failures, choosing between ESLint and Oxlint, or enforcing layer boundaries in TS monorepos. Do NOT use for Go or Proto — see go-stack / proto-stack.

go-stack

/go-stack

Go harness configuration guidance. Covers gofumpt (format), golangci-lint v2 with depguard/cyclop/iface/exhaustive, go-arch-lint (layer boundaries), gosec (security), and custom go/analysis analyzers. Use when setting up Go harness, troubleshooting Go lint hook failures, writing .golangci.yml, or declaring architecture rules in .go-arch-lint.yml. Do NOT use for TS or Proto.

pr-reviewer-3a

/pr-reviewer-3a

Review PRs from Business, Development, and Testing perspectives with 4 modes: standard, mob (broad scan + deep-dive), deep (3-round progressive), multi (parallel personas). Use when reviewing a PR, self-reviewing before merge, or checking a diff. Do NOT use for addressing received comments (use pr-comment-handler).

pre-task-diagnostic

/pre-task-diagnostic

Broad readiness check before complex work: detect risks, ambiguities, missing context, and dependencies. Use before starting a complex feature, before delegating to a sub-agent, or when unsure if all prerequisites are met. Do NOT use for a quick blocker check.

proposal-evaluator

/proposal-evaluator

Evaluate incoming proposals, suggestions, or review comments on 5 axes and produce a principled Accept/Counter/Decline/Defer decision with explicit rationale. Use before acting on any external feedback to avoid blindly following suggestions.

agile-testing-guide

/agile-testing-guide

Guide testing strategy and patterns for the current language/framework. Use when planning how to test a feature, choosing between unit/integration/e2e tests, or reviewing test quality across Go or TypeScript projects. Do NOT use for linting test code against Detroit School principles (use detroit-school-lint) or for writing a specific test (use bug-fix-tdd).

architecture-advisor

/architecture-advisor

Advise on layered architecture patterns for the current language/framework: correct layer boundaries, dependency direction, and anti-pattern detection. Use when designing a new feature's structure, reviewing an architecture decision, or when code violates layer boundaries. Do NOT use for dependency visualization (use dependency-graph) or execution flow tracing (use code-journey).

architecture-enforcement

/architecture-enforcement

Deterministic architecture enforcement across languages: go-arch-lint YAML rules for Go, dependency-cruiser or @softarc/sheriff for TypeScript, depguard inside golangci-lint, Factory.ai's 4-category framework (grep-ability, glob-ability, architecture boundaries, security). Use when designing layering for a new service, enforcing Clean Architecture, blocking forbidden imports, or setting up custom analyzers. Do NOT use for runtime design patterns (use architecture-advisor for that).

babysit-pr

/babysit-pr

Monitor a pull request's CI/CD status, diagnose failures, and suggest fixes. Use when asked to 'babysit', 'watch', or 'monitor' a PR, or when CI checks are failing and need diagnosis. Do NOT use for code review (use pr-reviewer-3a) or comment handling (use pr-comment-handler).

bug-fix-tdd

/bug-fix-tdd

Fix bugs by writing a failing test first, then making it pass (Red-Green-Refactor). Use when QA reports a bug, a customer reports an issue, or a regression is found. Do NOT use when root cause is unclear (use bug-investigation first), not for TDD feature development.

bug-investigation

/bug-investigation

Systematically investigate bugs to identify root cause before fixing. Use when root cause is unclear, when asked to investigate a crash or unexpected behavior, or before writing a fix. Also invoked automatically by code-explorer for breakage deep-dives. Do NOT use when root cause is already known (use bug-fix-tdd to fix it).

harness-setup

/harness-setup

Bootstrap a repository with Harness Engineering tooling: detect stack (TS/Go/Proto/mixed), install config templates (biome.json, .golangci.yml, .go-arch-lint.yml, buf.yaml, lefthook.yml), copy a pointer-style CLAUDE.md, and wire up .claude/review-profile.yaml. Use this when the user says 'set up harness', 'bootstrap the repo', 'add harness engineering', or runs /harness:init. Do NOT use for modifying existing configs — that's what /harness:check is for.

implementation-sequencer

/implementation-sequencer

Determine optimal implementation order based on dependencies, risks, and feature flags. Use when planning sprint execution order, sequencing tasks with dependencies, or deciding what to build first. Do NOT use for estimating complexity (use estimation-poker or similar).

pr-comment-handler

/pr-comment-handler

Analyze, respond to, and fix PR review comments, then save learnings to memory. Use when review comments arrive on a PR, when asked to address review feedback. Do NOT use for reviewing a PR yourself (use pr-reviewer-3a), pre-PR checks (use quality-gate).

pr-review-multi-model

/pr-review-multi-model

Multi-model PR review using the 3 Amigos pattern: Claude Sonnet (architect), Gemini 2.5 Pro (bug hunter), DeepSeek (DX reviewer), orchestrated via Mastra. Use when reviewing a PR that needs multi-perspective analysis, when a single-model review has missed bugs before, or when the PR touches critical paths (auth, data layer, proto). Do NOT use Claude Code Max OAuth for these — API keys only, due to ToS. See docs/RATIONALE.md for the design decision.

careful-mode

/careful-mode

Production safety guard that blocks destructive commands for the session. Use when user says '/careful', 'production work', or 'cautious mode'. Do NOT activate for local development work. Deactivate with '/careful off'.

code-explorer

/code-explorer

Query-driven codebase exploration engine — ask any question about code and get structural answers through automatic multi-analysis. Internally composes code-journey, dependency-graph, bug-investigation, and other skills. Use when onboarding to unfamiliar code, investigating 'how does this work?', or mapping impact before changes. This is the single entry point for all code understanding tasks.

code-journey

/code-journey

Trace and visualize code execution paths step-by-step with annotations. Use when understanding how a request flows through layers, debugging execution order, or learning how data moves between components. Also invoked automatically by code-explorer for flow analysis. Do NOT use for dependency graphs (use dependency-graph), architecture overviews (use ascii-diagram).

config-protect

/config-protect

Explains the config protection policy enforced by the pre-config-protect hook. Covers which files are write-protected, why, and what to do when the agent (or you) hits a blocked edit. Use when the user asks 'why can't Claude edit biome.json?', 'how do I disable the config guard?', or when a PreToolUse block message appears. Do NOT use for adding new protected patterns — that's a hook-utils.js code change.

detroit-school-lint

/detroit-school-lint

Validate tests against Detroit School (Classical) TDD principles: real objects over mocks, state verification, sociable tests. Use when reviewing test quality, when tests look mock-heavy or brittle, or ensuring tests reflect real behavior. Do NOT use for writing tests (use bug-fix-tdd), full PR review (use pr-reviewer-3a).

edit-lint-feedback-loop

/edit-lint-feedback-loop

Explains how the PostToolUse Write/Edit lint feedback loop works end-to-end: hook routing, Rust-based linter invocation, additionalContext injection, and how to extend it with a new language. Use when debugging why a hook isn't firing, when adding support for a new language, or when understanding why Claude 'knows' about a lint violation without being told. Do NOT use for consumer-facing setup (use harness-setup).

flow-parallel

/flow-parallel

Implement a feature using Explorer+Workers parallel agents for maximum flow efficiency. Use when implementing a non-trivial feature in a single session, especially when 'use flow-parallel', 'implement in parallel', or 'Explorer+Workers' is mentioned. Do NOT use for simple single-file changes or debugging tasks.

Hooks (1)

Review workflow modifications before installing

Event Hooks

All tools

3 hooks across 3 events

README

claude-harness-engineering

Personal Harness Engineering plugin for Claude Code. Deterministic guardrails, fast Write/Edit feedback loops, Proto-first architecture, and multi-model PR review — for TypeScript, Go, and Protocol Buffers projects.

日本語版 / Japanese README

What is this?

An opinionated Claude Code plugin that applies Harness Engineering principles to everyday coding sessions:

model = CPU, context = RAM, harness = OS

The goal: make it structurally impossible for the agent to ship code that violates your architecture, silence your linters, or break proto wire compatibility — without you having to babysit it.

Philosophy

"Most of the value comes from the harness, not the model." — Mitchell Hashimoto, OpenAI harness engineering, 2026

Instead of hoping the LLM behaves, this plugin wires deterministic tools into the Claude Code session lifecycle. The agent gets fast, tight feedback loops — not vague instructions.

Features

1. Write/Edit → Lint Feedback Loop

Every file write triggers a language-aware lint + format pass. Violations are injected back as additionalContext on the next turn, so Claude self-corrects within the same flow — without losing context or requiring a second prompt.

Language	Formatter	Linter	Arch check
TypeScript / JS	Biome	Oxlint	dependency-cruiser / sheriff
Go	gofumpt + goimports	golangci-lint v2	go-arch-lint
Proto	`buf format`	`buf lint` + protolint	`buf breaking`

All formatters complete in <100ms (Biome, Oxlint, gofumpt are Rust/Go binaries — no slow Node startup).

2. Config Protection

biome.json, .golangci.yml, .go-arch-lint.yml, buf.yaml, tsconfig.json, and similar harness files are write-protected for agents. When Claude hits a lint violation, it cannot "fix" it by disabling the rule — it must fix the code.

Protected file categories:

Stack	Files
TypeScript	`biome.json`, `biome.jsonc`, `.oxlintrc.json`, `tsconfig.json`, `.eslintrc`, `eslint.config.`, `.prettierrc`, `.dependency-cruiser.*`
Go	`.golangci.yml`, `.golangci.yaml`, `.go-arch-lint.yml`, `gosec.yaml`
Proto	`buf.yaml`, `buf.gen.yaml`, `.protolint.yaml`
Cross-cutting	`lefthook.yml`, `.pre-commit-config.yaml`
Claude Code	`.claude/settings.json`, `.claude/review-profile.yaml`

Blocked edits produce a clear exit 2 message: harness-engineering: BLOCKED edit on protected config file. Only a human can change these configs.

3. Bash Firewall

Dangerous shell commands are blocked at the PreToolUse hook before execution:

git commit --no-verify — bypasses pre-commit hooks
rm -rf on protected directories
git reset --hard on protected branches (main, develop, master)
git push --force to protected branches
chmod 777, sudo rm, and other destructive patterns

The firewall exits with exit 2 and explains why the command was blocked. Claude cannot proceed without explicitly working around the constraint — which requires justification visible in the conversation.

4. Proto First-Class Support

Every .proto file write runs three checks:

buf lint — validates proto style and correctness
buf format --diff — enforces consistent formatting
buf breaking --against .git#branch=main — detects wire-breaking changes before they land

Wire-breaking changes (removing fields, renaming enum values, changing field numbers) are caught immediately — not at deploy time. The buf.gen.yaml template supports both Connect RPC and gRPC code generation.

5. Multi-perspective PR Review

The pr-reviewer-3a skill reviews PRs from three angles — Business, Development, and Testing — in a single Claude session. Four modes:

Mode	Description
`standard`	Sequential Business / Dev / Testing review
`mob`	Broad scan first, then deep-dive on flagged files
`deep`	3-round progressive review (surface → logic → integration)
`multi`	Parallel personas, each perspective runs independently

Run via /harness:review-pr or /harness:review-pr --mode mob.

Skills

25 skills ship with this plugin. Each lives in harness-engineering/skills/<name>/SKILL.md.

Harness (bootstrap + guardrails)

Skill	Description
`harness-setup`	Interactive repo bootstrap: detects stack, copies CLAUDE.md, installs configs and lefthook
`config-protect`	Write-protection policy for linter configs; what's protected and why
`edit-lint-feedback-loop`	PostToolUse → lint → additionalContext injection pipeline details
`careful-mode`	Production safety guard that blocks destructive commands for the session

Language stacks

View full README on GitHub

Similar Plugins

fullstack-dev-skills

8.4k

180

Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.

Stats

Version0.2.2

Parent Repo Stars0

MaintenanceExcellent

LicenseMIT

AddedApr 14, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

Available In

toru-harness

Safety Signals

Critical

Matches all tools

Hooks run on every tool call, not just specific ones

Caution

Uses power tools

Uses Bash, Write, or Edit tools

claude-harness-engineering

Personal Harness Engineering plugin for Claude Code. Deterministic guardrails, fast Write/Edit feedback loops, Proto-first architecture, and multi-model PR review — for TypeScript, Go, and Protocol Buffers projects.

日本語版 / Japanese README

What is this?

An opinionated Claude Code plugin that applies Harness Engineering principles to everyday coding sessions:

model = CPU, context = RAM, harness = OS

The goal: make it structurally impossible for the agent to ship code that violates your architecture, silence your linters, or break proto wire compatibility — without you having to babysit it.

Philosophy

"Most of the value comes from the harness, not the model." — Mitchell Hashimoto, OpenAI harness engineering, 2026

Instead of hoping the LLM behaves, this plugin wires deterministic tools into the Claude Code session lifecycle. The agent gets fast, tight feedback loops — not vague instructions.

Features

1. Write/Edit → Lint Feedback Loop

Language	Formatter	Linter	Arch check
TypeScript / JS	Biome	Oxlint	dependency-cruiser / sheriff
Go	gofumpt + goimports	golangci-lint v2	go-arch-lint
Proto	`buf format`	`buf lint` + protolint	`buf breaking`

All formatters complete in <100ms (Biome, Oxlint, gofumpt are Rust/Go binaries — no slow Node startup).

2. Config Protection

Protected file categories:

Stack	Files
TypeScript	`biome.json`, `biome.jsonc`, `.oxlintrc.json`, `tsconfig.json`, `.eslintrc`, `eslint.config.`, `.prettierrc`, `.dependency-cruiser.*`
Go	`.golangci.yml`, `.golangci.yaml`, `.go-arch-lint.yml`, `gosec.yaml`
Proto	`buf.yaml`, `buf.gen.yaml`, `.protolint.yaml`
Cross-cutting	`lefthook.yml`, `.pre-commit-config.yaml`
Claude Code	`.claude/settings.json`, `.claude/review-profile.yaml`

Blocked edits produce a clear exit 2 message: harness-engineering: BLOCKED edit on protected config file. Only a human can change these configs.

3. Bash Firewall

Dangerous shell commands are blocked at the PreToolUse hook before execution:

git commit --no-verify — bypasses pre-commit hooks
rm -rf on protected directories
git reset --hard on protected branches (main, develop, master)
git push --force to protected branches
chmod 777, sudo rm, and other destructive patterns

4. Proto First-Class Support

Every .proto file write runs three checks:

buf lint — validates proto style and correctness
buf format --diff — enforces consistent formatting
buf breaking --against .git#branch=main — detects wire-breaking changes before they land

5. Multi-perspective PR Review

The pr-reviewer-3a skill reviews PRs from three angles — Business, Development, and Testing — in a single Claude session. Four modes:

Mode	Description
`standard`	Sequential Business / Dev / Testing review
`mob`	Broad scan first, then deep-dive on flagged files
`deep`	3-round progressive review (surface → logic → integration)
`multi`	Parallel personas, each perspective runs independently

Run via /harness:review-pr or /harness:review-pr --mode mob.

Skills

25 skills ship with this plugin. Each lives in harness-engineering/skills/<name>/SKILL.md.

Harness (bootstrap + guardrails)

Skill	Description
`harness-setup`	Interactive repo bootstrap: detects stack, copies CLAUDE.md, installs configs and lefthook
`config-protect`	Write-protection policy for linter configs; what's protected and why
`edit-lint-feedback-loop`	PostToolUse → lint → additionalContext injection pipeline details
`careful-mode`	Production safety guard that blocks destructive commands for the session

harness-engineering

Component Overview

Install

Component Details

Commands (9)

Agents (1)

Skills (25)

Hooks (1)

README

claude-harness-engineering

What is this?

Philosophy

Features

1. Write/Edit → Lint Feedback Loop

2. Config Protection

3. Bash Firewall

4. Proto First-Class Support

5. Multi-perspective PR Review

Skills

Harness (bootstrap + guardrails)

Language stacks

Similar Plugins

fullstack-dev-skills

harness-engineering

Component Overview

Install

Component Details

Commands (9)

Agents (1)

Skills (25)

Hooks (1)

README

claude-harness-engineering

What is this?

Philosophy

Features

1. Write/Edit → Lint Feedback Loop

2. Config Protection

3. Bash Firewall

4. Proto First-Class Support

5. Multi-perspective PR Review

Skills

Harness (bootstrap + guardrails)

Language stacks

Similar Plugins

fullstack-dev-skills

team-skills-platform

episodic-memory

context7-plugin

startup-business-analyst

dotnet-skills