By vasuag09
Lean full-SDLC Claude Code harness — Plan, Implement, Verify, Maintain — with subagent orchestration and memory persistence.
Implementation-planning specialist. Use PROACTIVELY for non-trivial features, refactors, or anything spanning multiple files. Turns a spec into a phased, risk-assessed task plan. Read-only — produces a plan, does not edit code.
System-design specialist. Use PROACTIVELY for architectural decisions, new subsystems, interface/data-model design, or significant refactors. Produces a design note / ADR with trade-offs. Read-only — designs, does not implement.
Expert code-review specialist. MUST BE USED immediately after writing or modifying code, before merge. Reviews quality, correctness, and maintainability. Read-only — reports findings with severity, does not edit.
Security vulnerability specialist. MUST BE USED for any change touching auth, user input, DB queries, file I/O, external calls, crypto, secrets, or payments — and before merging such code. Scans for OWASP Top-10 and secret leakage. Read-only — reports, does not edit.
Test-driven development specialist. Use PROACTIVELY when implementing a feature or fixing a bug. Enforces tests-first (RED → GREEN → REFACTOR) and ≥80% coverage. May write tests and implementation.
Make and record an architecture/design decision for non-trivial work — new subsystems, interfaces, data models, or significant refactors. Use when /plan flags a load-bearing decision. Delegates to the harness-claude:architect agent.
Measure whether a harness component (a skill, agent, or rule) actually earns its keep — run the same task k times with and without it, in isolated git worktrees, and report pass@k (works once) and pass^k (works consistently). Opt-in; not wired into the default pipeline. Use to decide keep-vs-cut on evidence, or to fill the R5 benchmark for a /extract proposal.
Resolve build failures and type/compile errors fast, with minimal diffs. Use when a build or typecheck fails during implementation. Delegates to the harness-claude:build-error-resolver agent.
Checkpoint a change against its spec's acceptance criteria — run the machine-checks, adjudicate the prose ones, and gate ship until every criterion passes. Opt-in; not wired into the default pipeline. Use late in Verify, alongside /test and /verify.
Turn a repeatable workflow observed this session into a PROPOSED reusable skill — draft it, evaluate it against the rubric, and stage it for your approval. Never writes into skills/. Opt-in; not wired into the default pipeline. Use when a Stop candidate appears or a workflow felt worth keeping.
Matches all tools
Hooks run on every tool call, not just specific ones
Executes bash commands
Hook triggers when Bash tool is used
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Modifies files
Hook triggers on file write and edit operations
Modifies files
Hook triggers on file write and edit operations
Uses power tools
Uses Bash, Write, or Edit tools
Uses power tools
Uses Bash, Write, or Edit tools
A lean, full-SDLC Claude Code harness — personal, isolated, and testable. It runs a four-phase pipeline (Plan → Implement → Verify → Maintain) backed by scoped subagents, runtime hooks, and cross-session memory. Built to evolve toward eval loops, retrieval, long-running agents, multi-agent orchestration, and computer-use (see ROADMAP.md).
Stack focus: TypeScript/JS (React, Next, Vercel) and Python.
| Guide | Read it for |
|---|---|
| docs/USAGE.md | Install → full pipeline → reuse-an-existing-codebase → common scenarios |
| docs/SKILLS.md | Every command + agent, what each does, what it delegates to |
| docs/HOOKS.md | Exactly what runs on your machine, and how to disable any hook |
| docs/TROUBLESHOOTING.md | Hooks not firing, slow tsc, MCP, git boundary — fixes |
| CONTRIBUTING.md | Adding skills/hooks, portability rules, PR checklist |
| ROADMAP.md | Where this is going (eval loops → retrieval → multi-agent → computer-use) |
New here? Start with docs/USAGE.md.
It lives in its own repo and installs as a Claude Code plugin/marketplace, so you can:
/plugin, run a real task, judge the result.
Toggle it off to fall back. It never overwrites your existing ~/.claude.| Layer | Where | Count | Role |
|---|---|---|---|
| Rules | rules/ | 7 | always-on guidance (cited by skills; imported by CLAUDE.md) |
| Skills | skills/<name>/SKILL.md | 24 | 15 atomic /spec … /ship drivers + 5 phase orchestrators + 4 opt-in eval (/eval /extract /benchmark /health) |
| Agents | agents/*.md | 7 | scoped subagents the skills delegate to |
| Hooks | hooks/hooks.json + scripts/hooks/ | — | tmux, format, typecheck, quality/design gates, strategic compact, build analysis, memory persistence |
| MCPs | .mcp.json | 3 | memory · sequential-thinking · magic (load only when enabled) |
PLAN ─────────────► IMPLEMENT ─────► VERIFY ─────────────► MAINTAIN
/spec /research /implement /review /security-review /refactor-clean
/plan /architect /build-fix /test /verify /ship /onboard
memory: /save-session · /resume-session (cross-cutting)
Phase orchestrators run a whole phase in one command — thin wrappers that sequence the atomic skills, delegate to subagents, and pause for your input at each gate:
| Command | Runs |
|---|---|
/harness-plan | spec → research → plan → architect* |
/harness-implement | implement (TDD) → build-fix |
/harness-verify | review → security-review → test → verify → ship |
/harness-maintain | onboard* → refactor-clean |
/harness | the full pipeline, with a checkpoint between phases |
* conditional. Use the atomic skills (/plan, /review, …) when you only want one step.
| Requirement | Why | Without it |
|---|---|---|
| Claude Code ≥ 2.x | Host for the plugin | — |
| Node.js ≥ 18 | All runtime hooks are Node scripts | Hooks no-op; rest of the harness works |
| git | Phase gates, /ship, session snapshots | Most value is inside a repo |
bash + jq (optional) | Status line only | Status line degrades gracefully |
Project tools (prettier, eslint, tsc, ruff, pytest) are not required — the
hooks that use them skip silently when absent. See docs/HOOKS.md for
exactly what runs on your machine.
# 1) Add this repo as a marketplace — from GitHub, or via a local clone path
claude plugin marketplace add vasuag09/harness-claude # straight from GitHub
# …or, if you've cloned it locally:
claude plugin marketplace add /path/to/harness-claude
# 2) In Claude Code, enable the plugin
/plugin # find "harness-claude", enable it
# 3) Verify the pieces loaded
/plugin # confirm skills/agents/hooks registered
/mcp # confirm the 3 MCP servers (enable only what you need)
Then run a real task through the pipeline:
/spec describe the feature you want
/research /plan /architect
/implement
/review /security-review /test /verify /ship
Toggle the plugin off in /plugin to return to your previous setup at any time.
These can't live inside an isolated plugin, so they're shipped as configs you apply
when ready. Your real ~/.claude stays untouched until you do.
npx claudepluginhub vasuag09/harness-claude --plugin harness-claudeHarness-native ECC operator layer - 67 agents, 271 skills, 92 legacy command shims, reusable hooks, rules, selective install profiles, and production-ready workflows for Claude Code, Codex, OpenCode, Cursor, and related agent harnesses
Complete collection of battle-tested Claude Code configs from an Anthropic hackathon winner - agents, skills, hooks, and rules evolved over 10+ months of intensive daily use
Efficient skill management system with progressive discovery — 410+ production-ready skills across 33+ domains