From agents
Audit harness configs and apply fixes. Use when tuning Claude Code, Codex, Cursor, Gemini CLI, Antigravity, Copilot, or OpenCode. NOT for agents (agent-conventions) or MCP servers (mcp-creator).
npx claudepluginhub wyattowalsh/agents --plugin agentsThis skill uses the workspace's default tool permissions.
Audit AI harness configuration quality, then apply approved fixes in the same session.
evals/absent-surfaces.jsonevals/all-both.jsonevals/antigravity-global.jsonevals/antigravity-project-evidence-boundary.jsonevals/apply-approved.jsonevals/apply-without-review.jsonevals/cursor-project.jsonevals/degraded-docs.jsonevals/empty-invocation.jsonevals/harness-only.jsonevals/implicit-trigger.jsonevals/install-guidance.jsonevals/level-only.jsonevals/multi-harness-mixed-order.jsonevals/negative-control.jsonevals/unsupported-harness.jsonreferences/evidence-boundaries.mdreferences/harness-checklists.mdreferences/harness-surfaces.mdreferences/install-guidance.mdCreates isolated Git worktrees for feature branches with prioritized directory selection, gitignore safety checks, auto project setup for Node/Python/Rust/Go, and baseline verification.
Executes implementation plans in current session by dispatching fresh subagents per independent task, with two-stage reviews: spec compliance then code quality.
Dispatches parallel agents to independently tackle 2+ tasks like separate test failures or subsystems without shared state or dependencies.
Audit AI harness configuration quality, then apply approved fixes in the same session.
NOT for: creating agents (agent-conventions), building MCP servers (mcp-creator), or generic code review without harness/config focus (honest-review).
Input: the argument string after the skill name — harness names, level selection, apply approved, install, or natural-language harness requests.
$ARGUMENTS | Mode | First move |
|---|---|---|
Empty | Intake | Ask for harnesses or all, then ask for project, global, or both |
all | Intake | Ask only for level |
all <level> / <level> all | Audit | Dry-run all 7 harnesses in deterministic order |
<harness...> | Intake | Resolve one or more harnesses, then ask only for missing level |
<level> | Intake | Resolve level, then ask only for missing harnesses |
<harness...> <level> / <level> <harness...> | Audit | Run dry-run review for the selected harnesses and level |
apply approved | Apply Approved | Apply the last matching dry-run batch only if the user already approved it in this session |
apply <harness...> <level> / apply <level> <harness...> | Apply Approved | Apply only if the same scope was already reviewed and approved in this session; otherwise rerun audit first |
install <harness...> | Install Guidance | Show exact npx skills add ... commands only; do not edit configs |
| Natural language: "review/audit/check/tune config" | Audit | Normalize harnesses + level, then run dry-run review |
| Natural language approval like "approved", "do it", "apply those changes" | Apply Approved | Continue only if the immediately preceding harness-master review matches the scope and is still current |
| Requests to create agents or MCP servers | Refuse + redirect | Redirect to the correct specialized skill |
| Unsupported harness names | Refuse + clarify | List supported harnesses and ask the user to choose from that set |
If the user invokes /harness-master with no arguments:
all.project, global, or both.claude-code, codex, cursor, gemini-cli, antigravity, github-copilot, opencodeproject, global, bothclaude, claude-code -> claude-codecodex -> codexcursor -> cursorgemini, gemini-cli -> gemini-cliantigravity, google-antigravity -> antigravitycopilot, gh-copilot, github-copilot -> github-copilotopencode, open-code -> opencodeproject, repo, local -> projectglobal, user -> globalboth, all-levels -> bothall and named harnesses, ask which form they want.all order: claude-code, codex, cursor, gemini-cli, antigravity, github-copilot, opencode$ARGUMENTS into harnesses, level, all, install intent, apply intent, and unresolved tokens.install ..., run Install Guidance only.apply ... or an approval phrase, run Apply Approved only if the matching dry-run review already exists in this session. Otherwise rerun audit first.all.project, global, or both.Gate 0 — Discover surfaces
uv run python skills/harness-master/scripts/discover_surfaces.py --repo-root . --level <level> --harness <canonical-harness> [--harness ...]
present, missing, generated, merged, repo-observed, or blind-spot.Glob, Read, and Grep.Gate 1 — Inspect project context before recommending fixes
README.md, AGENTS.md, harness-facing project files, key manifests, CI/workflow signals, and repo-native orchestration logic when present.config/tooling-policy.json and config/sync-manifest.json as canonical context for managed/generated/merged harness surfaces.Gate 2 — Refresh latest official guidance
references/latest-doc-sources.md.llms.txt first when available, then first-party docs, then canonical vendor repo docs, then web fallback only if needed.Gate 3 — Audit each selected harness independently
references/harness-surfaces.md, references/harness-checklists.md, and references/evidence-boundaries.md selectively for the selected harnesses only.both, compare project/global precedence before recommending edits.Gate 4 — Report dry-run findings
references/output-format.md.harness-master dry-run review in the same session already covered the same harness set and level.npx skills add ... guidance from references/install-guidance.md.| Scope | Strategy |
|---|---|
| 1 harness, 1 level | Inline review with one per-harness report |
2-3 harnesses or both on 1 harness | Review each harness independently, then add a short synthesis |
4-7 harnesses or all | Keep per-harness sections strict, batch discovery/research where possible, then finish with a cross-harness synthesis and ranked cleanup order |
Use the most current authoritative guidance available:
llms.txt or equivalent official indexNever claim latest without evidence from a current source.
verified-file, verified-doc, repo-observed, or blind-spot.both, report project/global conflicts before recommending changes.references/install-guidance.md only when the user asks how to install harness-master, or when missing skill availability is the actual root cause.npx skills add <source> --skill harness-master -y -g --agent <agent>
wagents install --local only if the user explicitly asks for project-local installation.After all per-harness reviews, synthesize:
Every per-harness report must include:
Then add a cross-harness synthesis section when 2+ harnesses were reviewed.
| File | Content | Read When |
|---|---|---|
references/workflow.md | Gate-by-gate audit/apply workflow, precedence rules, degraded mode, and approval gate details | Audit, Apply Approved |
references/latest-doc-sources.md | Official llms.txt and docs URLs per harness, plus fallback order | Latest-doc refresh |
references/harness-surfaces.md | Project/global surfaces, precedence, install agent names, generated/merged notes | Surface interpretation |
references/harness-checklists.md | Per-harness audit checklist and edge cases | Per-harness review |
references/evidence-boundaries.md | Evidence tags, blind-spot handling, and contradiction policy | Reporting findings |
references/install-guidance.md | Exact npx skills add ... commands, when to surface them, and anti-patterns | Install Guidance |
references/output-format.md | Per-harness and cross-harness report templates | Final output |
Read only the references needed for the active step. Do not preload all references.
Use these terms exactly throughout:
| Term | Meaning | NOT |
|---|---|---|
harness | One supported agent/runtime target | editor, toolchain |
level | project, global, or both scope for the review | environment |
project surface | Repo-local file or generated artifact used by a harness | global config |
global surface | User-level harness config outside the repo | project file |
dry-run | Findings + patch preview only; no edits | apply |
approval gate | Explicit user consent required before edits | implied approval |
blind-spot | A surface or behavior that is not observable in the current session | guess |
repo-observed | Behavior inferred from the current codebase's real harness wiring | official-doc evidence |
patch preview | Proposed diff or snippet shown before edits | applied change |
canonical source | The file or config that should be changed instead of a generated output | generated surface |
harness-master run starts in dry-run audit mode.Apply Approved is valid only after a matching dry-run review in the current session.verified-file, verified-doc, repo-observed, or blind-spot.npx skills add ... only when installation is genuinely the right next step.