From harness-eval
Runs harness evaluations in quick, standard, full, or compare modes by activating corresponding skill playbooks for project assessment.
npx claudepluginhub whchoi98/harness-eval --plugin harness-eval
How this command is triggered (by the user, by Claude, or both):
Slash command
/harness-eval:CLAUDE
The summary Claude sees in its command listing, used to decide when to auto-load this command:
# Commands Module ## Role User-facing slash commands for evaluation. Each `.md` file in this directory is auto-discovered by Claude Code as a `/harness-eval:<name>` command. ## Key Files - `harness-eval.md` — Main entry point with `argument-hint: [quick|standard|full|compare]` - `quick.md` — Direct quick evaluation command - `standard.md` — Direct standard evaluation command - `full.md` — Direct full evaluation command - `compare.md` — Direct compare evaluation command ## Rules - Commands are auto-discovered from this directory (not registered in plugin.json) - Must include frontmatter w...
- `/eval`: Manages eval-driven development workflow with subcommands to define, check, report, and list evals. Tracks capability and regression criteria with pass rates.
- `/phase3-meta-eval`: Runs meta-evaluation Python script on evaluation skills, checking recursive quality, TOC, tests, and docs; produces severity reports, pass rates, and action items.
- `/skills-eval`: Evaluates Claude Skills for quality, frontmatter compliance, token efficiency, activation reliability, and security. Generates scores, breakdowns, prioritized improvements, and roadmaps for all or specific skills.
- `/harness-audit`: Audits the repository's agent harness setup across tool coverage, context efficiency, quality gates, and more, producing an overall score out of 70, category breakdowns, findings, top 3 actions with file paths, and suggested ECC skills. Supports optional scopes and JSON format.
- `/eval`: Evaluates a plugin or skill directory for quality via static analysis and optional LLM judging, producing overall score with badge, dimension breakdowns, anti-patterns, and recommendations.
- `/evaluate`: Runs three-stage verification pipeline to evaluate execution, checking for correctness, safety, and performance in the current context.
# Commands Module

## Role

User-facing slash commands for evaluation. Each `.md` file in this directory is auto-discovered by Claude Code as a `/harness-eval:<name>` command.

## Key Files

- `harness-eval.md`: Main entry point with `argument-hint: [quick|standard|full|compare]`
- `quick.md`: Direct quick evaluation command
- `standard.md`: Direct standard evaluation command
- `full.md`: Direct full evaluation command
- `compare.md`: Direct compare evaluation command

## Rules

- Commands are auto-discovered from this directory (not registered in plugin.json)
- Must include frontmatter with `description` and `allowed-tools` fields
- `argument-hint` in frontmatter makes options visible in Claude Code UI
- `skills/<mode>/SKILL.md`
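
The frontmatter fields named above (`description`, `allowed-tools`, `argument-hint`) might look roughly like this at the top of a command file. This is only a sketch: the description text, the tool list, and the body wording are assumptions for illustration and are not taken from the plugin's actual `harness-eval.md`.

```markdown
---
# Hypothetical values; only the field names come from the module notes.
description: Run a harness evaluation in the selected mode
allowed-tools: Read, Glob, Grep
argument-hint: [quick|standard|full|compare]
---

<!-- Hypothetical body: $ARGUMENTS is replaced with whatever the user types after the command. -->
Evaluate the current project using the mode given in `$ARGUMENTS`.
```

With a file like this in the commands directory, typing the command name in Claude Code would show the `argument-hint` value next to it, which is how the four modes become visible in the UI.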