Search everything...

Skill

gsd:eval-review

From gsd

Audits evaluation coverage of completed AI phases against AI-SPEC.md, scores dimensions as COVERED/PARTIAL/MISSING, and generates EVAL-REVIEW.md with verdict, gaps, and remediation plan.

ai-ml

testing

npx claudepluginhub jnuyens/gsd-plugin --plugin gsd

Tool Access

This skill is limited to using the following tools:

ReadWriteBashGlobGrepTaskAskUserQuestion

Preview

SKILL.md

Similar Skills

eval-audit

949

Audits LLM eval pipelines for issues like missing error analysis, unvalidated judges, and vanity metrics. Produces prioritized findings with fixes when inheriting systems or verifying trustworthiness.

evals-skills

agentv-eval-writer

Writes, edits, reviews, and validates AgentV EVAL.yaml files for agent skill evaluations. Adds test cases, configures graders, converts from evals.json or chat transcripts.

4 files

agentv-dev

start-evals

Generates 20 test cases (15 happy path + 5 edge) for AI features in spreadsheet format using PM-Friendly Evals. Launches simple eval workflow with optional Linear project.

bette-think

Stats

Stars18

Forks3

Last CommitApr 24, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

gsd:eval-review | gsd | ClaudePluginHub

Back to Skills

Skill

gsd:eval-review

From gsd

Audits evaluation coverage of completed AI phases against AI-SPEC.md, scores dimensions as COVERED/PARTIAL/MISSING, and generates EVAL-REVIEW.md with verdict, gaps, and remediation plan.

ai-ml

testing

npx claudepluginhub jnuyens/gsd-plugin --plugin gsd

Tool Access

This skill is limited to using the following tools:

ReadWriteBashGlobGrepTaskAskUserQuestion

Preview

SKILL.md

Conduct a retroactive evaluation coverage audit of a completed AI phase. Checks whether the evaluation strategy from AI-SPEC.md was implemented. Produces EVAL-REVIEW.md with score, verdict, gaps, and remediation plan.

<execution_context> @${CLAUDE_PLUGIN_ROOT}/workflows/eval-review.md @${CLAUDE_PLUGIN_ROOT}/references/ai-evals.md </execution_context>

Phase: $ARGUMENTS — optional, defaults to last completed phase. Execute @${CLAUDE_PLUGIN_ROOT}/workflows/eval-review.md end-to-end. Preserve all workflow gates.

Similar Skills

eval-audit

949

evals-skills

agentv-eval-writer

Writes, edits, reviews, and validates AgentV EVAL.yaml files for agent skill evaluations. Adds test cases, configures graders, converts from evals.json or chat transcripts.

4 files

agentv-dev

start-evals

Generates 20 test cases (15 happy path + 5 edge) for AI features in spreadsheet format using PM-Friendly Evals. Launches simple eval workflow with optional Linear project.

bette-think

Stats

Stars18

Forks3

Last CommitApr 24, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.