From evaluator
Draft Verification critique: self-review + git history comparison. Run before marking any non-trivial task complete.
npx claudepluginhub artmin96/forge-studio --plugin evaluatorThis skill is limited to using the following tools:
Guides Next.js Cache Components and Partial Prerendering (PPR) with cacheComponents enabled. Implements 'use cache', cacheLife(), cacheTag(), revalidateTag(), static/dynamic optimization, and cache debugging.
Migrates code, prompts, and API calls from Claude Sonnet 4.0/4.5 or Opus 4.1 to Opus 4.5, updating model strings on Anthropic, AWS, GCP, Azure platforms.
Proposes cuts, reorganization, and simplification to improve document structure, clarity, and flow while preserving comprehension. Use for structural or editorial reviews.
Two-stage review inspired by Meta-Harness's Draft Verification pattern (arXiv 2603.28052). Stage 1 is your self-critique (the draft). Stage 2 retrieves historical evidence — both confirmers and challengers — to verify or refute the draft assessment.
Review the code you just wrote. Answer honestly:
For each file you changed, retrieve evidence from git history. This stage verifies your Stage 1 assessment against real project history.
For each modified file, run:
git log --oneline --diff-filter=M -10 -- <file>
Then for the most relevant commits, check what patterns they used:
git show <commit> -- <file>
Ask: Do your changes follow the patterns that have worked before in these files?
Search for reverted or fix-up commits touching the same files:
git log --oneline --all --grep="revert\|fix\|hotfix\|rollback" -- <file>
Search for changes that were reverted within a week:
git log --oneline --diff-filter=M --since="3 months ago" -- <file> | head -20
Ask: Have similar changes to these files caused issues before? Are you repeating a known anti-pattern?
If git history is available, also check:
git log --oneline --diff-filter=M --since="1 month" -- <file> — is this a file hotspot? High churn = high riskIf no git history is available (new repo, new files), skip Stage 2 and note it in the report.
CHALLENGE REPORT (Draft Verification)
======================================
Stage 1 — Self-Critique
Simplification: [Can/Cannot be simplified. If can: how]
Risk: [Highest risk area and why]
Weakest Part: [What and why]
Scope Match: [Yes/No. If no: what was added beyond scope]
Staff Approval: [Yes/Likely/No. If no: what needs to change]
Stage 2 — Historical Verification
Confirmers: [N past changes to same files followed similar patterns / no history]
Challengers: [N past issues found in same files / none found]
Churn Risk: [Low/Medium/High — based on recent change frequency]
Verdict: [CONFIRMED / CAUTION / REFUTED — does history support this change?]
Be ruthlessly honest. The point is to catch issues BEFORE the user does.