Skill

review

Independent 4-stage code review: Spec compliance → Code quality → Security → Adversarial edge cases. Spawns specialized reviewers, then filters by Succinctness/Accuracy/Actionability. Use before a major commit or as a review gate. NOT for trivial changes.

Install

npx claudepluginhub hir4ta/qult --plugin qult

Tool Access

This skill uses the workspace's default tool permissions.

Preview

Four-stage code review: independent specialized reviewers → Judge filter.

SKILL.md

Similar Skills

test-driven-development

1 file

Guides strict Test-Driven Development (TDD): write failing tests first for features, bugfixes, refactors before any production code. Enforces red-green-refactor cycle.

antigravity-bundle-qa-testing

32.8k

systematic-debugging

10 files

Guides systematic root cause investigation for bugs, test failures, unexpected behavior, performance issues, and build failures before proposing fixes.

antigravity-bundle-qa-testing

32.8k

ab-test-setup

Guides A/B test setup with mandatory gates for hypothesis validation, metrics definition, sample size calculation, and execution readiness checks.

antigravity-bundle-qa-testing

32.8k

Stats

Parent Repo Stars0

Parent Repo Forks0

Last CommitApr 5, 2026

Actions

View Source View Plugin View on GitHub View README

/qult:review

Four-stage code review: independent specialized reviewers → Judge filter.

Quality by Structure, Not by Promise. Four pairs of eyes, each seeing what the others miss. The Wall blocks completion until all stages pass.

Stage 0: Run on_review gates (runtime verification)

Before spawning reviewers, run any on_review gates defined in .qult/gates.json:

Read .qult/gates.json — if no on_review section, skip to Stage 1
For each gate in on_review, run the command via Bash with the gate's timeout value (in ms) as the Bash tool timeout. If no timeout is specified, default to 60000ms.

Collect results as a summary block:

## on_review gate results
- e2e: PASS (12.3s)
- e2e: FAIL (8.1s) — [first 500 chars of stderr]

Pass this block as context when spawning reviewers

If a gate times out or crashes, record it as ERROR and continue. Do not block the review.

Stage 0.5: Extract plan acceptance criteria

If an active plan exists in .claude/plans/, extract acceptance criteria:

Read the most recently modified .md file in .claude/plans/
For each ### Task N: block, extract the Verify line

Build a compact criteria block:

## Plan acceptance criteria
- Task 1: <name> — Verify: <test file>:<test function>
- Task 3: <name> — Verify: <test file>:<test function>

Only include tasks with a non-empty Verify field
If no plan file exists or no Verify fields are found, skip this stage entirely

Stage 0.7: Collect detector findings (ground truth)

Before spawning reviewers, collect computational detector results as ground truth:

Call mcp__plugin_qult_qult__get_detector_summary()
If the result is NOT "No detector findings.", store it as a ## Detector Findings block
This block will be included in each reviewer's prompt as context

These findings are deterministic (not LLM-generated) and serve as ground truth that reviewers must not contradict.

Stage 1: Spec Reviewer (implementation completeness)

Spawn one spec-reviewer agent.

In the agent prompt, include:

The on_review gate results from Stage 0 (if any)
The plan acceptance criteria from Stage 0.5 (if any)
The detector findings from Stage 0.7 (if any)
One-line instruction: "Verify the uncommitted changes match the plan and all consumers are updated."

The spec-reviewer evaluates Completeness and Accuracy in an independent context.

Collect output: Spec: PASS/FAIL, Score: Completeness=N Accuracy=N, findings.

Post-validation: Verify the agent output contains Spec: PASS or Spec: FAIL and Score: Completeness=N Accuracy=N. If the output does not contain a verdict line, the agent malfunctioned — re-spawn it with a clearer prompt. Do NOT fabricate scores.

If Spec: PASS, record the scores:

mcp__plugin_qult_qult__record_stage_scores({ stage: "Spec", scores: { completeness: N, accuracy: N } })

Stage 2: Quality Reviewer (design & maintainability)

Spawn one quality-reviewer agent.

In the agent prompt, include:

The on_review gate results from Stage 0 (if any)
The detector findings from Stage 0.7 (if any)
One-line instruction: "Review the uncommitted changes for design quality and maintainability issues."

The quality-reviewer evaluates Design and Maintainability in an independent context.

Collect output: Quality: PASS/FAIL, Score: Design=N Maintainability=N, findings.

Post-validation: Verify the agent output contains Quality: PASS or Quality: FAIL and Score: Design=N Maintainability=N. If the output does not contain a verdict line, the agent malfunctioned — re-spawn it with a clearer prompt. Do NOT fabricate scores.

If Quality: PASS, record the scores:

mcp__plugin_qult_qult__record_stage_scores({ stage: "Quality", scores: { design: N, maintainability: N } })

Stage 3: Security Reviewer (vulnerability & hardening)

Spawn one security-reviewer agent.

In the agent prompt, include:

The detector findings from Stage 0.7 (if any)
One-line instruction: "Review the uncommitted changes for security vulnerabilities and hardening gaps."

The security-reviewer evaluates Vulnerability and Hardening in an independent context.

Collect output: Security: PASS/FAIL, Score: Vulnerability=N Hardening=N, findings.

Post-validation: Verify the agent output contains Security: PASS or Security: FAIL and Score: Vulnerability=N Hardening=N. If the output does not contain a verdict line, the agent malfunctioned — re-spawn it with a clearer prompt. Do NOT fabricate scores. Also verify the agent did not modify any files (check git status for unexpected changes) — security reviewer is read-only.

If Security: PASS, record the scores:

mcp__plugin_qult_qult__record_stage_scores({ stage: "Security", scores: { vulnerability: N, hardening: N } })

Stage 4: Adversarial Reviewer (edge cases & logic correctness)

Spawn one adversarial-reviewer agent.

In the agent prompt, include:

The detector findings from Stage 0.7 (if any)
One-line instruction: "Find edge cases, logic errors, and silent failures in the uncommitted changes that other reviewers missed."

The adversarial-reviewer evaluates EdgeCases and LogicCorrectness in an independent context.

Collect output: Adversarial: PASS/FAIL, Score: EdgeCases=N LogicCorrectness=N, findings.

Post-validation: Verify the agent output contains Adversarial: PASS or Adversarial: FAIL and Score: EdgeCases=N LogicCorrectness=N. If the output does not contain a verdict line, the agent malfunctioned — re-spawn it with a clearer prompt. Do NOT fabricate scores. Also verify the agent did not modify any files — adversarial reviewer is read-only.

Note: Adversarial stage scores are included in the 4-stage aggregate (/40). All four stages contribute to the overall threshold check.

If Adversarial: PASS, record the scores:

mcp__plugin_qult_qult__record_stage_scores({ stage: "Adversarial", scores: { edgeCases: N, logicCorrectness: N } })

Stage 5: Judge filter

For EACH finding from ALL four reviewers, verify:

Succinctness: Clear and to the point? Not vague or rambling?
Accuracy: Technically correct in this codebase's context? Not a false positive?
Actionability: Includes a concrete fix? Not just "consider X"?

Discard findings that fail any criterion. Report only what passes all three.

Stage 6: Score aggregation & iteration

After Stage 5, aggregate all scores:

Total: Completeness + Accuracy + Design + Maintainability + Vulnerability + Hardening + EdgeCases + LogicCorrectness = N/40

The SubagentStop hook enforces score thresholds for each reviewer independently. When any reviewer's score is below threshold or verdict is FAIL, SubagentStop blocks.

When blocked:

Fix the issues identified by the failing reviewer(s)
Re-spawn ONLY the failing reviewer(s) on the updated diff
Re-apply Judge filter on new findings

Maximum 3 iterations total. After max iterations, the review proceeds regardless.

Output

Summary block showing all four stages:

## Review Summary

### Spec: PASS — Completeness=5 Accuracy=4
No issues found.

### Quality: PASS — Design=4 Maintainability=4
1 finding (0 critical, 0 high, 1 medium)

### Security: PASS — Vulnerability=5 Hardening=4
No issues found.

### Adversarial: PASS — EdgeCases=4 LogicCorrectness=5
No issues found.

### Aggregate: 34/40

Then for each passing finding from the Judge filter:

[severity] file:line — description
Fix: concrete suggestion

If all four stages pass with no findings: "Review complete. All clear."

Stage 7: Record review completion

This step is mandatory. After all stages pass and the summary is output:

Call mcp__plugin_qult_qult__record_review({ aggregate_score: <total> }) to record the review completion in session state
This enables the commit gate to allow commits. Without this call, the commit gate will block.

This is the authoritative signal that review is complete. SubagentStop hooks provide additional enforcement but are not the primary mechanism.