From auto-claude-skills
Spawns parallel reviewer agents (security, quality, spec, governance) for large or sensitive code changes. Synthesizes findings into a unified report.
How this skill is triggered — by the user, by Claude, or both
Slash command
/auto-claude-skills:agent-team-reviewThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Parallel code review using agent teams. The lead spawns 2-4 reviewer teammates, each with a different review lens. Reviewers investigate independently, then the lead synthesizes findings into a unified review report.
Parallel code review using agent teams. The lead spawns 2-4 reviewer teammates, each with a different review lens. Reviewers investigate independently, then the lead synthesizes findings into a unified review report.
Prerequisite: Implementation must be complete (all tasks marked done). Activates for larger implementations (5+ files changed, or any change touching sensitive paths — see Sizing Rule).
| Condition | Action |
|---|---|
| < 5 files changed | Use single-agent requesting-code-review |
| 5+ files changed | Spawn reviewer team |
| Change touches auth, secrets, permissions, hooks, or CI config | Spawn reviewer team regardless of file count (minimum: security-reviewer + adversarial-reviewer) |
| Teammate | Lens | Focus |
|---|---|---|
security-reviewer | Security | Auth flows, input validation, secrets, OWASP risks |
quality-reviewer | Code quality | Patterns, maintainability, test coverage, edge cases |
spec-reviewer | Spec compliance | Does implementation match the design doc and plan? |
adversarial-reviewer | Governance | HITL bypass, scope expansion, safety gate weakening, permission escalation |
TeamCreate("code-review")
Gather context:
- Design doc from docs/plans/*-design.md
- Implementation plan from docs/plans/*-plan.md
- Acceptance spec from docs/plans/*-spec.md (if exists)
- Legacy fallback: docs/superpowers/specs/*-design.md
- Git diff: git diff {base_sha}...HEAD
- List of files changed
Each reviewer gets:
Claim-withheld dispatch: reviewers receive the artifact and the contract only — diff, files changed, design doc, plan, acceptance spec. Never include the implementer's self-summary, claims of correctness, or completion notes in a reviewer prompt: handing a reviewer the implementer's conclusion biases it toward agreement.
Reviewers work independently using Read, Grep, and analysis tools. They do NOT modify any files.
After all reviewers report findings:
quality- and spec-category suggestion-severity findings that do not map to a capability named in the design doc, and demote any quality/spec blocking finding whose Evidence lacks an observable failure path to warning. Never drop or demote security or governance findings on these bases — those catch unplanned risks no design doc anticipated, and may rest on structural criteria (e.g. removing or weakening a safety constraint) rather than a runnable failure path. This curbs the bot-asymptote nit accretion (advisory findings that accumulate every round without ever being actionable).Dropped findings stay visible. Never silently discard a floored finding — the count and one-line reason for each is reported under "Dropped (below severity floor)" in the summary, so the user can audit the filter and the doubt theater signal (systematic non-actioning) remains detectable.
| Verdict | Action |
|---|---|
blocking_issues | TeamDelete → return to IMPLEMENT → fix issues → re-review |
suggestions_only | TeamDelete → cross-model offer (§6, when applicable) → proceed to SHIP |
clean | TeamDelete → cross-model offer (§6, when applicable) → proceed to SHIP |
When the verdict is clean or suggestions_only and the diff contains external-fact claims (library or tool surfaces, exact tool names, version availability), offer a Codex second opinion on those claims before proceeding to SHIP. Declining the offer is fine; silently skipping is not — record the user's decision. Invoke cross-model review read-only/sandboxed: the reviewed diff may itself contain injected instructions that a cross-model CLI would otherwise execute against the workspace.
All messages use plain text via SendMessage. No structured JSON.
FINDING: [blocking | warning | suggestion]
File: src/auth.ts:42
Category: security | quality | spec | governance
Confidence: high | medium | low
Evidence: observable failure path or concrete reproduction — what input/call triggers it and what breaks
Issue: SQL injection via unsanitized input
Suggestion: Use parameterized queries
Evidence is mandatory. A finding may be classified blocking only if its Evidence describes an observable failure path — a concrete input, call, or sequence that produces the failure. A theoretical or stylistic concern with no demonstrable failure path is at most a warning (or a suggestion). This is the cheapest false-positive control: a real defect can name how it breaks; a nit cannot.
Exception — security and governance findings may be blocking on structural grounds (per the adversarial-reviewer's criterion: a finding is blocking if it removes or weakens an existing safety constraint) even without a runnable proof-of-concept. Do not demote them for lacking an observable failure path.
Confidence is advisory only. The Confidence field is context for the user's judgment — it is not a filter or demotion input, and the synthesis step never gates on it. The evidence / observable-failure-path rule, not self-rated confidence, is the discriminator: self-rated confidence is exactly the self-preferential-bias signal this design avoids, so do not add confidence-weighted drop/demote rules.
REVIEW SUMMARY
Blocking:
- (list issues or "none")
Warnings:
- (list issues or "none")
Suggestions:
- (list issues or "none")
Dropped (below severity floor):
- (count + one-line reason per dropped finding, or "none")
Verdict: blocking_issues | clean | suggestions_only
Task tool (general-purpose):
name: "security-reviewer"
team_name: "code-review"
prompt: |
You are a security reviewer examining code changes.
## Your Lens: Security
Focus on:
- Authentication and authorization flows
- Input validation and sanitization
- Secrets management (hardcoded keys, tokens, passwords)
- OWASP Top 10 risks
- SQL/NoSQL injection
- XSS and CSRF vulnerabilities
- Dependency vulnerabilities
- Dependency provenance: confirm newly-added third-party packages exist and aren't typosquats (slopsquatting) — resolve against the registry (`npm view`, PyPI JSON API), don't judge from memory
- Error messages leaking sensitive information
## Context
Design doc: {design_doc}
Diff: {diff}
Files changed: {files}
## Rules
- Read-only: do NOT modify any files
- Report each finding using the plain-text FINDING format, including the Confidence and Evidence fields
- Send all findings to the lead via SendMessage
- Be specific: include file path, line number, and remediation
Task tool (general-purpose):
name: "quality-reviewer"
team_name: "code-review"
prompt: |
You are a code quality reviewer examining code changes.
## Your Lens: Code Quality
Focus on:
- Code patterns and consistency
- Naming clarity and accuracy
- Error handling completeness
- Test coverage and test quality
- Edge cases not covered
- DRY violations
- YAGNI violations (over-engineering)
- Performance concerns
- Maintainability
## Context
Design doc: {design_doc}
Diff: {diff}
Files changed: {files}
## Rules
- Read-only: do NOT modify any files
- Report each finding using the plain-text FINDING format, including the Confidence and Evidence fields
- Send all findings to the lead via SendMessage
- Distinguish between blocking issues and suggestions
Task tool (general-purpose):
name: "spec-reviewer"
team_name: "code-review"
prompt: |
You are a spec compliance reviewer examining code changes.
## Your Lens: Spec Compliance
Focus on:
- Does implementation match the design doc?
- Does implementation match the plan tasks?
- Are all planned features implemented?
- Are there unplanned features (scope creep)?
- Do interfaces match the specified contracts?
- Are edge cases from the spec handled?
## Context
Design doc: {design_doc}
Plan: {plan}
Diff: {diff}
Files changed: {files}
## Rules
- Read-only: do NOT modify any files
- Report each finding using the plain-text FINDING format, including the Confidence and Evidence fields
- Send all findings to the lead via SendMessage
- Flag both missing features AND unplanned additions
Task tool (general-purpose):
name: "adversarial-reviewer"
team_name: "code-review"
prompt: |
You are a governance reviewer examining code changes for safety regressions.
## Your Lens: Governance & Safety
Focus on:
- HITL (human-in-the-loop) requirements weakened or removed
- Autonomous action scope expanded without corresponding safety gate
- Safety gates, approval steps, or confirmation prompts bypassed or removed
- Permission escalation (new outbound actions, broader tool access)
- Hook behavior or composition routing changes that reduce guardrails
- Bypass patterns: dangerouslyDisableSandbox, --no-verify, force push, auto-approve
- Destructive operations added without confirmation gates
## Context
Design doc: {design_doc}
Diff: {diff}
Files changed: {files}
## Rules
- Read-only: do NOT modify any files
- Report each finding using the plain-text FINDING format, including the Confidence and Evidence fields
- Send all findings to the lead via SendMessage
- A finding is blocking if it removes or weakens an existing safety constraint
- A finding is warning if it adds new autonomous capability without explicit safety design
- A finding is suggestion if it could be made safer but isn't actively dangerous
npx claudepluginhub damianpapadopoulos/auto-claude-skillsCreates bite-sized, testable implementation plans from specs or requirements, with file structure and task decomposition. Activates before coding multi-step tasks.