Skill

code-review

Mandatory quality gate before delivery. Trigger on 'code review', 'review this diff', 'any issues with this', 'check for bugs', 'quality review', 'ready to merge'. ALSO trigger automatically: (1) after verification-before-completion confirms tests pass, (2) before opening a PR/shipping the change, (3) on any non-trivial change that will be committed. Do not skip — correctness bugs and security issues discovered here are cheaper to fix than after merge.

npx claudepluginhub j0hanz/claude-agent-dev-plugin

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/claude-agent-dev:code-review [target: branch, commit, file, or "current diff"]

User invocable

Model invocable

Inline context

Default effort

Argument hint[target: branch, commit, file, or "current diff"]

Tool Access

This skill is limited to the following tools:

Bash(git *)

Disallowed Tools

These tools are removed from Claude's available pool while this skill is active:

WriteEdit

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Code review is not verification (does it work?) and not architecture (is it well-designed?). It is a focused scan for **correctness bugs, security risks, missed reuse, and API hygiene** that survived testing. Catching these in the diff is 10× cheaper than after merge.

Supporting Files

references/patterns.md

SKILL.md

232 lines · ~2.6k tokens

Similar Skills

karpathy-guidelines

168.3k

Provides behavioral guidelines to reduce common LLM coding mistakes, focusing on simplicity, surgical changes, assumption surfacing, and verifiable success criteria.

andrej-karpathy-skills

skill-lookup

163.4k

Searches, retrieves, and installs Agent Skills from prompts.chat registry using MCP tools like search_skills and get_skill. Activates for finding skills, browsing catalogs, or extending Claude.

prompts.chat

debugging-and-error-recovery

52.4k

Guides systematic root-cause debugging when tests fail, builds break, or unexpected errors occur. Provides a structured triage checklist to preserve evidence, localize, and fix issues instead of guessing.

agent-skills

Stats

LanguagePython

Stars0

MaintenanceExcellent

Last CommitJun 7, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

Stats

Actions

Help us improve

Share bugs, ideas, or general feedback.

Code Review

Code review is not verification (does it work?) and not architecture (is it well-designed?). It is a focused scan for correctness bugs, security risks, missed reuse, and API hygiene that survived testing. Catching these in the diff is 10× cheaper than after merge.

Integration Check: Before proceeding, verify that associated unit tests have passed. If you are unsure, ask the user to confirm that tests passed in the current session or run them using verification-before-completion. Do not perform a review if the code hasn't been verified.

NEVER

NEVER accept "I don't see any issues" as a conclusion — state what you checked and why it passed
NEVER review code in isolation — always get a git diff; you need the before/after to spot regressions
NEVER report PASS after only skimming — work through the Phase 2 risk tiers in order
NEVER conflate advisory and blocking issues — mixing them buries the things that actually stop a merge
NEVER review without knowing what the code is supposed to do — read the PR description or ask before scanning

Scope

In scope — hunt for these:

Correctness bugs: off-by-one, type mismatches, unhandled edge cases, logic inversions, null dereferences
Security: command injection, XSS, SQL injection, hardcoded secrets, unsafe deserialization, insecure defaults
Performance regressions: N+1 queries, unbounded loops, unnecessary copies in hot paths
Missed reuse: utilities that already exist in the codebase, duplication across 3+ call sites
API hygiene: breaking changes, undocumented parameters, confusing names on public interfaces

Out of scope — route elsewhere:

Architecture problems → architecture skill
Test adequacy → verification-before-completion
Improving readability/structure → refactor skill
Performance optimization (not regression) → not a review concern

Phase 1: Get the Diff

Get the exact diff that is being reviewed. Use the argument if provided; otherwise default to comparing against the main branch.

git diff origin/main..HEAD            # full branch diff (most common)
git diff HEAD~1                       # last commit only
git diff <commit-sha>                 # specific commit
git show <commit-sha>                 # show single commit with context
git diff --stat origin/main..HEAD     # summary of what changed (run first)

Run --stat first to understand the shape of the change before reading the full diff.

If no git history is available: ask the user for the diff using this format:

File: <path/to/file>
Before:
<paste before-code block>
After:
<paste after-code block>

Note explicitly that you are reviewing without the before/after context and your confidence is lower.

Plugin-specific: For agent-dev plugin files, validate frontmatter in addition to logic — see Phase 2 section.

Phase 2: Risk-Ordered Scan

Scan in this order. Higher tiers surface issues earlier. Do not skip to lower tiers until you have cleared the ones above.

Tier 1 — Security (Stop Everything If Found)

Pattern	Check
Shell / exec call	Are arguments shell-escaped? Is shell interpolation needed at all?
New SQL / query construction	Are all values parameterized — no string concatenation?
Auth / permission check	Is the check before the action, not after?
Secrets / credentials	Are any tokens, keys, or passwords hardcoded or logged?
Deserialization (JSON.parse, pickle, yaml.load)	Is input validated before deserializing?
File path construction	Is user input sanitized against path traversal?

Any Tier 1 finding is a blocking issue — stop and flag immediately.

Tier 2 — Correctness

Pattern	Check
Empty catch / swallowed error	Is the error logged or re-raised with context?
Off-by-one in loops / slices	Check boundary conditions: `< n` vs `<= n`, `[i:]` vs `[i+1:]`
Null / undefined access	Is the value checked before use?
State mutation in unexpected places	Does the function mutate its input? Is that documented?
Async / await gaps	Are all async calls awaited? Is error handling on the awaited value?
Boolean logic	De Morgan's law: `!(A && B)` ≠ `!A && !B`; verify complex conditions

Tier 3 — Performance Regressions

Check only for regressions (new problems introduced by this diff), not pre-existing inefficiency.

Database call or I/O inside a loop that wasn't there before
Copying a large collection where a reference or generator would work
Unbounded retry / recursion with no depth limit
Missing index on a newly-queried column

Tier 4 — Missed Reuse & API Hygiene

Missed reuse: Grep the codebase for the function's purpose before declaring it new.

# Example: looking for existing date formatting utilities
git grep -n "formatDate\|format_date\|dateToString" --

API hygiene (public interface changes only):

Does the function name describe what it returns, not how it works?
Are new optional parameters truly optional (backward-compatible defaults)?
Is any previously-private function now public without documentation?

Plugin-Aware Checks (agent-dev specific)

When the diff touches agent-dev plugin files, run these additional checks:

Skills (skills/*/SKILL.md):

name: kebab-case, max 64 chars, no spaces
description: no angle brackets, max 1024 chars, ends with a triggering phrase
Body: imperative form throughout — no "you should", "you might"
disable-model-invocation: true skills must not be in another skill's skills: preload list

Agents (agents/*.md):

color: must be a named color — red blue green yellow purple orange pink cyan
type: agent present
name matches filename (minus .md)
No undocumented frontmatter fields

Hooks (hooks/hooks.json):

Event names are valid: SessionStart, PreToolUse, PostToolUse, PostToolUseFailure, UserPromptSubmit, Stop
Commands use ${CLAUDE_PLUGIN_ROOT} not hardcoded paths
Matcherless groups have a _comment key documenting intent

Commands (commands/*.md):

Frontmatter only contains documented fields: description, argument-hint, allowed-tools
No name: field (name is derived from filename)

Phase 3: Required Output

Always conclude with this exact structure. Do not omit it even when there are no findings.

## Code Review Result

**Status**: PASS ✓ | FAIL ✗ ([N] blocking)

### Blocking Issues

<!-- One per bullet. Empty if none. -->

- [file:line] [Issue type] — [what the problem is] → [what to fix]

### Advisory Issues

<!-- One per bullet. Empty if none. -->

- [file:line] [Issue type] — [observation] → [suggested improvement]

### What Was Checked

<!-- Provide a concise summary of checks performed per tier. -->

- Tier 1 (Security): [summary]
- Tier 2 (Correctness): [summary]
- Tier 3 (Performance): [summary]
- Tier 4 (Reuse/API): [summary]
- Plugin checks: [if applicable]

Status definitions:

PASS — zero blocking issues; advisory issues may exist but do not stop delivery
FAIL — one or more blocking issues; delivery must not proceed until all are resolved

Blocking vs. Advisory

Category	Blocking	Advisory
Security vulnerabilities	Always	—
Data loss / corruption bug	Always	—
Crash in a code path reachable by input	Always	—
Breaking API change without migration	Always	—
Swallowed error hiding a real failure	Usually	If debug-only path
Missed reuse of an existing utility	Rarely	Normally
Confusing name on internal helper	Never	Yes
Performance concern (non-regression)	Never	Yes

When in doubt, lean advisory — the goal is to unblock delivery, not to gatekeep.

Transition

After Phase 3:

PASS → proceed to open a pull request or merge the changes (zero blocking issues required)
FAIL → route back to implementation; re-run this skill only after the developer confirms all blocking issues are resolved — do not re-review the same unchanged code, and do not downgrade blocking findings to advisory to unblock delivery

Do not merge or submit the pull request until code review returns PASS with zero blocking issues.

Reference Files (Conditional Loading)

Do NOT load these by default. Load only when needed:

Extended pattern catalog with code examples: MANDATORY — READ ENTIRE FILE: references/patterns.md when a finding needs a precise name, canonical anti-pattern description, or example fix.

code-review

Invocation

Tool Access

Disallowed Tools

Context Preview

Supporting Files

SKILL.md

Similar Skills

Help us improve

Help us improve

Find plugins for your project

code-review

Invocation

Tool Access

Disallowed Tools

Context Preview

Supporting Files

SKILL.md

Code Review

NEVER

Scope

Phase 1: Get the Diff

Phase 2: Risk-Ordered Scan

Tier 1 — Security (Stop Everything If Found)

Tier 2 — Correctness

Tier 3 — Performance Regressions

Tier 4 — Missed Reuse & API Hygiene

Plugin-Aware Checks (agent-dev specific)

Phase 3: Required Output

Blocking vs. Advisory

Transition

Reference Files (Conditional Loading)

Similar Skills

Help us improve

Code Review

NEVER

Scope

Phase 1: Get the Diff

Phase 2: Risk-Ordered Scan

Tier 1 — Security (Stop Everything If Found)

Tier 2 — Correctness

Tier 3 — Performance Regressions

Tier 4 — Missed Reuse & API Hygiene

Plugin-Aware Checks (agent-dev specific)

Phase 3: Required Output

Blocking vs. Advisory

Transition

Reference Files (Conditional Loading)