Skill

review-metrics

Track code review health metrics (cycle time, review participation, defect escape) to improve review process. Use when analyzing review effectiveness.

Popularity

Parent stars

Parent forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/code-review-leadership:review-metrics

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Measure what matters in code review: cycle time, participation, and quality impact.

SKILL.md

60 lines · ~869 tokens

Stats

Parent stars13

Parent forks2

MaintenanceFair

Last CommitMar 11, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Review Metrics

Measure what matters in code review: cycle time, participation, and quality impact.

Context

You are helping a tech lead monitor code review health. If you have access to PR data, review metrics, or production defect patterns, use them.

Domain Context

DORA metrics (Forsgren et al., "Accelerate"): lead time for changes and deployment frequency are strong indicators of team health
GitHub data: median review time for high-performing teams is 24 hours; > 48 hours signals a bottleneck
Studies show review cycle time matters more than review depth for shipping quality

Key principles:

Lead time is the key metric: Time from PR open to merge strongly correlates with team velocity and morale
Participation diversity matters: If only 1-2 people review code, you have a bottleneck and knowledge loss
Defect escape rate measures actual quality: Review metrics (coverage, comments) don't matter if defects still reach production

Instructions

Track lead time: Measure time from PR open to merge. Target: < 24 hours. Alert if > 48 hours for > 20% of PRs
Track review participation: What % of PRs are reviewed by at least 2 people? Target: 80%+. Healthy sign of knowledge sharing
Track review cycle time: Time from PR submitted to first review comment. Target: < 4 hours. If > 8 hours, you have a bottleneck
Track reviewer diversity: How many unique reviewers reviewed PRs last month? If < 3 people reviewed 80%+ of PRs, knowledge is concentrated
Track comment count: Average comments per PR. High comments (> 15) might signal unclear standards or unproductive feedback
Track defect escape: What % of defects in production came from code that was reviewed? If > 20%, review standards need adjustment
Correlate to outcomes: When you change review standards or process, re-measure. Did lead time improve? Did defects increase?

Example dashboard:

Metric | Target | Current | Status
─────────────────────────────────
Lead Time | < 24h | 31h | ⚠ (too slow)
Median Review Cycle | < 4h | 6h | ⚠ (bottleneck)
Reviewer Diversity | 3+ | 5 | ✓
Defect Escape Rate | < 20% | 18% | ✓
PRs in Review | < 5 | 12 | ⚠ (backlog building)

Anti-Patterns

Optimizing wrong metrics: LLMs sometimes recommend tracking "comments per PR" as a quality metric. More comments don't mean higher quality. Focus on lead time and defect escape
Metrics without context: "Review time is 2 days" is meaningless without understanding why (team size, complexity, part-time reviewers). Always dig into context
Gaming metrics: If you measure "time to first review," developers will submit incomplete PRs early. Measure what you care about (quality + speed), not just one dimension
No action from metrics: Collecting metrics without improving based on them is pointless. If a metric goes red, investigate and improve

review-metrics

Popularity

Invocation

Context Preview

SKILL.md

review-metrics

Popularity

Invocation

Context Preview

SKILL.md

Review Metrics

Context

Domain Context

Instructions

Anti-Patterns

Further Reading

Similar Skills

Review Metrics

Context

Domain Context

Instructions

Anti-Patterns

Further Reading

Similar Skills