Help us improve
Share bugs, ideas, or general feedback.
From builder-product
Use before writing any implementation code for a new feature. North star metric, guardrail metrics, and diagnostic metrics must be defined — with baselines — before the build begins. Blocks "we'll figure out the metrics from the data" completions.
npx claudepluginhub rbraga01/a-team --plugin builder-productHow this skill is triggered — by the user, by Claude, or both
Slash command
/builder-product:metric-definitionThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
```
Provides a checklist for code reviews covering functionality, security, performance, maintainability, tests, and quality. Use for pull requests, audits, team standards, and developer training.
Share bugs, ideas, or general feedback.
A METRIC DEFINED AFTER THE BUILD DESCRIBES WHAT WAS BUILT, NOT WHETHER IT SHOULD HAVE BEEN.
"We'll figure out what to measure from the data" means you cannot distinguish success from noise, or know whether to keep, kill, or iterate the feature.
North star + guardrail + diagnostic metrics with baselines, defined before code IS a measurement plan.
Trigger before:
Every feature needs all three types. Missing any one creates a measurement blind spot.
The single metric that answers "did this feature achieve its purpose?"
Properties:
Common failure:
"We'll track engagement" — engagement is not a north star; engagement of what action, by which users, compared to what baseline?
Metrics that must NOT get worse when the north star improves.
Every feature optimisation can create a trade-off. Guardrail metrics make the trade-offs explicit and non-negotiable.
Required: at least two guardrails:
Guardrail 1: [Core product metric] must not drop by more than X%
Guardrail 2: [User trust / satisfaction metric] must not drop below Y
Example:
Feature: improve onboarding completion rate (north star) Guardrail 1: D7 retention must not drop by more than 2pp Guardrail 2: Support ticket volume must not increase by more than 10%
If a guardrail is breached, the feature is not a success — regardless of the north star movement.
Leading indicators that explain why the north star is or isn't moving.
Diagnostic metrics are not success/fail signals — they are debugging tools. When the north star underperforms, diagnostics tell you which part of the funnel broke.
Step 1 metric → Step 2 metric → Step 3 metric → North Star
[If NS is flat, which step is breaking?]
Diagnostic metrics must be available before launch. If they require new instrumentation, that instrumentation is part of the feature scope.
Metrics require data. Data requires logging. Define the logging plan before the build — not after.
For each metric, specify:
If any metric requires new tracking, add it to the feature scope. A metric without a logging plan is not a metric — it is an intention.
Write it in the form: "[Metric] increases from [baseline] to [target] within [timeframe], measured in [system]."
Instrument the baseline before the build if it doesn't exist yet.
For each: name the metric, state the acceptable tolerance, and identify the data source.
List the funnel steps between user entry and the north star metric. Name one diagnostic metric per step.
For every metric, name the event, properties, log destination, and aggregation method.
Run a query against each metric. If the data doesn't exist yet, the logging implementation is part of the feature and must be in scope.
Store at product/metrics/<feature>-<date>.md. This document is the agreement between product and data on what will be measured and how.
These thoughts mean metrics were not defined — stop:
When metric-definition is satisfied, state it like this:
Metrics defined.
File: product/metrics/<feature>-<date>.md ✓
North star: [metric] from [baseline] to [target] by [date] via [data source] ✓
Baseline verified: queried on <date> — result: <value> ✓ / instrumentation required: <scope item>
Guardrail 1: [metric] — tolerance: ≤ X% drop — data source: [system] ✓
Guardrail 2: [metric] — tolerance: ≤ Y% drop — data source: [system] ✓
Diagnostics: <N funnel-step metrics named> ✓
Logging plan: all events defined with properties, destination, aggregation ✓
Instrumentation in scope: <yes — PR: link / no — all events already logged> ✓
A metric without a verified baseline is not a metric. A logging plan that exists only in this document is not a plan.
Features measured after the fact are measured by the outcome they produced, not the outcome they were supposed to produce. Confirmation bias will find the metric that shows success. The north star defined before launch is the only anchor against that bias. Guardrails prevent trading long-term health for short-term north star movement. Diagnostics prevent "the metric isn't moving" from being a dead end.