Help us improve
Share bugs, ideas, or general feedback.
From builder-product
Use before any AI feature is approved for build. Covers four AI-specific risks that standard PRD review misses: hallucination UX, trust calibration, scope creep, and reversibility. Blocks "we validated it like any other feature" completions.
npx claudepluginhub rbraga01/a-team --plugin builder-productHow this skill is triggered — by the user, by Claude, or both
Slash command
/builder-product:ai-feature-validationThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
```
Provides a checklist for code reviews covering functionality, security, performance, maintainability, tests, and quality. Use for pull requests, audits, team standards, and developer training.
Share bugs, ideas, or general feedback.
AI FEATURES HAVE FAILURE MODES THAT DETERMINISTIC FEATURES DO NOT.
"We validated it like any other feature" misses hallucination UX, trust calibration, scope creep, and irreversibility — none of which appear in a standard acceptance criterion.
All four AI-specific risks addressed in the PRD IS AI feature validation.
Trigger before:
Standard PRD review asks: does the feature solve the user problem? AI feature validation asks four additional questions that standard PRDs never reach.
What does the product look like when the model is wrong?
Every AI feature produces incorrect outputs. The question is not "will it hallucinate?" but "what does the UX do when it does?"
Required answers in the PRD:
BLOCK if:
Is the user's trust in this feature appropriately calibrated?
AI features fail in two trust directions:
Required answers in the PRD:
ai-onboarding-design from builder-design)BLOCK if:
What happens as the model gets better — does the feature's scope expand automatically?
Model improvements can silently expand what the AI can do. Without boundaries:
Required answers in the PRD:
Can the consequences of an AI-assisted action be undone?
For AI features that take actions (send, publish, modify, delete, pay):
| Action type | Reversibility requirement |
|---|---|
| Read-only (summarise, explain, draft) | No additional requirement |
| Low-stakes write (tag, categorise, label) | Audit log + bulk revert |
| High-stakes write (send, publish, post) | Explicit user confirmation before execution |
| Irreversible (delete, transfer, pay) | Human-in-the-loop checkpoint, no autonomous execution |
BLOCK if:
Write one sentence: "The worst plausible incorrect output for this feature is [specific example]." Then design the UX for that scenario.
Decide: should users trust this output (1) always with verification, (2) as a draft to review, or (3) as a suggestion to consider? Design the visual treatment to match.
One sentence: "This AI [answers / generates / predicts] [specific scope]. It does not [specific exclusion]." This goes in the empty state and the first-run experience.
List every action the AI assists with or takes. Assign a reversibility tier. Any irreversible action with no confirmation checkpoint is a BLOCK.
These four risks must appear as named sections in the PRD, with explicit answers. They are not implicit in acceptance criteria.
Store at product/ai-validation/<feature>-<date>.md.
These thoughts mean AI-specific risks were not addressed — stop:
When ai-feature-validation is satisfied, state it like this:
AI feature validation complete.
File: product/ai-validation/<feature>-<date>.md ✓
Hallucination UX:
Worst-case incorrect output: [described] ✓
Distinguishable from correct output: yes (confidence signal: [type]) ✓
Correction mechanism: [override / flag / report] ✓
High-stakes disclaimer: [yes / N/A] ✓
Trust calibration:
Target trust level: [authoritative / draft-to-review / suggestion] ✓
Design signal: [confidence score / citation / verify copy / etc.] ✓
Trust measurement plan: [metric and method] ✓
Capability boundary:
Stated: "[this AI answers/generates X, does not Y]" ✓
Communicated to users: [empty state / onboarding / always-visible label] ✓
Scope expansion control: [model version pinned / output filter / explicit gating] ✓
Reversibility:
Action audit: <N actions, each assigned a tier>
Irreversible actions: [none / listed — each has confirmation checkpoint ✓]
Audit trail: [yes / N/A — read-only feature] ✓
BLOCK items: [none / listed]
Any BLOCK item means the PRD is not approved for development. Not for negotiation, not for "we'll address it in v2."
Deterministic features fail loudly: a button that doesn't submit shows an error. AI features fail silently: an incorrect output that looks correct is acted on. The four risks above are the failure modes that standard product quality gates are not designed to catch — because they weren't needed before LLMs. Without this validation, AI features ship with hallucination UX designed by default, trust levels set by accident, capability boundaries discovered by users, and irreversible actions taken without checkpoints.