Help us improve
Share bugs, ideas, or general feedback.
From grimoire
Conducts usability review using Nielsen's 10 heuristics to identify interface problems without user testing.
npx claudepluginhub jeffreytse/grimoire --plugin grimoireHow this skill is triggered — by the user, by Claude, or both
Slash command
/grimoire:run-heuristic-evaluationThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Apply Nielsen's 10 usability heuristics as an expert inspection method to systematically identify usability problems in an interface before or without user testing.
Evaluates digital interfaces using Nielsen's 10 usability heuristics to identify issues, rate severity (0-4), and recommend fixes.
Evaluates user interfaces using Nielsen's 10 usability heuristics to identify violations like poor system status visibility, inconsistency, and error-prone designs. Reports severity, location, and fixes for UX reviews.
Evaluates interfaces against Nielsen's 10 usability heuristics, produces severity-rated findings (0-4: Cosmetic to Catastrophic), and generates remediation recommendations with effort estimates.
Share bugs, ideas, or general feedback.
Apply Nielsen's 10 usability heuristics as an expert inspection method to systematically identify usability problems in an interface before or without user testing.
Adopted by: Google, Microsoft, Apple, IBM, and virtually every UX team conducting design review, from enterprise software to consumer apps. Taught in every accredited HCI program globally. Impact: Nielsen & Molich (1990) demonstrated that 3–5 evaluators using heuristics find 75% of major usability problems; Nielsen (1993) showed the cost-to-benefit ratio of heuristic evaluation is 48:1 — for every dollar spent, $48 of downstream usability cost is avoided. NNG research shows heuristic evaluation finds approximately 35% more problems per hour than informal design review. Why best: Heuristic evaluation provides a structured vocabulary for usability problems that makes findings communicable, prioritizable, and actionable. Without heuristics, design reviews devolve into aesthetic preference debates. With them, teams can objectively categorize and severity-rate problems, enabling efficient remediation prioritization.
Sources: Nielsen, Jakob — "Usability Engineering" (1993); Nielsen, J. & Molich, R. — "Heuristic evaluation of user interfaces," CHI '90 Proceedings (1990); Nielsen Norman Group — "10 Usability Heuristics for User Interface Design" (updated 2020); ISO 9241-110:2020 — Ergonomics of human-system interaction.
Assemble 3–5 independent evaluators — Nielsen (1993) established that a single evaluator finds approximately 35% of problems; 3 evaluators find 60%; 5 evaluators find 75%. Beyond 5, marginal returns diminish sharply. Each evaluator must conduct their review independently before group discussion to avoid anchoring bias.
Define the evaluation scope — Specify which user flows will be evaluated. A complete product heuristic evaluation is impractical; focus on the highest-risk flows: primary onboarding, core task completion, error recovery, and checkout or conversion paths. Document the scope so all evaluators cover the same ground.
Brief evaluators on the target user context — Evaluators must approach the interface as the target user, not as design experts. Provide: (a) a one-paragraph user persona, (b) the task scenarios to evaluate, (c) the user's technical sophistication level. An evaluator applying expert-user mental models to a novice-user interface will find the wrong problems.
Conduct two passes through each flow — First pass: navigate freely, building a sense of the overall interaction structure. Second pass: evaluate methodically against each heuristic, noting problems as they occur. Both passes are necessary — the first pass prevents premature fixation; the second pass is where systematic documentation happens.
Evaluate against all 10 Nielsen heuristics — For each screen or interaction, check: (1) Visibility of system status; (2) Match between system and real world; (3) User control and freedom; (4) Consistency and standards; (5) Error prevention; (6) Recognition rather than recall; (7) Flexibility and efficiency of use; (8) Aesthetic and minimalist design; (9) Help users recognize, diagnose, and recover from errors; (10) Help and documentation. Document every violation found.
Record problems in a structured log — Each problem entry must include: (a) heuristic violated (number and name); (b) location (screen name, element); (c) description of the problem; (d) screenshot or annotation; (e) severity rating (0–4 scale). Use a shared spreadsheet so all evaluators log to the same format.
Rate severity using the Nielsen scale — 0: Not a usability problem; 1: Cosmetic only — fix if time allows; 2: Minor — low priority; 3: Major — important to fix; 4: Usability catastrophe — imperative to fix before release. Severity is a function of: frequency (how often it occurs), impact (how seriously it affects task completion), and persistence (whether it is a one-time or repeated problem).
Aggregate findings across all evaluators — Combine all individual logs, merge duplicate findings, and calculate the frequency with which each problem was independently identified. Problems found by multiple evaluators are more severe in practice than individual-only finds. Sort the aggregated list by severity × evaluator agreement.
Facilitate a group debrief session — Bring all evaluators together to review the aggregated list. Discuss borderline severity ratings, resolve disagreements about heuristic categorization, and identify patterns (e.g., all H6 violations occur in the settings section — suggesting a systemic problem, not isolated issues). Group discussion also generates preliminary remediation ideas.
Produce a prioritized findings report — Deliver: (a) an executive summary (top 5 issues and business impact); (b) the full findings list sorted by severity; (c) screenshot annotations for each finding; (d) one recommended fix per issue. Frame findings in terms of user behavior and impact, not designer preference. "Users will not know whether their action succeeded" (H1 violation) is more actionable than "the feedback is insufficient."
H1 violation (System status): A file upload interface shows a spinner but provides no progress indicator, no estimated time remaining, and no confirmation when the upload completes. Users cannot distinguish a slow upload from a failed one.
H5 violation (Error prevention): A form allows users to submit without completing required fields, then shows an error. A constrained design disables the submit button until all required fields pass validation, preventing the error rather than recovering from it.
H9 violation (Error recovery): An error message reads: "Error 404: Request failed." It does not explain what went wrong, why it happened, or what the user should do next. A heuristic-compliant message: "We couldn't find that page. It may have moved or been deleted. Return to home or search for what you need."