Skill

visual-regression-testing

Percy/Chromatic visual regression with screenshot comparison, component visual tests, and responsive breakpoints

Install

Run in your terminal

npx claudepluginhub javimontano/jm-adk-alfa

Tool Access

This skill uses the workspace's default tool permissions.

Supporting Assets

View in Repository

agents/guardian.md

agents/lead.md

agents/specialist.md

agents/support.md

evals/evals.json

knowledge/body-of-knowledge.md

knowledge/knowledge-graph.md

prompts/meta.md

prompts/primary.md

prompts/variations/deep.md

prompts/variations/quick.md

templates/output.docx.md

templates/output.html

Skill Content

Similar Skills

skill-lookup

Searches, retrieves, and installs Agent Skills from prompts.chat registry using MCP tools like search_skills and get_skill. Activates for finding skills, browsing catalogs, or extending Claude.

prompts.chat

157.5k

prompt-lookup

Searches prompts.chat for AI prompt templates by keyword or category, retrieves by ID with variable handling, and improves prompts via AI. Use for discovering or enhancing prompts.

prompts.chat

157.5k

agent-eval

Compares coding agents like Claude Code and Aider on custom YAML-defined codebase tasks using git worktrees, measuring pass rate, cost, time, and consistency.

everything-claude-code

139.2k

Stats

Stars1

Forks0

Last CommitMar 28, 2026

Actions

View Source View Plugin View on GitHub View README

086 — Visual Regression Testing {Testing}

Purpose

Detect unintended visual changes by comparing screenshots against approved baselines. Catch CSS regressions, layout shifts, and responsive breakpoint failures before they reach production. [EXPLICIT]

Physics — 3 Immutable Laws

Law of Pixel Truth: The rendered pixel output is the final source of truth. Code review cannot catch all visual regressions — screenshots can. [EXPLICIT]
Law of Approved Baselines: Every visual test has an approved baseline image. Changes require explicit human approval — never auto-approve. [EXPLICIT]
Law of Viewport Coverage: Test at minimum 3 breakpoints: mobile (375px), tablet (768px), desktop (1280px). Responsive bugs hide between breakpoints. [EXPLICIT]

Protocol

Phase 1 — Tool Setup

Choose platform: Chromatic (Storybook integration) or Percy (framework-agnostic). [EXPLICIT]
For Chromatic: npx chromatic --project-token=<token> in CI. [EXPLICIT]
For Percy: @percy/cli + @percy/playwright or @percy/cypress. [EXPLICIT]
Configure viewports: [375, 768, 1280] widths. [EXPLICIT]

Phase 2 — Test Authoring

Component-level: Capture every Storybook story variant (default, hover, active, disabled, error). [EXPLICIT]
Page-level: Capture critical pages at each breakpoint after data loads. [EXPLICIT]
Interaction states: Screenshot after modal open, dropdown expand, form validation. [EXPLICIT]
Set diff threshold: 0.1% pixel difference triggers review. [EXPLICIT]

Phase 3 — Review Workflow

CI posts visual diff link to PR. [EXPLICIT]
Reviewer approves or rejects each changed snapshot. [EXPLICIT]
Approved snapshots become new baselines. [EXPLICIT]
Rejected snapshots require code fix and re-run. [EXPLICIT]

I/O

Input	Output
Storybook stories / page routes	Baseline screenshot set
Code change (PR)	Visual diff report with before/after
Reviewer approval	Updated baseline images
Viewport config	Multi-breakpoint screenshots

Quality Gates — 5 Checks

All component stories have visual snapshots — new component = new snapshot. [EXPLICIT]
3 viewport widths minimum per visual test. [EXPLICIT]
Diff threshold <= 0.1% — anything above requires review. [EXPLICIT]
No auto-approved changes — human reviews every visual diff. [EXPLICIT]
CI blocks merge on unreviewed visual changes. [EXPLICIT]

Edge Cases

Animation/transition: Disable CSS animations in test env (* { animation: none !important; }).
Dynamic content: Use fixed seed data (skill 085) to eliminate data-driven diffs.
Font loading: Wait for fonts via document.fonts.ready before screenshot.
Dark mode: Capture both light and dark theme variants.

Self-Correction Triggers

Frequent false positives (>20% of diffs are noise) → increase threshold or stabilize test data.
New component merged without visual test → block in review checklist.
Baseline drift (100+ pending approvals) → dedicated baseline update session.
Screenshot flakiness → check font loading, animation disabling, data seeding.

Usage

Example invocations:

"/visual-regression-testing" — Run the full visual regression testing workflow
"visual regression testing on this project" — Apply to current context

Assumptions & Limits

Assumes access to project artifacts (code, docs, configs) [EXPLICIT]
Requires English-language output unless otherwise specified [EXPLICIT]
Does not replace domain expert judgment for final decisions [EXPLICIT]