Skill

visual-review

Install
1
Install the plugin
$
npx claudepluginhub dsifry/metaswarm-marketplace --plugin metaswarm

Want just this skill?

Add to a custom plugin, then install with one command.

Description

Take screenshots of web pages and UI using Playwright for visual review and iteration

Tool Access

This skill uses the workspace's default tool permissions.

Skill Content

Visual Review

Purpose

Agents cannot see rendered web pages, presentations, or UI. This skill bridges that gap by using Playwright to capture screenshots of web pages (local files, localhost servers, or deployed URLs) so agents can visually inspect their work, identify layout issues, and iterate.

This is especially useful for:

  • Reveal.js presentations — capture each slide individually
  • Web UIs — verify layout, styling, and responsiveness
  • Landing pages — check visual design matches intent
  • Email templates — verify rendering

Prerequisites

This skill requires Playwright to be available. Before first use, check and install if needed:

# Check if Playwright is available
npx playwright --version 2>/dev/null

# If not available, install it (one-time setup)
npx playwright install chromium

Important: Only the chromium browser is needed. Do not install all browsers — it wastes disk space and time.

If npx playwright fails entirely, the user may need to install it:

npm install -g playwright
npx playwright install chromium

Workflow

Phase 1: Setup

  1. Verify Playwright is available — run npx playwright --version
  2. If not installed — run npx playwright install chromium and inform the user
  3. Create output directorymkdir -p /tmp/visual-review
  4. Determine the target — local file path, localhost URL, or deployed URL

Phase 2: Capture Screenshots

For a single page:

npx playwright screenshot \
  --browser chromium \
  --viewport-size "1456,816" \
  --wait-for-timeout 3000 \
  "<URL_OR_FILE_PATH>" \
  /tmp/visual-review/page.png

Parameters:

  • --viewport-size "1456,816" — 16:9 aspect ratio, good for presentations
  • --wait-for-timeout 3000 — wait 3 seconds for fonts, images, and animations to settle
  • Use file:///absolute/path/to/file.html for local files
  • Use full URLs for deployed sites

For Reveal.js presentations (multiple slides):

Capture each slide by appending the slide hash to the URL:

# Capture all slides (adjust count as needed)
for i in $(seq 0 15); do
  npx playwright screenshot \
    --browser chromium \
    --viewport-size "1456,816" \
    --wait-for-timeout 2000 \
    "<BASE_URL>#/$i" \
    "/tmp/visual-review/slide-$i.png"
done

Tip: To capture slides with all fragments visible, add ?fragments=true or use JavaScript injection via a Playwright script. The simpler approach is to accept that fragment-heavy slides will appear in their initial (pre-click) state.

For responsive testing:

# Mobile
npx playwright screenshot --viewport-size "390,844" "<URL>" /tmp/visual-review/mobile.png

# Tablet
npx playwright screenshot --viewport-size "768,1024" "<URL>" /tmp/visual-review/tablet.png

# Desktop wide
npx playwright screenshot --viewport-size "1920,1080" "<URL>" /tmp/visual-review/desktop.png

Phase 3: Show and Review Screenshots

Use the Read tool to analyze each screenshot:

Read /tmp/visual-review/slide-0.png
Read /tmp/visual-review/slide-1.png
...

Showing screenshots to the user: The agent can see screenshots via Read, but the user cannot. When the user wants to see what you see — or when you want to discuss a visual issue together — there are two approaches depending on the environment:

Local machine (has a display):

# Open a specific screenshot
open /tmp/visual-review/slide-2.png

# Open all screenshots at once
open /tmp/visual-review/slide-*.png

Remote/headless machine (SSH, tmux, etc.):

Serve the screenshots directory over HTTP so the user can view them in their local browser:

# Start a simple file server (runs in background)
# Use a high port (>10000) to avoid conflicts with dev tools
python3 -m http.server 18080 --directory /tmp/visual-review &

# User can now browse to:
#   http://<hostname>:18080/
#   http://<hostname>:18080/slide-2.png
#   etc.

Pick any open high-numbered port (18080, 19090, etc.) — low ports like 8080 are often taken by dev tools. This serves the entire /tmp/visual-review/ directory with an auto-generated file listing. The user can click through all screenshots in their browser. Stop the server when done:

# Stop the background server
kill %1
# Or find and kill by port
lsof -ti:18080 | xargs kill

Use whichever approach fits the environment. This is useful when collaborating on visual issues, when the user asks what something looks like, or when you want confirmation on a subjective design choice.

For each screenshot, evaluate:

  • Layout — Is content centered/aligned as intended?
  • Typography — Are headings, body text, and code readable?
  • Colors — Do accent colors, backgrounds, and contrasts look right?
  • Spacing — Is there enough whitespace? Any cramped areas?
  • Content — Is all expected content visible? (Note: fragment content may be hidden)
  • Responsiveness — Does it work at the target viewport size?

Phase 4: Document Findings

Create a findings summary:

## Visual Review — [Project Name]

### Reviewed: [date]

| Slide/Page | Status | Notes |
|-----------|--------|-------|
| slide-0   | Good   | Title centered, accent color correct |
| slide-1   | Issue  | Text overflows right edge on mobile |
| ...       | ...    | ... |

### Issues to Fix
1. [Description of issue + which slide/page]
2. [...]

### Looks Good
- [Things that are working well]

Phase 5: Fix and Re-capture

After making fixes:

  1. Re-capture only the affected slides/pages
  2. Review the new screenshots
  3. Repeat until satisfied

Reveal.js-Specific Tips

  • Slide numbering: Reveal.js uses #/0 for the first slide, #/1 for the second, etc.
  • Vertical slides: Use #/horizontal/vertical format (e.g., #/2/1)
  • Fragments: Content with class="fragment" is hidden until clicked. Screenshots show the initial state. This is expected behavior, not a bug.
  • Speaker notes: Not visible in screenshots (they appear in a separate window with S key)
  • Theme loading: Use --wait-for-timeout 3000 to ensure CDN-loaded themes and fonts are ready

Common Mistakes

MistakeWhy It's WrongDo This Instead
Not waiting for page loadFonts/images may not renderUse --wait-for-timeout 3000
Using http:// for local filesFile URLs need file:/// protocolUse file:///absolute/path
Installing all Playwright browsersWastes time and diskOnly install chromium
Treating fragment-hidden content as a bugFragments are designed to reveal on clickNote it as expected
Forgetting to create output directoryScreenshots fail silentlyAlways mkdir -p /tmp/visual-review first
Taking too many screenshots at onceFloods context windowReview in batches of 4-6
Not offering to open when discussing visualsUser can't see what you're describingUse open when collaborating on visual issues

Integration Points

Upstream: Any skill that produces visual output (presentations, web UIs, email templates)

Downstream: Fixes feed back into the source files, then re-capture to verify

Verification Checklist

  • Playwright is installed and working
  • Screenshots captured at correct viewport size
  • All pages/slides reviewed
  • Issues documented with specific descriptions
  • Fixes verified with re-captured screenshots
Stats
Stars132
Forks13
Last CommitFeb 14, 2026
Actions

Similar Skills