Skill

capture

Automates browser interactions for UI validation and testing via capture CLI: accessibility-driven clicks, typing, screenshots, a11y trees, JS execution, and HAR recording over CDP. Use for validating web app states and features.

JavaScript

Popularity

Parent stars

Parent forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/capture:capture

User invocable

Model invocable

Inline context

Default effort

Tool Access

This skill is limited to the following tools:

Bash(capture:*)ReadGlobGrep

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Browser automation via CDP for UI validation. For the full command reference, see [cli-reference.md](cli-reference.md).

Supporting Files

cli-reference.md

SKILL.md

42 lines · ~597 tokens

Stats

LanguageJavaScript

Parent stars24

Parent forks3

MaintenanceExcellent

Last CommitApr 18, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

Stats

Actions

Help us improve

Share bugs, ideas, or general feedback.

Capture — Browser Automation Skill

Browser automation via CDP for UI validation. For the full command reference, see cli-reference.md.

Session Basics

capture session start --url <url> — open a tab and start recording
Interact: screenshot, click, type, a11y, exec, navigate
capture session stop <id> — bundle artifacts; capture session view <id> to inspect

Session context auto-fills --target and --har — no manual flag threading needed.

Target selection: If the user hasn't specified which app or browser to target, ask before starting. Don't assume Chrome, a specific port, or a running tab — run capture detect to see what's available and confirm with the user when ambiguous.

Validation Pattern

When validating a UI feature:

Start a session targeting the page under test
Use a11y --interactive to understand what's on the page
Interact via click and type using accessible names (not selectors)
Take screenshots at key states
Use exec to check DOM or JS state when a11y isn't sufficient
Use har read to verify network requests if relevant (supports --filter-url, --filter-status, --filter-method, --limit; ID optional in an active session)
Stop the session and report pass/fail with evidence

HAR caveat: the session HAR is populated by the commands that run — most reliably navigate. click/type capture the traffic that fires inside their settle window, but cross-navigation traffic after the frame changes is lossy. For continuous "click around" capture, run capture record --duration N in parallel.

Interaction Tips

Always prefer click "Name" over exec with selectors — a11y names are stable
If click reports multiple matches, add --role to disambiguate
Run a11y --interactive before interacting to see available elements
type --into "Field Name" clicks the field first, then types — no separate click needed
After interactions, screenshot to capture the resulting state

capture

Popularity

Invocation

Tool Access

Context Preview

Supporting Files

SKILL.md

Help us improve

Help us improve

Find plugins for your project

capture

Popularity

Invocation

Tool Access

Context Preview

Supporting Files

SKILL.md

Capture — Browser Automation Skill

Session Basics

Validation Pattern

Interaction Tips

Similar Skills

Help us improve

Capture — Browser Automation Skill

Session Basics

Validation Pattern

Interaction Tips

Similar Skills