Help us improve
Share bugs, ideas, or general feedback.
From capture
Automates browser interactions for UI validation and testing via capture CLI: accessibility-driven clicks, typing, screenshots, a11y trees, JS execution, and HAR recording over CDP. Use for validating web app states and features.
How this skill is triggered — by the user, by Claude, or both
Slash command
/capture:captureThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Browser automation via CDP for UI validation. For the full command reference, see [cli-reference.md](cli-reference.md).
Share bugs, ideas, or general feedback.
Browser automation via CDP for UI validation. For the full command reference, see cli-reference.md.
capture session start --url <url> — open a tab and start recordingscreenshot, click, type, a11y, exec, navigatecapture session stop <id> — bundle artifacts; capture session view <id> to inspectSession context auto-fills --target and --har — no manual flag threading needed.
Target selection: If the user hasn't specified which app or browser to target, ask before starting. Don't assume Chrome, a specific port, or a running tab — run capture detect to see what's available and confirm with the user when ambiguous.
When validating a UI feature:
a11y --interactive to understand what's on the pageclick and type using accessible names (not selectors)exec to check DOM or JS state when a11y isn't sufficienthar read to verify network requests if relevant (supports --filter-url, --filter-status, --filter-method, --limit; ID optional in an active session)HAR caveat: the session HAR is populated by the commands that run — most reliably navigate. click/type capture the traffic that fires inside their settle window, but cross-navigation traffic after the frame changes is lossy. For continuous "click around" capture, run capture record --duration N in parallel.
click "Name" over exec with selectors — a11y names are stableclick reports multiple matches, add --role to disambiguatea11y --interactive before interacting to see available elementstype --into "Field Name" clicks the field first, then types — no separate click needednpx claudepluginhub crouton-labs/crouton-kit --plugin capturePlaywright browser automation: navigates URLs, captures screenshots and accessibility snapshots, interacts with UI elements (click, type, fill form), and reports findings with visual evidence.
Automates headless browser via agent-browser CLI: open/navigate sites, snapshot interactive elements for refs, click/fill forms, verify UI, scrape data, e2e test web apps.
Controls a live Chrome browser via puppeteer-core for automation, testing, and performance auditing. Use for clicking, typing, screenshots, DOM/AX tree, network interception, HAR export, Lighthouse audits, and device emulation.