Automate Playwright E2E testing workflows: run test suites with capture of DOM snapshots, network traces, console output, and screenshots; triage failures by type; investigate root causes; apply targeted fixes to tests or page objects; generate new tests via interactive app exploration and POM patterns.
Use this agent to investigate a single failing E2E test. Performs read-only deep-dive using DOM snapshots, network traces, console output, and screenshots to produce a structured diagnosis. Never modifies code. Examples: <example>Context: A test failure needs root cause analysis. user: 'Investigate why the checkout test at tests/checkout.spec.ts:42 is failing' assistant: 'I will use the bug-investigator agent to analyze the failure evidence and produce a diagnosis.' <commentary>Deep investigation of a single failure, producing a diagnosis report for the fixer.</commentary></example><example>Context: Triage identified a new failure that needs understanding. user: 'This test started failing after the deploy, figure out why' assistant: 'I will use the bug-investigator to trace the root cause through DOM state, network, and console logs.' <commentary>Read-only investigation to understand what changed.</commentary></example>
Use this agent to apply a code fix to a failing E2E test based on an investigation diagnosis. Takes a structured diagnosis and applies the minimal change to make the test pass. Runs in a worktree for parallel safety. Examples: <example>Context: An investigator diagnosed a missing test step. user: 'Fix checkout.spec.ts:42 — needs selectShippingMethod call after line 42' assistant: 'I will use the test-fixer agent to apply the fix and verify the test passes.' <commentary>Targeted code fix based on a diagnosis.</commentary></example><example>Context: A test has a stale selector after UI changes. user: 'The login test selector needs updating from role button Submit to role button Sign In' assistant: 'I will use the test-fixer agent to update the selector and verify.' <commentary>Simple code fix with clear instructions.</commentary></example>
Use this agent to write new Playwright E2E tests from scratch by interactively exploring the application. Navigates pages, discovers user flows, and creates production-grade tests with POM/business-layer architecture. Runs in a worktree for parallel safety. Examples: <example>Context: User needs tests for a new feature. user: 'Write e2e tests for the new checkout flow' assistant: 'I will use the test-writer agent to explore the checkout flow and create comprehensive tests.' <commentary>New feature needs end-to-end test coverage from scratch.</commentary></example><example>Context: Coverage gaps identified by triage. user: 'We have no tests for the settings page' assistant: 'I will use the test-writer agent to explore the settings page and write tests for it.' <commentary>Missing test coverage for an existing feature.</commentary></example>
Use this agent to triage the full E2E test suite. Runs all tests, classifies every failure, manages project knowledge (error patterns, flows, triage history), creates tasks, and dispatches investigator/fixer agents. Examples: <example>Context: User wants a full suite health check. user: 'Triage the e2e suite' assistant: 'I will use the triage-lead agent to run all tests, classify failures, and coordinate fixes.' <commentary>Full suite triage with failure classification and parallel fixer coordination.</commentary></example><example>Context: CI is red and needs investigation. user: 'CI tests are failing, figure out what is broken' assistant: 'I will use the triage-lead agent to run the suite, identify root causes, and dispatch fixers.' <commentary>Suite-level investigation requires the triage coordinator.</commentary></example>
Systematically investigate and fix failing Playwright E2E tests using captured action data, screenshots, DOM snapshots, network requests, and console output.
Run all E2E tests, categorize failures (known issue, app bug, test update, flaky, new), cross-reference Jira, and generate a management-ready report.
Write new Playwright E2E tests following project conventions, with POM/business-layer architecture, network-aware stability patterns, and quality validation.
Admin access level
Server config contains admin-level keywords
Uses power tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Uses Bash, Write, or Edit tools
Uses Bash, Write, or Edit tools
A Claude Code plugin that debugs and fixes Playwright E2E tests autonomously. It runs your tests with full action capture — DOM snapshots, network requests, console output, screenshots — then investigates failures like a senior QA engineer and ships the fix.
https://github.com/user-attachments/assets/26f734a5-d05e-41c9-bc3f-2b58561c2ce0
# Add the marketplace
/plugin marketplace add kaizen-yutani/playwright-autopilot
# Install the plugin
/plugin install kaizen-yutani/playwright-autopilot
Then ask Claude to fix a failing test or triage your whole suite:
/playwright-autopilot:fix-e2e tests/checkout.spec.ts
/playwright-autopilot:triage-e2e e2e
Or just describe what you need — Claude will use the MCP tools automatically:
Fix all failing e2e tests in the "e2e" project
Every browser action during a test run is captured with:
When a test fails, Claude doesn't guess — it reads the actual page state, checks for failed API calls, and traces the root cause through the action timeline.
A lightweight CJS hook (captureHook.cjs) is injected via NODE_OPTIONS --require into Playwright's test worker processes. It monkey-patches BrowserContext._initialize to add an instrumentation listener that captures every browser action with full context. No modifications to Playwright's source code required — works with any Playwright installation.
The plugin exposes 37 tools via the Model Context Protocol that Claude calls on-demand. This is token-efficient by design — instead of dumping entire traces into context, Claude pulls only what it needs:
Test Execution & Debugging:
| Tool | Purpose |
|---|---|
e2e_list_projects | List Playwright projects from config |
e2e_list_tests | Discover test files and cases |
e2e_run_test | Run tests with action capture, flaky detection (retries, repeatEach) |
e2e_get_failure_report | Error + DOM + network + console summary |
e2e_get_evidence_bundle | All failure evidence in one call — ready for Jira |
e2e_generate_report | Self-contained HTML or JSON report file |
e2e_suggest_tests | Test coverage gap analysis |
e2e_get_actions | Step-by-step action timeline |
e2e_get_action_detail | Deep dive into a single action |
e2e_get_dom_snapshot | Aria tree before/after an action |
e2e_get_dom_diff | What changed in the DOM |
e2e_get_network | Network requests with filtering |
e2e_get_console | Console output with filtering |
e2e_get_screenshot | Failure screenshot as image |
e2e_get_test_source | Test file with failing line highlighted |
e2e_find_elements | Search DOM for specific elements |
e2e_scan_page_objects | Index all page objects and methods |
e2e_get_app_flows | Read stored application flows |
e2e_save_app_flow | Save a verified user journey |
e2e_get_context | Flows + page object index in one call |
e2e_discover_flows | Auto-scan specs for draft flow map |
e2e_build_flows | Auto-run uncovered tests and save their flows |
e2e_get_stats | Suite health dashboard: pass rate trends, flaky scores, category breakdowns |
e2e_save_triage_run | Save a categorized triage run for trend tracking |
e2e_get_triage_config | Read triage settings (Jira config, flaky threshold) |
Interactive Browser Exploration:
| Tool | Purpose |
|---|---|
browser_navigate | Open a URL (launches browser automatically) |
browser_navigate_back | Go back in browser history |
browser_snapshot | Capture ARIA accessibility tree with [ref=X] markers |
browser_click | Click an element by ref |
browser_type | Type into an input field, optionally submit |
browser_fill_form | Fill multiple form fields in one call |
browser_select_option | Select a dropdown option |
browser_press_key | Press a key (Enter, Escape, Tab, etc.) |
browser_hover | Hover over an element |
browser_take_screenshot | Capture a PNG screenshot |
browser_set_headers | Set custom HTTP headers (same-origin only for CORS safety) |
browser_close | Close the browser |
The browser_* tools launch a real Chrome instance and let Claude explore your application interactively — navigate pages, click elements, fill forms, and observe page state through ARIA snapshots. Each interaction returns timing, network requests, DOM changes, and an updated snapshot. Use this to understand an app before writing tests, debug UI issues visually, or verify fixes.
npx claudepluginhub kaizen-yutani/playwright-autopilot --plugin playwright-autopilotBrowser automation and E2E testing with Playwright. Auto-detects dev servers, writes clean test scripts. Test pages, fill forms, take screenshots, check responsive design, validate UX, test login flows, check links, automate any browser task. Use for cross-browser testing, visual regression, API testing, component testing in TypeScript/JavaScript and Python projects.
End-to-end test automation with Playwright, Cypress, and Selenium for browser-based testing
Production-grade Playwright testing toolkit. Generate tests from specs, fix flaky failures, migrate from Cypress/Selenium, sync with TestRail, run on BrowserStack. 55+ ready-to-use templates, 3 specialized agents, smart reporting that plugs into your existing workflow.
End-to-end test execution and recording for web applications
Ultra-specialized agent for E2E testing using Playwright MCP. Creates comprehensive test plans, tests all pages, verifies user flows by role, and ensures complete test coverage with visual browser testing.
Three specialized agents (planner, generator, healer) and shared skills for Playwright E2E testing with Page Object Model pattern