This skill handles browser automation tasks using **subagent architecture** to keep the main context window clean.
From kasnpx claudepluginhub kastheco/claude-plugins --plugin kasThis skill uses the workspace's default tool permissions.
Designs and optimizes AI agent action spaces, tool definitions, observation formats, error recovery, and context for higher task completion rates.
Enables AI agents to execute x402 payments with per-task budgets, spending controls, and non-custodial wallets via MCP tools. Use when agents pay for APIs, services, or other agents.
Compares coding agents like Claude Code and Aider on custom YAML-defined codebase tasks using git worktrees, measuring pass rate, cost, time, and consistency.
This skill handles browser automation tasks using subagent architecture to keep the main context window clean.
NEVER call Claude-in-Chrome MCP tools directly in main context.
All browser automation is delegated to the browser-automation Task subagent which returns concise summaries.
Activate this skill when detecting:
When a browser automation task is detected:
Task("Browser: {brief_description}
**Task:** {user_request}
**URL:** {target_url if provided}
**Actions:** {list of expected actions}
Using Claude-in-Chrome MCP tools:
1. Start with tabs_context_mcp to get current tab state
2. Create new tab or use existing based on context
3. Navigate, interact, extract data as needed
4. Capture screenshots/GIFs for multi-step workflows
5. Monitor console/network if debugging
Return ONLY a concise summary:
- Actions taken (e.g., "Navigated to X, filled form, clicked submit")
- Key findings (e.g., "Form submitted successfully", "Error: XYZ")
- Screenshots/visual evidence if captured
- Any errors or unexpected behavior
Do NOT include:
- Raw HTML or page source
- Verbose console logs (use pattern filtering)
- Detailed network request payloads
- Implementation details
subagent_type: browser-automation
")
User: "Test if the login page at example.com works"
Delegation:
Task("Browser: Test login page functionality
**Task:** Verify login page at example.com loads and form is interactive
**URL:** https://example.com/login
**Actions:**
- Navigate to URL
- Screenshot the page
- Verify form elements exist (username, password, submit)
- Check console for errors
subagent_type: browser-automation
")
User: "Get all product prices from shop.example.com/products"
Delegation:
Task("Browser: Extract product prices
**Task:** Scrape product prices from shop.example.com/products
**URL:** https://shop.example.com/products
**Actions:**
- Navigate to products page
- Find all price elements
- Extract text content
- Return structured list
subagent_type: browser-automation
")
User: "Record a GIF of searching for 'laptops' on the site"
Delegation:
Task("Browser: Record search workflow
**Task:** Create GIF recording of search process
**URL:** {provided or current}
**Actions:**
- Start GIF recording
- Navigate to search
- Enter 'laptops' in search box
- Submit search
- Wait for results
- Stop recording and export GIF
subagent_type: browser-automation
")
The browser-automation agent is configured to:
After subagent returns, summarize for user:
{Brief action summary}
{Key findings or results}
{Screenshot/GIF if captured}
{Next steps or recommendations if applicable}
Keep it concise and actionable.