Help us improve
Share bugs, ideas, or general feedback.
From claudecode-research-harness-workflow
Automates browser interactions via the agent-browser CLI: navigation, form filling, clicking, screenshotting, and UI state checking. Use AI snapshot workflow to interact with elements by reference.
npx claudepluginhub maxwell2732/claudecode-research-harness-workflow --plugin claudecode-research-harness-workflowHow this skill is triggered — by the user, by Claude, or both
Slash command
/claudecode-research-harness-workflow:agent-browser [url] [--headless][url] [--headless]This skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
A skill for browser automation. Uses the agent-browser CLI to perform UI debugging, verification, and automated interaction.
Guides technical evaluation of code review feedback: read fully, restate for understanding, verify against codebase, respond with reasoning or pushback before implementing.
Share bugs, ideas, or general feedback.
A skill for browser automation. Uses the agent-browser CLI to perform UI debugging, verification, and automated interaction.
This skill is automatically invoked by the following phrases:
| Feature | Details |
|---|---|
| Browser Automation | See references/browser-automation.md |
| AI Snapshot Workflow | See references/ai-snapshot-workflow.md |
# Check installation
which agent-browser
# If not installed
npm install -g agent-browser
agent-browser install
| Request Type | Action |
|---|---|
| Open a URL | agent-browser open <url> |
| Click an element | Snapshot → agent-browser click @ref |
| Fill a form | Snapshot → agent-browser fill @ref "text" |
| Check state | agent-browser snapshot -i -c |
| Take screenshot | agent-browser screenshot <path> |
| Debug | agent-browser --headed open <url> |
For most operations, first take a snapshot and then interact using element references:
# 1. Open the page
agent-browser open https://example.com
# 2. Take a snapshot (AI-optimized, interactive elements only)
agent-browser snapshot -i -c
# Sample output:
# - link "Home" [ref=e1]
# - button "Login" [ref=e2]
# - input "Email" [ref=e3]
# - input "Password" [ref=e4]
# - button "Submit" [ref=e5]
# 3. Interact via element references
agent-browser click @e2 # Click Login button
agent-browser fill @e3 "user@example.com"
agent-browser fill @e4 "password123"
agent-browser click @e5 # Submit
# Check current state via snapshot
agent-browser snapshot -i -c
# Or check the URL
agent-browser get url
# Take a screenshot
agent-browser screenshot result.png
| Command | Description |
|---|---|
open <url> | Open a URL |
snapshot -i -c | AI-optimized snapshot |
click @e1 | Click an element |
fill @e1 "text" | Fill a form |
type @e1 "text" | Type text |
press Enter | Press a key |
screenshot [path] | Take a screenshot |
close | Close the browser |
| Command | Description |
|---|---|
back | Go back |
forward | Go forward |
reload | Reload |
| Command | Description |
|---|---|
get text @e1 | Get text |
get html @e1 | Get HTML |
get url | Current URL |
get title | Page title |
| Command | Description |
|---|---|
wait @e1 | Wait for element |
wait 1000 | Wait 1 second |
| Command | Description |
|---|---|
--headed | Show browser |
console | Console logs |
errors | Page errors |
highlight @e1 | Highlight element |
Manage multiple tabs/sessions in parallel:
# Specify a session
agent-browser --session admin open https://admin.example.com
agent-browser --session user open https://example.com
# List sessions
agent-browser session list
# Operate in a specific session
agent-browser --session admin snapshot -i -c
| Tool | Recommendation | Use Case |
|---|---|---|
| agent-browser | ★★★ | First choice. Powerful AI-optimized snapshots |
| chrome-devtools MCP | ★★☆ | When Chrome is already open |
| playwright MCP | ★★☆ | Complex E2E testing |
Principle: Try agent-browser first; use MCP tools only if it doesn't work.
--headed to show the browserclosed