Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
By zm2231
Web browser automation for Claude Code and AI agents. Headless or headed mode, persistent auth, stealth mode, video recording.
npx claudepluginhub zm2231/browser-skillShare bugs, ideas, or general feedback.
Web browser automation for Claude Code, OpenCode, and AI agents. Uses your Chrome logins via CDP auto-detection.
Step 1: Install the CLI
npm install -g z-agent-browser
z-agent-browser install
Step 2: Install the skill
/plugin marketplace add zm2231/browser-skill
/plugin install browser-skill@browser-skill-marketplace
Paste this into a fresh Claude Code / OpenCode session:
Install and configure browser automation by following the instructions at https://raw.githubusercontent.com/zm2231/browser-skill/main/README.md
If you're an LLM Agent helping set up browser-skill, follow these steps:
npm install -g z-agent-browser
z-agent-browser install
Prompt user to run these commands (copy-paste for them):
/plugin marketplace add zm2231/browser-skill
/plugin install browser-skill@browser-skill-marketplace
Wait for user to confirm they ran the commands.
z-agent-browser open "https://example.com"
z-agent-browser snapshot -i
z-agent-browser close
Say: "Browser automation is ready! Key commands: open, snapshot -i, click @ref, fill @ref "text", close."
z-agent-browser open "https://example.com"
z-agent-browser snapshot -i # Get interactive elements with refs
z-agent-browser click @e1 # Click by ref
z-agent-browser fill @e2 "text" # Fill input
z-agent-browser close
Save login sessions to a JSON file and restore them later:
# First time: Login manually in headed mode
z-agent-browser start --headed
z-agent-browser open "https://github.com"
# [User logs in manually]
z-agent-browser state save ~/.z-agent-browser/github.json
z-agent-browser stop
# Later: Restore session headlessly
z-agent-browser start
z-agent-browser state load ~/.z-agent-browser/github.json
z-agent-browser open "https://github.com" # Already logged in!
Key points:
Connect to your actual Chrome browser with saved passwords.
Important: Chrome 136+ blocks CDP on the default profile for security. You must use
--user-data-dirpointing to a separate directory.
First-time setup (copy your Chrome profile once):
# macOS
mkdir -p ~/.z-agent-browser
cp -R "$HOME/Library/Application Support/Google/Chrome" ~/.z-agent-browser/chrome-profile
# Linux
cp -R ~/.config/google-chrome ~/.z-agent-browser/chrome-profile
Launch Chrome with CDP:
# 1. Quit all Chrome instances
pkill -9 "Google Chrome"
# 2. Launch Chrome with debugging + custom profile
"/Applications/Google Chrome.app/Contents/MacOS/Google Chrome" \
--remote-debugging-port=9222 \
--user-data-dir="$HOME/.z-agent-browser/chrome-profile" &
# 3. z-agent-browser auto-connects!
z-agent-browser open "https://github.com" # You're logged in!
When to use which:
| Use Case | Recommended Mode |
|---|---|
| Background automation | State Save/Load (headless) |
| User needs to see what's happening | start --headed |
| Sites needing saved Chrome passwords | CDP Mode |
| User needs to interact (CAPTCHA, 2FA) | CDP Mode |
| Google/Gmail | CDP Mode (detects headless) |
z-agent-browser open <url> - Navigate to pagez-agent-browser snapshot -i - Get interactive elements with refs (@e1, @e2)| Feature | Command |
|---|---|
| Navigate | open <url> |
| Get elements | snapshot -i |
| Click | click @e1 |
| Fill input | fill @e2 "text" |
| Screenshot | screenshot [path] |
| Headed mode | open <url> --headed |
| Connect CDP | connect 9222 |
| Close | close |
Daemon behavior: z-agent-browser runs as a persistent daemon. To switch modes, close first with z-agent-browser close.
When to use headed mode: Use --headed when user needs to log in manually, solve CAPTCHAs, or verify visual state.
| Feature | Usage |
|---|---|
| Stealth mode | --stealth or AGENT_BROWSER_STEALTH=1 |
| State save | state save ~/.z-agent-browser/site.json |
| State load | state load ~/.z-agent-browser/site.json |
| CDP mode | connect 9222 (for real Chrome) |
| Video recording | record start ./demo.webm / record stop |
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Browser automation with persistent page state. Use when users ask to navigate websites, fill forms, take screenshots, extract web data, test web apps, or automate browser workflows.
Browser automation plugin for agent-driven web app testing, inspection, and debugging.
Anti-detect browser automation skill using Patchright (undetected Playwright fork)
AI-powered browser automation -- lets Claude control real web browsers to navigate, click, type, extract content, and automate workflows
Security auditing for vibe-coded apps. Auto-detects stack (Vercel, Supabase, Cloudflare Workers, Firebase) and runs targeted vulnerability checks.
One-shot HTML visualization for Claude Code. Turn architecture, diffs, plans, and data into styled pages you actually want to read.
Design engineering for Claude Code. Craft, memory, and enforcement for consistent UI. Extracts from live pages, local code, or Stitch projects.
Browser automation skill for Claude Code using Stagehand. Automate web interactions, extract data, and navigate websites using natural language.
Ultra-fast browser automation with 98% token reduction via batch execution and compact snapshots