Builds and runs Gemini 2.5 Computer Use agents with Playwright to automate browser tasks via screenshot-function-action loops and safety confirmations for risky actions.
npx claudepluginhub joshuarweaver/cascade-code-general-misc-4 --plugin am-will-codex-skills-5
This skill uses the workspace's default tool permissions.
1. Copy the example env file, set your API key in it, then source it:
   cp env.example env.sh
   $EDITOR env.sh
   source env.sh
2. Create a virtual environment and install dependencies:
   python -m venv .venv
   source .venv/bin/activate
   pip install google-genai playwright
   playwright install chromium
3. Run the agent script with a prompt:
   python scripts/computer_use_agent.py \
     --prompt "Find the latest blog post title on example.com" \
     --start-url "https://example.com" \
     --turn-limit 6
Browser selection is controlled by two environment variables: COMPUTER_USE_BROWSER_CHANNEL and COMPUTER_USE_BROWSER_EXECUTABLE.
If both are set, COMPUTER_USE_BROWSER_EXECUTABLE takes precedence.
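The precedence rule for the two browser variables can be sketched as a small helper. This is only an illustration: the function name `resolve_browser_launch_options` is hypothetical, and the actual logic in scripts/computer_use_agent.py may differ. It assumes the result is passed as keyword arguments to Playwright's browser launch call, which accepts `channel` and `executable_path`.

```python
import os
from typing import Mapping, Optional


def resolve_browser_launch_options(env: Optional[Mapping[str, str]] = None) -> dict:
    """Hypothetical helper: choose launch kwargs from the two env variables."""
    env = os.environ if env is None else env
    executable = env.get("COMPUTER_USE_BROWSER_EXECUTABLE", "").strip()
    channel = env.get("COMPUTER_USE_BROWSER_CHANNEL", "").strip()
    if executable:
        # An explicit executable path wins when both variables are set.
        return {"executable_path": executable}
    if channel:
        return {"channel": channel}
    # Neither variable set: fall back to the default bundled browser.
    return {}
```

With Playwright installed, the result could be used as `playwright.chromium.launch(**resolve_browser_launch_options())`.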
Execute the function_call actions in the response.
If safety_decision is require_confirmation, prompt the user before executing.
Send back function_response objects containing the latest URL + screenshot.
Use --exclude to block risky actions you do not want the model to take.
Files: scripts/computer_use_agent.py, references/google-computer-use.md, env.example
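One turn of the action loop described above might look roughly like the following. This is a library-free sketch, not the code in scripts/computer_use_agent.py: the function `handle_function_calls`, the plain-dict shape of each call, and the three callbacks are all assumptions made for illustration, and the real SDK objects are structured differently. Skipping a declined action (rather than ending the session) is also a choice made only for this sketch.

```python
from typing import Callable, Iterable


def handle_function_calls(
    calls: Iterable[dict],
    execute: Callable[[str, dict], None],
    confirm: Callable[[dict], bool],
    capture_state: Callable[[], dict],
) -> list:
    """Process one turn of model-proposed actions (plain-dict stand-ins)."""
    responses = []
    for call in calls:
        name = call["name"]
        args = call.get("args", {})
        needs_confirmation = call.get("safety_decision") == "require_confirmation"
        if needs_confirmation and not confirm(call):
            # The user declined: do not execute, and report that back.
            responses.append({"name": name, "response": {"skipped": "user declined"}})
            continue
        execute(name, args)
        # After each executed action, report the latest URL and screenshot.
        responses.append({"name": name, "response": capture_state()})
    return responses
```

In a real agent, `execute` would drive the Playwright page, `confirm` would prompt on the terminal, and `capture_state` would return the current URL and a fresh screenshot.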