Skill

test-browser

Standalone browser E2E testing skill. Runs agent-browser tests against changed routes from a PR, branch, or current HEAD — without spawning an agent team. Use when the user says "test browser", "run browser tests", "E2E test my changes", "test routes", "browser test PR", "test-browser", or invokes /rune:test-browser. Scope input (optional first argument): - PR number (e.g., "42") → fetches changed files from gh pr view - Branch name (e.g., "feature/login") → diffs against default branch - Empty → diffs current HEAD against default branch Flags: --headed Show browser window (requires display server) --max-routes Cap route count (default 5) Keywords: browser test, E2E test, agent-browser, standalone test, route test, PR test, frontend test, UI verify, screenshot, browser automation. <example> user: "/rune:test-browser 42" assistant: "Fetching changed files from PR #42, mapping to routes, running browser tests..." </example> <example> user: "/rune:test-browser feature/auth --headed" assistant: "Diffing feature/auth vs main, running headed browser tests on changed routes..." </example> <example> user: "/rune:test-browser --max-routes 3" assistant: "Testing current branch changes (capped at 3 routes)..." </example>

From rune

Install

Run in your terminal

npx claudepluginhub vinhnxv/rune --plugin rune

Tool Access

This skill uses the workspace's default tool permissions.

Supporting Assets

View in Repository

CREATION-LOG.md

references/failure-handling.md

references/human-gates.md

references/test-loop-and-report.md

Skill Content

/rune:test-browser — Standalone Browser E2E Testing

Runs browser E2E tests against changed routes without spawning an agent team. Designed for interactive failure handling during development — no team coordination overhead.

Load skills: agent-browser, testing, zsh-compat

# ISOLATION CONTRACT
# This skill MUST NOT call TeamCreate, Agent, or Task.
# All execution is inline — no agent teams, no background workers.
# Rationale: standalone mode enables interactive failure recovery and simpler state management.

References:

scope-detection.md — resolveTestScope() algorithm
test-discovery.md — diff-scoped route discovery
service-startup.md — verifyServerWithSnapshot()
file-route-mapping.md — framework-specific route mapping
human-gates.md — OAuth/payment/2FA gate handling
failure-handling.md — interactive failure recovery

Argument Parsing

Parse $ARGUMENTS before starting:

args = $ARGUMENTS.trim().split(/\s+/)

// Flags
headed = args.includes("--headed")

// --max-routes N
maxRoutesIdx = args.indexOf("--max-routes")
maxRoutes = maxRoutesIdx >= 0 ? parseInt(args[maxRoutesIdx + 1], 10) || 5 : 5

// Scope: first positional arg (not a flag, not the value after --max-routes)
flagsWithValues = ["--max-routes"]
scopeInput = args.filter((a, i) => {
  if (a.startsWith("--")) return false                    // skip flags
  if (i > 0 && flagsWithValues.includes(args[i - 1])) return false  // skip flag values
  return true
})[0] ?? ""

// Validate maxRoutes (injection prevention — must be a positive integer)
if isNaN(maxRoutes) OR maxRoutes < 1 OR maxRoutes > 50:
  maxRoutes = 5

Workflow Overview

Step	Name	Description
0	Installation Guard	Check agent-browser is available
1	Scope Detection	Resolve changed files via resolveTestScope()
2	Route Discovery	Map changed files to testable URLs
3	Mode Selection	Headed vs headless determination
4	Server Verification	Confirm dev server is up via verifyServerWithSnapshot
5	Test Loop	Per-route: navigate → snapshot → interact → verify → screenshot
6	Human Gates	Pause for OAuth/payment/2FA flows (standalone only)
7	Failure Handling	Offer Fix / Todo / Skip for each failure
8	Summary Report	Markdown table with per-route results

Step 0: Installation Guard

result = Bash(`agent-browser --version 2>/dev/null`)
if result is empty OR exit code != 0:
  STOP with message:
    "agent-browser is not installed or not in PATH.
    Install with: npm install -g @vercel/agent-browser
    (Requires explicit user consent for global installation)
    /rune:test-browser cannot proceed without agent-browser."

Do NOT auto-install. User consent for global tool installations is mandatory.

Step 1: Scope Detection

scope = resolveTestScope(scopeInput)
// Returns: { files: string[], source: "pr" | "branch" | "current", label: string }
// See references/scope-detection.md for full algorithm.

log INFO: "Test scope: ${scope.label} (${scope.files.length} changed files)"

if scope.files.length == 0:
  WARN: "No diff scope detected — will attempt full-repo E2E discovery (may be slow)"

Step 2: Route Discovery

// Map changed files to testable routes using E2E Route Discovery algorithm
// See testing/references/test-discovery.md — "E2E Route Discovery" section
// See testing/references/file-route-mapping.md — framework-specific patterns

routes = discoverE2ERoutes(scope.files)

// Cap at maxRoutes (from argument or talisman)
// testingConfig hoisted — single readTalismanSection call for entire skill
testingConfig = readTalismanSection("testing")
talismanMax = testingConfig?.testing?.tiers?.e2e?.max_routes ?? 5
effectiveMax = Math.min(maxRoutes, talismanMax)
routes = routes.slice(0, effectiveMax)

if routes.length == 0:
  log WARN: "No testable routes found for changed files."
  log WARN: "Changed files: ${scope.files.join(', ')}"
  STOP with message: "No routes to test. Verify that changed files map to frontend views."

log INFO: "Routes to test (${routes.length}): ${routes.join(', ')}"

Step 3: Headed/Headless Mode

// Priority: --headed flag > talisman config > headless default
// NOTE: DISPLAY env var intentionally not used — prevents unintended headed mode in CI

if headed:
  mode = "headed"
  modeFlag = "--headed"
else:
  talismanHeaded = testingConfig?.testing?.browser?.headed ?? false

  if talismanHeaded:
    mode = "headed"
    modeFlag = "--headed"
  else:
    mode = "headless"
    modeFlag = ""

log INFO: "Browser mode: ${mode}"

// Warn if headed on a remote/CI machine
if mode == "headed":
  WARN: "--headed mode requires a display server. Do not use on shared/remote machines."

Step 4: Server Verification

// Verify the dev server is responding before running tests
// verifyServerWithSnapshot is defined in testing/references/service-startup.md

baseUrl = testingConfig?.testing?.tiers?.e2e?.base_url ?? "http://localhost:3000"
sessionName = `test-browser-${Date.now()}`

verifyResult = verifyServerWithSnapshot(baseUrl, sessionName)
// verifyServerWithSnapshot opens a page, takes a snapshot, and checks for blank/error states.

if verifyResult != "ok":
  // Standalone mode: abort with instructions
  // verifyServerWithSnapshot returns "ok" | "blank" | "error" | "loading" (plain string)
  instructionMap = {
    "error": "The page returned a server error.",
    "blank": "The page appears blank or empty.",
    "loading": "The page did not finish loading."
  }
  STOP with message:
    "Dev server not responding at ${baseUrl}.
    ${instructionMap[verifyResult] ?? 'Unknown verification issue.'}
    Start your server and re-run /rune:test-browser."

Steps 5-8: Test Loop, Human Gates, Failure Handling, Report

Per-route test loop: navigate → wait networkidle → snapshot → human gate check (human-gates.md) → assertions (console errors, blank/error patterns, snapshot length) → screenshot → failure handling (failure-handling.md, interactive Fix/Todo/Skip). Then generates a markdown summary report with per-route status table, pass/fail/partial/error stats, human gate summary, and screenshot paths.

See test-loop-and-report.md for the full Step 5 test loop and Step 8 report template pseudocode.

Version Guards for agent-browser Features

Before using version-gated features, check availability:

agentBrowserVersion = Bash(`agent-browser --version 2>/dev/null`).trim()
// e.g., "0.15.2" or "agent-browser/0.13.0"
// Parse: extract semver from output

parseVersion(vStr) → { major, minor, patch }
  match = vStr.match(/(\d+)\.(\d+)\.(\d+)/)
  if not match: return { major: 0, minor: 0, patch: 0 }
  return { major: int(match[1]), minor: int(match[2]), patch: int(match[3]) }

v = parseVersion(agentBrowserVersion)

// Feature gates
canAnnotate    = v.minor >= 12  // --annotate flag (v0.12.0+)
canDiff        = v.minor >= 13  // diff subcommand (v0.13.0+)
hasDomainAllow = v.minor >= 15  // AGENT_BROWSER_ALLOWED_DOMAINS (v0.15.0+)
hasAuthVault   = v.minor >= 15  // agent-browser auth save/login (v0.15.0+)
hasContentBounds = v.minor >= 15  // AGENT_BROWSER_CONTENT_BOUNDARIES (v0.15.0+)

Use canAnnotate, canDiff, etc. before calling version-gated commands. Always fail gracefully (skip the feature, not the entire test run) when gated.

Key Differences from arc Phase 7.7

Aspect	/rune:test-browser	arc Phase 7.7 TEST
Agent teams	None — inline execution	Yes — 4 testing agents
Scope input	PR#, branch, or current	Arc plan scope
Failure handling	Interactive (FIX/TODO/SKIP)	Analyst agent
Human gates	AskUserQuestion pause	Auto-skip (PARTIAL)
Server startup	Must already be running	Auto-detect + start
Test tiers	E2E only	Unit + Integration + E2E
Parallel execution	No — serial per route	Yes — parallel workers
Use case	Development feedback	CI / arc pipeline

Talisman Configuration

testing:
  max_routes: 5            # Default cap for route testing
  browser:
    headed: false          # Default headless; override with --headed flag
  human_gates:
    enabled: true          # Set false to auto-skip all gates
  tiers:
    e2e:
      base_url: "http://localhost:3000"  # Override if app runs on different port

Similar Skills

cache-components

Expert guidance for Next.js Cache Components and Partial Prerendering (PPR). **PROACTIVE ACTIVATION**: Use this skill automatically when working in Next.js projects that have `cacheComponents: true` in their next.config.ts/next.config.js. When this config is detected, proactively apply Cache Components patterns and best practices to all React Server Component implementations. **DETECTION**: At the start of a session in a Next.js project, check for `cacheComponents: true` in next.config. If enabled, this skill's patterns should guide all component authoring, data fetching, and caching decisions. **USE CASES**: Implementing 'use cache' directive, configuring cache lifetimes with cacheLife(), tagging cached data with cacheTag(), invalidating caches with updateTag()/revalidateTag(), optimizing static vs dynamic content boundaries, debugging cache issues, and reviewing Cache Component implementations.

138.5k

brainstorming

7 files

You MUST use this before any creative work - creating features, building components, adding functionality, or modifying behavior. Explores user intent, requirements and design before implementation.

109.2k

dispatching-parallel-agents

Use when facing 2+ independent tasks that can be worked on without shared state or sequential dependencies

109.2k

Stats

Parent Repo Stars1

Parent Repo Forks0

Last CommitMar 12, 2026

Actions

View Source View Plugin View on GitHub View README