Skill

camoufox

Automates browser interactions with Camoufox MCP: browse URLs, extract page text/structure, fill forms, click, screenshot, and bypass bot walls with realistic fingerprints.

automation

Popularity

Parent stars

Parent forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/camoufox:camoufox

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Camoufox is a privacy-focused Firefox exposed through the `camoufox-mcp-server` MCP server. Reach for it when a plain HTTP fetch is not enough: JavaScript-rendered pages, bot walls, forms, multi-step flows, screenshots, or anything that benefits from a realistic browser fingerprint.

Supporting Files

agents/openai.yamlevals/evals.jsonreferences/json-rpc-debug.mdreferences/sequence-actions.md

SKILL.md

273 lines · ~3.6k tokens

Stats

LanguageTypeScript

Parent stars32

Parent forks8

MaintenanceExcellent

Last CommitJun 19, 2026

Actions

View Source View Plugin View on GitHub View README

Camoufox Browser Automation

Camoufox is a privacy-focused Firefox exposed through the camoufox-mcp-server MCP server. Reach for it when a plain HTTP fetch is not enough: JavaScript-rendered pages, bot walls, forms, multi-step flows, screenshots, or anything that benefits from a realistic browser fingerprint.

Every tool launches or reuses a real browser, so each call costs time and tokens. The whole skill is about getting the answer in the fewest, narrowest calls. Two habits do most of the work:

Pick the narrowest tool for the question (see Choosing a Tool). A page's link list, headings, or one text match is far cheaper than its full rendered text.
Bound every call. Set maxChars, selector, or outputMode: "metadata" so a giant page can't blow up your context.

This skill does not start the server. Confirm an MCP server named camoufox is available first. The installable plugin ships this config with unsafe browser options enabled so hard-site tuning can use firefox_user_prefs, args, and exclude_addons:

{
  "mcpServers": {
    "camoufox": {
      "command": "npx",
      "args": ["-y", "camoufox-mcp-server@latest"],
      "env": { "CAMOUFOX_MCP_ALLOW_UNSAFE_OPTIONS": "1" }
    }
  }
}

Bare npx -y camoufox-mcp-server@latest remains safe by default unless the host config adds that env var.

Tool Names

Hosts expose MCP tool names differently. Use whatever the host lists; common forms:

Claude/Hermes style: mcp_camoufox_browse
OpenClaw bundle style: camoufox__browse
Raw MCP name: browse

This skill uses raw names (browse, browse_find, ...). Map them to your host's form.

In Hermes, browser_navigate is the built-in browser tool, not Camoufox. Use the mcp_camoufox_* tools after hermes mcp test camoufox succeeds.

Choosing a Tool

Start from the question, not from browse. browse returns a wall of page text; the extractor tools return only the slice you asked for, which keeps your context small and the answer easy to read.

You want	Use	Why
Just confirm a page loads / get title + status	`browse` with `outputMode: "metadata"`	No body text at all
The page's readable text / article body	`browse` (default `text`), set `maxChars`	Bounded visible text
Raw HTML (only if you truly need markup)	`browse` with `outputMode: "html"`	Skip unless parsing markup
All links on the page	`browse_links`	Structured list, no body noise
Form fields and submit buttons	`browse_forms`	Names, types, controls only
Page structure / headings / sections	`browse_outline`	Headings + landmarks
Does the page contain "X"? Where?	`browse_find` with `query`	Bounded context around matches
What can I click/type? (interactive map)	`browse_snapshot`	Visible text + ARIA + elements
Navigate + a few actions, then read once	`browse_sequence`	One round trip, bounded actions
A screenshot	`browse_screenshot` (or `screenshot: true` on `browse`)	Image output
Console errors / failed requests	`browse_console`, `browse_network_summary`	Focused diagnostics
Multi-step flow with state between calls	session tools	Persistent page across calls

Rules of thumb:

Need one fact from a page? browse_find beats reading the whole thing.
If the page you want has a predictable URL (.../page/2/, a category, a permalink), navigate straight to it. Reserve browse_sequence click-throughs for when the destination URL isn't knowable up front: search results, JS-built navigation, a control with no stable href. Clicking your way to a page you could have requested directly is slower and more fragile.
Need to act, then read the result, in one shot? browse_sequence. Need to keep a logged-in / cookie-bearing page alive across several unscripted decisions? A session.
selector scopes an extractor to one matching element (e.g. selector: "main"). Two failure modes follow: a selector that matches nothing returns empty (not an error), and a per-item selector on a list (e.g. .quote .author) returns only the first item. For a whole list, scope to the container or drop the selector. If a scoped call comes back empty or short, widen it rather than piling on more calls chasing the same wrong selector.
Want exact or long text the page truncates visually (ellipsised titles, styled labels)? browse_snapshot reads element names from the ARIA tree, which keeps the full string where browse visible-text may cut it off.

First Check

Call camoufox_status before relying on advanced behavior. It returns server, browser, queue, session, and policy state without launching a page.

Fields worth reading:

browserAvailable: must be true, or nothing will run.
unsafeOptionsAllowed: must be true before sending firefox_user_prefs, args, or exclude_addons.
evaluateAllowed: must be true before using the evaluate action in a sequence/session.
maxConcurrency, maxQueue, maxSessions, sessionTtlMs: capacity limits. Sessions auto-expire after sessionTtlMs; don't start more than maxSessions.
activeSessions, queuedRequests: current load.
networkSecurity: the server's application-layer URL policy. ssrfPolicy: "app_layer_best_effort" means best-effort SSRF filtering, not proof of network isolation. Check warning and strictSandboxRequired.

The active default wait strategy and stealth profile are advertised separately, during MCP initialize, at result.capabilities.extensions["camoufox-mcp"].policy (defaultWaitStrategy, defaultStealthProfile) — not in the camoufox_status body.

Packaged plugin default: the bundled config sets CAMOUFOX_MCP_ALLOW_UNSAFE_OPTIONS=1. Bare server installs do not. Do not add CAMOUFOX_MCP_ALLOW_EVALUATE unless the user or project config explicitly opts in.

Common Calls

Metadata only (does it load? title? status):

{ "url": "https://example.com", "outputMode": "metadata" }

Bounded visible text:

{ "url": "https://example.com", "maxChars": 12000 }

Find one thing on a page (cheap, targeted):

{ "url": "https://example.com", "query": "pricing", "maxMatches": 3 }

Map what's interactive before acting:

{ "url": "https://example.com", "maxElements": 80 }

Navigate, act, read once (no session needed):

{
  "url": "https://example.com/login",
  "actions": [
    { "type": "fill", "selector": "#user", "value": "alice" },
    { "type": "fill", "selector": "#pass", "value": "secret" },
    { "type": "click", "selector": "button[type=submit]" },
    { "type": "waitFor", "loadState": "domcontentloaded" }
  ],
  "maxChars": 8000
}

Screenshot:

{ "url": "https://example.com", "screenshot": true }

Sequence Actions

browse_sequence (one round trip) and browse_session_action (one action in a live session) both take the same action objects. Available types: click, hover, fill, type, select, press, waitFor, scroll, evaluate.

Read references/sequence-actions.md for every field, clickMode (DOM vs pointer), frame (acting inside an iframe), and waitFor states. Two things to remember up front:

Prefer fill for setting an input's value; use type only when you need real per-keystroke events (delays, key handlers).
evaluate runs arbitrary JS and is disabled unless the operator sets CAMOUFOX_MCP_ALLOW_EVALUATE=1 (check evaluateAllowed in status first).

Interactive Sessions

Use a session when you need the same page (cookies, login, scroll position, JS state) across several decisions you can't script up front — e.g. log in, look at the result, then decide where to go next. For a fixed known sequence, browse_sequence is cheaper because it's a single call.

Sessions are short-lived and auto-expire after sessionTtlMs. Always close them when done so you don't hold a slot.

Lifecycle:

browse_session_start → returns a sessionId. Pass stealth/privacy options here; they apply for the session's life.
browse_session_navigate → go to a URL in that session.
browse_session_action → run one action (same action objects as sequences).
browse_session_snapshot → read current visible text + interactive elements without acting.
browse_session_resume → after a paused CAPTCHA or human step, wait for load state and re-read.
browse_session_close → free the slot.

Worked example — log in, then branch based on what you see:

// 1. start
{}  // → { "sessionId": "abc123", ... }

// 2. navigate (browse_session_navigate)
{ "sessionId": "abc123", "url": "https://example.com/login" }

// 3. fill + submit (browse_session_action, one per call)
{ "sessionId": "abc123", "action": { "type": "fill", "selector": "#user", "value": "alice" } }
{ "sessionId": "abc123", "action": { "type": "fill", "selector": "#pass", "value": "secret" } }
{ "sessionId": "abc123", "action": { "type": "click", "selector": "button[type=submit]" } }

// 4. read state and decide (browse_session_snapshot)
{ "sessionId": "abc123", "maxElements": 60 }

// 5. close (browse_session_close)
{ "sessionId": "abc123" }

If a navigation or action returns a CAPTCHA pause, hand control to the user, then call browse_session_resume with the same sessionId once they've solved it.

Stealth Profiles

Use stealthProfile as a shortcut, then override individual options only when needed.

Profile	Use
`normal`	Default for most browsing. Humanized cursor, GeoIP, WebRTC blocked.
`privacy`	Adds WebGL blocking. More private, but can be more detectable on strict sites.
`human_assisted`	Visible browser and cache enabled, for when a human may need to interact.
`fast`	Blocks images and disables humanization for speed. More detectable.
`debug`	Enables console and network diagnostics.

Hard-Site Tuning

This is carried-over tuning guidance, not a fresh verification claim. Re-test with camoufox_status and a small browse call in your environment before relying on it.

For Reddit and similarly strict sites:

Prefer stealthProfile: "normal", os: "windows", locale: "en-US".
Keep the default waitStrategy: "domcontentloaded"; it's safer for sites that hold connections open or redirect after the first HTML.
Avoid stealthProfile: "privacy" if WebGL blocking itself seems to trigger detection.
firefox_user_prefs requires unsafeOptionsAllowed: true. Some prefs (e.g. dom.serviceWorkers.enabled) are denied even then; remove any pref the server rejects.

Opt-in tuning payload:

{
  "url": "https://www.reddit.com/",
  "stealthProfile": "normal",
  "os": "windows",
  "locale": "en-US",
  "timeout": 30000,
  "firefox_user_prefs": {
    "media.navigator.enabled": false,
    "privacy.resistFingerprinting": true,
    "network.http.altsvc.enabled": false,
    "dom.battery.enabled": false,
    "intl.accept_languages": "en-US,en;q=0.9"
  }
}

CAPTCHA Handling

The server does not solve CAPTCHAs. It surfaces bounded challenge context for the user or host agent. Set captchaPolicy:

detect: report challenge signals only.
pause: return state for manual action, then call browse_session_resume.
fail: return an error when a challenge is detected.
attempt: return enhanced metadata (provider, iframe hints, suggested strategy, bounded screenshot). Still no hidden bypass.

CAPTCHA_AUTONOMOUS=true marks handling as LLM-assisted and may add provider playbooks, but the server still performs no covert bypass. Use disable_coop: true only when iframe interaction needs it.

Debugging

Read references/json-rpc-debug.md when the host hasn't registered the server or you need to test it through raw JSON-RPC.

Host Setup Failures

Native module errors such as better-sqlite3 compiled for the wrong Node.js version are host or gateway dependency problems, not Camoufox server dependencies. Rebuild the host dependency under the Node version used by that host, then restart the gateway because the old MCP process keeps the old native module loaded.

If the host blocks direct config edits, do not patch protected files. Use the host CLI or tell the operator exactly what to add. For Hermes, this verified command registers Camoufox with unsafe browser options enabled:

hermes mcp add camoufox --command npx --env CAMOUFOX_MCP_ALLOW_UNSAFE_OPTIONS=1 --args -y camoufox-mcp-server@latest

Hermes --env values are KEY=VALUE. --args must be the last option and receives plain argv tokens, not a JSON array string. If repairing ~/.hermes/config.yaml, the entry should look like this:

mcp_servers:
  camoufox:
    command: npx
    args:
      - -y
      - camoufox-mcp-server@latest
    enabled: true
    env:
      CAMOUFOX_MCP_ALLOW_UNSAFE_OPTIONS: "1"

For local development, use command: node and a one-item args list containing the absolute dist/index.js path. Then run hermes mcp list, hermes mcp test camoufox, restart the gateway from a separate terminal, and confirm with mcp_camoufox_camoufox_status.unsafeOptionsAllowed.

If Hermes reports an ambiguous camoufox skill, keep only one installed Camoufox skill path or load the categorized path explicitly.

Common failures:

Missing tools: server not installed/enabled, or plugin installed but not reloaded.
Unsafe option rejected: CAMOUFOX_MCP_ALLOW_UNSAFE_OPTIONS not set, or a denied pref/arg was sent. The server logs which option family it rejected.
evaluate rejected: CAMOUFOX_MCP_ALLOW_EVALUATE not set (evaluateAllowed: false).
Hanging navigation: a call overrode waitStrategy to load/networkidle; revert to domcontentloaded and try a shorter timeout.
Empty output: narrow with selector, switch to browse_snapshot, or check browse_console and browse_network_summary.

camoufox

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

camoufox

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

Camoufox Browser Automation

Tool Names

Choosing a Tool

First Check

Common Calls

Sequence Actions

Interactive Sessions

Stealth Profiles

Hard-Site Tuning

CAPTCHA Handling

Debugging

Host Setup Failures

Similar Skills

Camoufox Browser Automation

Tool Names

Choosing a Tool

First Check

Common Calls

Sequence Actions

Interactive Sessions

Stealth Profiles

Hard-Site Tuning

CAPTCHA Handling

Debugging

Host Setup Failures

Similar Skills