By spronta
Run technical SEO and AI-search (GEO) audits from your agent: crawl sites for broken links, 404s, redirect chains, missing metadata, and structured data gaps, then get a pass/fail verdict for pre-launch gates or CI pipelines.
Find and fix broken links, dead pages, and redirect chains on a website with crawlie. Use when the user asks to find broken links, 404s, dead links, link rot, redirect chains/loops, or 4xx/5xx pages, or to check that a site's links are healthy. Works even if crawlie is not installed.
Assess how ready a website is to be cited and answered by AI search and LLMs (Generative Engine Optimization / GEO) using crawlie. Use when the user asks about AI search, generative engine optimization, GEO, getting cited by ChatGPT / Perplexity / Google AI Overviews, llms.txt, structured data for AI, or whether their content is "answerable" by LLMs. Works even if crawlie is not installed.
Run a pre-launch / pre-deploy SEO gate on a site or set of pages with crawlie, returning a clear pass/fail. Use when the user is about to launch, deploy, or ship and wants to verify there are no blocking SEO issues (broken links, 5xx, missing titles/canonicals, noindex slips), or wants a CI gate for SEO regressions. Works even if crawlie is not installed.
Run a complete technical SEO + AI-search audit of a website with crawlie. Use when the user asks to audit, crawl, or check a site for SEO issues, broken links, redirects, missing titles/descriptions/metadata, canonical or heading problems, image alt text, duplicate or thin content, or generative-engine (GEO / AI-search) readiness. Works even if crawlie is not installed.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
The fast, free, open-source technical SEO + GEO crawler — built for humans and agents.
Crawl any site for broken links, redirects, missing metadata, and 40+ SEO & Generative-Engine checks — with plain-English guidance on every fix. Runs locally, ships a CLI and an MCP server, and costs nothing.
Setup · CLI · MCP & agents · Use cases · Why I built this · Desktop app · Checks · Compare · Architecture
.png)
The easy way — npm (installs the crawlie CLI and the crawlie-mcp server):
npm i -g crawlie
The macOS app — grab the signed .dmg from Releases.
From source — needs Rust (engine/CLI/MCP) and, for the desktop app, pnpm + Node:
git clone https://github.com/spronta/crawlie
cd crawlie
cargo build --release
# → target/release/crawlie and target/release/crawlie-mcp
# or install onto your PATH:
cargo install --path crates/crawlie-cli # installs `crawlie`
cargo install --path crates/crawlie-mcp # installs `crawlie-mcp`
How it ships: the CLI + MCP come only through npm — the right native binary installs automatically as a platform package (nothing to download or unblock). The desktop app is the only direct download: a signed, notarized
.dmgon Releases.
# Crawl a whole site (respects robots.txt, seeds from sitemap.xml)
crawlie crawl https://example.com --format pretty
# Audit a single page, or a specific set of pages
crawlie audit https://example.com/pricing
crawlie audit https://example.com/a https://example.com/b
# Save a shareable, self-contained HTML report
crawlie crawl https://example.com --format html -o report.html
# Clean JSON on stdout (perfect for piping / scripting / agents)
crawlie crawl https://example.com --format json -o report.json
# Learn why any finding matters and how to fix it
crawlie explain geo-not-answerable
Output formats: pretty (terminal), json (machine-readable, the default), csv (issues), html (shareable file).
Common flags:
| Flag | What it does |
|---|---|
--max-pages <n> | Cap pages fetched (default 500) |
--max-depth <n> | Max click depth from the seed |
--concurrency <n> | Parallel requests (default 16) |
--include <glob> / --exclude <glob> | Scope the crawl by URL pattern |
--no-robots / --no-sitemap / --no-external | Turn off robots.txt, sitemap seeding, external link checks |
--severity error|warning|notice | Only output findings at/above a level |
--save | Save to local report history (crawlie reports, crawlie report <id>) |
--fail-on error|warning | Non-zero exit code for CI gating |
Every crawl returns two scores: a Health score (technical SEO) and a GEO score (AI-search readiness).
crawlie ships a Model Context Protocol server so an LLM agent can run a full audit and act on it — no human in the loop. This is the part most SEO tools don't have.
After npm i -g crawlie, crawlie-mcp is on your PATH. For Claude Desktop, edit claude_desktop_config.json:
{
"mcpServers": {
"crawlie": {
"command": "crawlie-mcp"
}
}
}
For Claude Code:
claude mcp add crawlie crawlie-mcp
(If you built from source instead, use the absolute path to target/release/crawlie-mcp.)
(Any MCP-compatible client works — Cursor, Cline, your own agent. It speaks JSON-RPC over stdio.)
The fastest path. The crawlie plugin bundles the MCP server and a set of skills (audit playbooks) in a single install — the MCP server auto-runs via npx, so you don't even pre-install the binary:
# add this repo as a marketplace, then install the plugin
claude plugin marketplace add spronta/crawlie
claude plugin install crawlie@spronta
npx claudepluginhub spronta/crawlie --plugin crawlieSEO Dungeon skill suite for Claude Code compatibility. 25 sub-skills (21 core + 1 orchestrator + 1 framework integration + 2 extension mirrors) and 18 sub-agents cover technical SEO, content quality, schema, sitemaps, Core Web Vitals, local SEO, backlinks, GEO, and Google APIs.
Scored SEO audits, technical health checks, Schema.org JSON-LD, E-E-A-T content scoring, domain authority rating, and GEO for AI Overviews, ChatGPT, and Perplexity. 24 diagnostic scripts. Modes: Audit → Plan → Execute.
Website audit tool for SEO, performance, security & 230+ rules
Claude Code plugin that tells you why your site won't get cited by AI — and fixes the discoverability files automatically.
Comprehensive SEO analysis plugin for Claude Code. 25 sub-skills (21 core + 1 orchestrator + 1 framework + 2 extension mirrors) and 18 sub-agents cover technical SEO, content quality, schema, sitemaps, Core Web Vitals, local SEO, backlinks, AI/GEO, ecommerce, hreflang, SXO, clustering, drift monitoring, and Google APIs. Includes optional MCP extensions, SPA-aware rendering, portability, and hardened SSRF/DNS-rebinding safe fetchers.
SEO site crawler: page inventory, on-page audit, internal linking analysis, navigation audit, competitor research via Crawl4AI