Help us improve
Share bugs, ideas, or general feedback.
From casper
Scrapes web pages and websites using Firecrawl API, converting to clean markdown. Handles JavaScript rendering, anti-bot protection, paywalled content, and dynamic sites for articles, blogs, docs.
npx claudepluginhub casper-studios/casper-marketplace --plugin casperHow this skill is triggered — by the user, by Claude, or both
Slash command
/casper:firecrawl-scrapingThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Scrape individual web pages and convert them to clean, LLM-ready markdown. Handles JavaScript rendering, anti-bot protection, and dynamic content.
Scrapes single pages or crawls sites using Firecrawl v2.5 API to LLM-ready markdown and structured data. Handles JS rendering, bot bypass, browser automation for dynamic content extraction.
Extracts clean markdown from any URL, including JavaScript-rendered SPAs. Supports concurrent scraping, JS wait times, and content filtering.
Scrape single pages or crawl sites into LLM-ready markdown via Firecrawl JS library. Handles sync/async jobs, depth limits, path filters, JS rendering.
Share bugs, ideas, or general feedback.
Scrape individual web pages and convert them to clean, LLM-ready markdown. Handles JavaScript rendering, anti-bot protection, and dynamic content.
What are you scraping?
│
├── Single page (article, blog, docs)
│ └── references/single-page.md
│ └── Script: scripts/firecrawl_scrape.py
│
└── Entire website (multiple pages, crawling)
└── references/website-crawler.md
└── (Use Apify Website Content Crawler for multi-page)
# Required in .env
FIRECRAWL_API_KEY=fc-your-api-key-here
Get your API key: https://firecrawl.dev/app/api-keys
python scripts/firecrawl_scrape.py "https://example.com/article"
python scripts/firecrawl_scrape.py "https://wsj.com/article" \
--proxy stealth \
--format markdown summary \
--timeout 60000
| Mode | Use Case |
|---|---|
basic | Standard sites, fastest |
stealth | Anti-bot protection, premium content (WSJ, NYT) |
auto | Let Firecrawl decide (recommended) |
markdown - Clean markdown content (default)html - Raw HTMLsummary - AI-generated summaryscreenshot - Page screenshotlinks - All links on page~1 credit per page. Stealth proxy may use additional credits.
FIRECRAWL_API_KEY in .env file (never commit to git).tmp/ directorySymptoms: API returns "insufficient credits" or quota exceeded error Cause: Account credits depleted Solution:
basic proxy mode to conserve creditsSymptoms: Empty content or partial HTML returned Cause: JavaScript-heavy page not fully loading Solution:
--js-render flag--timeout 60000 (60 seconds)stealth proxy mode for protected sites--wait-for selectorSymptoms: Script returns 403 status code Cause: Site blocking automated access Solution:
stealth proxy modeSymptoms: Scrape succeeds but markdown is empty or malformed Cause: Dynamic content loaded after page load, or unusual page structure Solution:
--wait-for to wait for specific contenthtml format to see raw contentSymptoms: Request times out before completion Cause: Slow page load or large page content Solution:
basic proxy for faster responseSkills: firecrawl-scraping → parallel-research Use case: Scrape competitor pages, then analyze content strategy Flow:
Skills: firecrawl-scraping → content-generation Use case: Create summary documents from web research Flow:
Skills: firecrawl-scraping → attio-crm Use case: Enrich company records with website data Flow: