Help us improve
Share bugs, ideas, or general feedback.
From scrapedo-web-scraper
Scrapes web pages via Scrape.do API to bypass blocks, CAPTCHA, and WebFetch errors like 403, 401, 429, timeouts, access denied, Cloudflare. Auto-activates on failures.
npx claudepluginhub artwist-polyakov/polyakov-claude-skills --plugin scrapedo-web-scraperHow this skill is triggered — by the user, by Claude, or both
Slash command
/scrapedo-web-scraper:scrapedo-web-scraperThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Скрапинг веб-страниц через Scrape.do API. Используй когда обычный fetch не работает (блокировка, JavaScript).
Unblocks 4xx/WAF/captcha/JS-SPA web fetches via escalating free chain: public APIs, Jina Reader, curl/TLS impersonation, Playwright headless, archives until valid body. Zero keys.
Extracts data from JS-rendered, Cloudflare-protected, or dynamic SPA pages using the scrapling Python library with three-tier fetcher selection (HTTP, stealth Chromium, full browser automation) and CSS selectors. Use when WebFetch or simple HTTP requests fail due to anti-bot defenses or DOM-traversal needs.
Scrapes web pages and websites using Firecrawl API, converting to clean markdown. Handles JavaScript rendering, anti-bot protection, paywalled content, and dynamic sites for articles, blogs, docs.
Share bugs, ideas, or general feedback.
Скрапинг веб-страниц через Scrape.do API. Используй когда обычный fetch не работает (блокировка, JavaScript).
# Получить текст страницы
python scripts/scrape.py https://example.com
# Получить HTML
python scripts/scrape.py --html https://example.com
from scripts.scrape import fetch_via_scrapedo
result = fetch_via_scrapedo('https://example.com')
if result['success']:
print(result['content']) # текст
# result['html'] — оригинальный HTML
else:
print(result['content']) # описание ошибки
--html)Если вернулась ошибка — страница недоступна через этот метод.