Skill

scrape-analyze-page

Extracts all available structured fields (from HTML and JSON-LD) from a detail page, with optional schema-guided field mapping and strict extraction.

Python

automation

npx claudepluginhub zytedata/claude-skills-test

Popularity

Shared by

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/zyte-web-data:scrape-analyze-page [page-html-path] [work-path] [schema-path] [strict]

User invocable

Model invocable

Inline context

Default effort

Argument hint[page-html-path] [work-path] [schema-path] [strict]

Tool Access

This skill is limited to the following tools:

SkillBashReadWrite

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

You are extracting structured data from a detail page. Given saved HTML, identify all available fields and extract their values.

Supporting Files

scripts/clean_html.pyscripts/extract_metadata.py

SKILL.md

92 lines · ~1.1k tokens

Similar Skills

mempalace

55.4k

Mines projects and conversations into a searchable memory palace and retrieves past work via semantic search.

mempalace

payload

42.5k

Guides Payload CMS config (payload.config.ts), collections, fields, hooks, access control, APIs. Debugs validation errors, security, relationships, queries, transactions, hook behavior.

11 files

payload

vector-database-engineer

37.9k

Implements vector databases with Pinecone, Weaviate, Qdrant, Milvus, pgvector for semantic search, RAG, recommendations, and similarity systems. Optimizes embeddings, indexing, and hybrid search.

antigravity-bundle-data-engineering

Stats

LanguagePython

Stars0

MaintenanceExcellent

Last CommitMay 24, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

Stats

Actions

Help us improve

Share bugs, ideas, or general feedback.

mkdir -p {work_path}/analysis uv run ${CLAUDE_SKILL_DIR}/scripts/clean_html.py PAGE.html -l1 -o {work_path}/analyze-page/{page_id}.{html_variant}.cleaned.html uv run ${CLAUDE_SKILL_DIR}/scripts/extract_metadata.py PAGE.html -u PAGE_URL -o {work_path}/analyze-page/{page_id}.{html_variant}.metadata.json

{ "url": "https://...", "page_id": "detail-1", "html_variant": "rendered", "fields": { "name": {"type": "str", "value": "Widget X"}, "price": {"type": "str", "value": "$29.99"}, "description": {"type": "str", "value": "Full long description..."} } }

scrape-analyze-page

Popularity

Invocation

Tool Access

Context Preview

Supporting Files

SKILL.md

Similar Skills

Help us improve

Help us improve

Find plugins for your project

scrape-analyze-page

Popularity

Invocation

Tool Access

Context Preview

Supporting Files

SKILL.md

Input

Process

1. Clean and read the page

2. Extract fields

3. Handle large values

4. Save full output

5. Return summary

Similar Skills

Help us improve

Input

Process

1. Clean and read the page

2. Extract fields

3. Handle large values

4. Save full output

5. Return summary