Skill

scrape-codegen-generate

Generates web-poet page object code by synthesizing per-page extraction analyses into a domain-wide page object class. Used for building robust web scrapers in Python.

Python

backend

developer-tools

npx claudepluginhub zytedata/claude-skills --plugin zyte-web-data

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/zyte-web-data:scrape-codegen-generate [work-path] [output-path] [spec-path]

User invocable

Model invocable

Inline context

Default effort

Argument hint: [work-path] [output-path] [spec-path]

Tool Access

This skill is limited to the following tools:

Skill, Bash, Read, Write

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

You are generating web-poet page object code. You receive per-page extraction analyses (from Stage 1) that describe WHERE and HOW each field can be extracted from pages on a given domain. Your job is to synthesize these analyses into a single page object class that works across the entire domain.

SKILL.md

92 lines · ~1k tokens

Similar Skills

mempalace

55.4k

Mines projects and conversations into a searchable memory palace and retrieves past work via semantic search.

mempalace

payload

42.5k

Guides Payload CMS config (payload.config.ts), collections, fields, hooks, access control, APIs. Debugs validation errors, security, relationships, queries, transactions, hook behavior.

11 files

payload

vector-database-engineer

37.9k

Implements vector databases with Pinecone, Weaviate, Qdrant, Milvus, pgvector for semantic search, RAG, recommendations, and similarity systems. Optimizes embeddings, indexing, and hybrid search.

antigravity-bundle-data-engineering

Stats

Language: Python

Stars: 6

Forks: 2

Maintenance: Excellent

Last Commit: Jun 2, 2026


from web_poet import WebPage, field
# ... other imports as needed

class PageObject(WebPage[dict]):
    # shared helpers as @cached_property if multiple fields need them

    @field
    def field_name(self) -> type | None:
        # extraction logic
        ...

Generated page object with N fields:
  name: CSS h1.product-title::text
  price: JSON-LD offers.price, fallback to CSS span.price
  description: CSS div.product-description (text join)
  rating: JSON-LD aggregateRating.ratingValue
  image_url: CSS img.product-image::attr(src) + urljoin
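One way to arrive at a report like this is to rank, for each field, the strategies that worked across the analyzed pages, then treat the most common one as primary and the rest as fallbacks. The sketch below assumes a hypothetical analysis format (one dict per page, mapping field name to the strategy that worked there). The real Stage 1 output format is not shown on this page, and `build_consensus` and `format_report` are illustrative names.

```python
from collections import Counter

# Hypothetical Stage 1 output: one dict per analyzed page,
# mapping field name -> extraction strategy that worked on that page.
PageAnalysis = dict[str, str]


def build_consensus(analyses: list[PageAnalysis]) -> dict[str, list[str]]:
    """Rank strategies per field by the number of pages they worked on."""
    counts: dict[str, Counter] = {}
    for analysis in analyses:
        for field_name, strategy in analysis.items():
            counts.setdefault(field_name, Counter())[strategy] += 1
    # most_common() sorts by count; ties keep first-seen order.
    return {
        name: [strategy for strategy, _ in counter.most_common()]
        for name, counter in counts.items()
    }


def format_report(plan: dict[str, list[str]]) -> str:
    """Render the plan as a summary with primary and fallback strategies."""
    lines = [f"Generated page object with {len(plan)} fields:"]
    for name, strategies in plan.items():
        primary, *fallbacks = strategies
        line = f"  {name}: {primary}"
        if fallbacks:
            line += ", fallback to " + ", ".join(fallbacks)
        lines.append(line)
    return "\n".join(lines)


if __name__ == "__main__":
    analyses = [
        {"name": "CSS h1.product-title::text", "price": "JSON-LD offers.price"},
        {"name": "CSS h1.product-title::text", "price": "JSON-LD offers.price"},
        {"name": "CSS h1.product-title::text", "price": "CSS span.price"},
    ]
    print(format_report(build_consensus(analyses)))
```

Ranking by page count is only one possible heuristic; a strategy that works on fewer pages may still be preferable as the primary if it is more robust to markup changes.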

Input

Process

1. Read inputs

2. Build consensus across pages

3. Generate page object code

4. Save and report
