Skill

webpage-reader

Extracts clean Markdown from web pages by stripping navigation, ads, sidebars, footers, and boilerplate using Defuddle. Use for URLs to documentation, articles, blog posts, research papers, release notes.

Html

Markdown

documentation

developer-tools

npx claudepluginhub ericgandrade/claude-superskills --plugin claude-superskills

Tool Access

This skill uses the workspace's default tool permissions.

Preview

This skill extracts clean, readable Markdown from any web page URL by stripping navigation menus, advertisements, sidebars, footers, cookie banners, and all other non-content elements. It produces token-efficient output that focuses exclusively on the meaningful content of the page.

Supporting Assets

README.mdevals/evals.jsonevals/trigger-eval.json

SKILL.md

Similar Skills

defuddle

36.4k

Extracts clean markdown from web pages using Defuddle CLI, removing ads, navigation, and clutter to save tokens. Prefer for user-provided URLs of docs, articles, or blogs over WebFetch.

antigravity-awesome-skills

defuddle

25.1k

Extracts clean markdown from web pages using Defuddle CLI, removing navigation, ads, and clutter to save tokens. Use for online docs, articles, blogs—not .md files.

obsidian

defuddle

Extracts clean markdown from web pages using Defuddle CLI, stripping clutter like navigation and ads to save tokens. Use for URLs to analyze docs, articles, or blogs instead of WebFetch.

obsidian-skills

Stats

Stars31

Forks0

Last CommitApr 3, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

# Extract and display as Markdown defuddle parse <url> --md # Extract and save to a file defuddle parse <url> --md -o output-filename.md # Extract specific metadata only defuddle parse <url> -p title defuddle parse <url> -p description defuddle parse <url> -p author defuddle parse <url> -p domain

Flag	Output	When to use
`--md`	Markdown	Default choice for all content
`--json`	JSON with HTML and Markdown	When structured metadata is needed
(none)	HTML	Avoid — use `--md` instead
`-p <name>`	Single metadata property	When only title, author, or description is needed

Flag

Output

When to use

--md

Markdown

Default choice for all content

--json

JSON with HTML and Markdown

When structured metadata is needed

(none)

HTML

Avoid — use --md instead

-p <name>

Single metadata property

When only title, author, or description is needed

Situation	Response
403 Forbidden	"This site blocks automated access. Try opening it manually in a browser."
404 Not Found	"Page not found. Please verify the URL is correct."
Timeout	Retry once automatically; if it fails again, report the timeout
Login required	"This page requires authentication. Log in first and share the content manually."
Paywall detected	"This content is behind a paywall and cannot be extracted automatically."
Empty extraction	Fall back to WebFetch and note the reduced quality
Invalid SSL	Report the SSL error and ask if the user wants to proceed anyway

Situation

Response

403 Forbidden

"This site blocks automated access. Try opening it manually in a browser."

404 Not Found

"Page not found. Please verify the URL is correct."

Timeout

Retry once automatically; if it fails again, report the timeout

"This page requires authentication. Log in first and share the content manually."

Paywall detected

"This content is behind a paywall and cannot be extracted automatically."

Empty extraction

Fall back to WebFetch and note the reduced quality

Invalid SSL

Report the SSL error and ask if the user wants to proceed anyway

Mode	Command flag	Output type	Best for
Markdown	`--md`	Clean `.md` text	Reading articles, documentation, blog posts
JSON	`--json`	JSON with `html` and `markdown` fields	When structured metadata and body are both needed
HTML	(no flag)	Raw cleaned HTML	Avoid — use `--md` for readability
Metadata	`-p <name>`	Single string value	When only title, author, description, or domain is needed

Mode

Command flag

Output type

Best for

Markdown

--md

Clean .md text

Reading articles, documentation, blog posts

JSON

--json

JSON with html and markdown fields

When structured metadata and body are both needed

HTML

(no flag)

Raw cleaned HTML

Avoid — use --md for readability

Metadata

-p <name>

Single string value

When only title, author, description, or domain is needed

User: I am researching LLM evaluation frameworks. Read these three pages: - https://docs.ragas.io/en/latest/ - https://www.trulens.org/ - https://lilianweng.github.io/posts/2023-03-15-prompt-engineering/

Flag	Output	When to use
`--md`	Markdown	Default choice for all content
`--json`	JSON with HTML and Markdown	When structured metadata is needed
(none)	HTML	Avoid — use `--md` instead
`-p <name>`	Single metadata property	When only title, author, or description is needed

Flag

Output

When to use

--md

Markdown

Default choice for all content

--json

JSON with HTML and Markdown

When structured metadata is needed

(none)

HTML

Avoid — use --md instead

-p <name>

Single metadata property

When only title, author, or description is needed

Situation	Response
403 Forbidden	"This site blocks automated access. Try opening it manually in a browser."
404 Not Found	"Page not found. Please verify the URL is correct."
Timeout	Retry once automatically; if it fails again, report the timeout
Login required	"This page requires authentication. Log in first and share the content manually."
Paywall detected	"This content is behind a paywall and cannot be extracted automatically."
Empty extraction	Fall back to WebFetch and note the reduced quality
Invalid SSL	Report the SSL error and ask if the user wants to proceed anyway

Situation

Response

403 Forbidden

"This site blocks automated access. Try opening it manually in a browser."

404 Not Found

"Page not found. Please verify the URL is correct."

Timeout

Retry once automatically; if it fails again, report the timeout

"This page requires authentication. Log in first and share the content manually."

Paywall detected

"This content is behind a paywall and cannot be extracted automatically."

Empty extraction

Fall back to WebFetch and note the reduced quality

Invalid SSL

Report the SSL error and ask if the user wants to proceed anyway

Mode	Command flag	Output type	Best for
Markdown	`--md`	Clean `.md` text	Reading articles, documentation, blog posts
JSON	`--json`	JSON with `html` and `markdown` fields	When structured metadata and body are both needed
HTML	(no flag)	Raw cleaned HTML	Avoid — use `--md` for readability
Metadata	`-p <name>`	Single string value	When only title, author, description, or domain is needed

Mode

Command flag

Output type

Best for

Markdown

--md

Clean .md text

Reading articles, documentation, blog posts

JSON

--json

JSON with html and markdown fields

When structured metadata and body are both needed

HTML

(no flag)

Raw cleaned HTML

Avoid — use --md for readability

Metadata

-p <name>

Single string value

When only title, author, description, or domain is needed

webpage-reader

Tool Access

Preview

Supporting Assets

SKILL.md

Similar Skills

Help us improve

Help us improve

webpage-reader

Tool Access

Preview

Supporting Assets

SKILL.md

Purpose

When to Use

Workflow

Step 1: Validate the Input

Step 2: Check Defuddle Availability

Step 3a: Extract Content with Defuddle (Preferred Path)

Step 3b: WebFetch Fallback (When Defuddle Is Unavailable)

Step 4: Handle Extraction Errors

Step 5: Format and Deliver the Output

Output Formats Reference

Integration with Other Skills

Critical Rules

Example Usage

Example 1: Read a documentation page

Example 2: Save an article to a local file

Example 3: Get page metadata only

Example 4: Batch extraction for research

Example 5: Defuddle not installed — install offer

Similar Skills

Help us improve

Purpose

When to Use

Workflow

Step 1: Validate the Input

Step 2: Check Defuddle Availability

Step 3a: Extract Content with Defuddle (Preferred Path)

Step 3b: WebFetch Fallback (When Defuddle Is Unavailable)

Step 4: Handle Extraction Errors

Step 5: Format and Deliver the Output

Output Formats Reference

Integration with Other Skills

Critical Rules

Example Usage

Example 1: Read a documentation page

Example 2: Save an article to a local file

Example 3: Get page metadata only

Example 4: Batch extraction for research

Example 5: Defuddle not installed — install offer