From tavily
Extracts clean markdown or text content from up to 20 URLs using Tavily CLI. Handles JavaScript-rendered pages, LLM-optimized output, and query-focused chunking for targeted extraction.
```shell
npx claudepluginhub tavily-ai/skills --plugin tavily
```

This skill is limited to using the following tools:
Extract clean markdown or text content from one or more URLs.
If tvly is not found on PATH, install it first:
```shell
curl -fsSL https://cli.tavily.com/install.sh | bash && tvly login
```
Do not skip this step or fall back to other tools.
See tavily-cli for alternative install methods and auth options.
```shell
# Single URL
tvly extract "https://example.com/article" --json

# Multiple URLs
tvly extract "https://example.com/page1" "https://example.com/page2" --json

# Query-focused extraction (returns relevant chunks only)
tvly extract "https://example.com/docs" --query "authentication API" --chunks-per-source 3 --json

# JS-heavy pages
tvly extract "https://app.example.com" --extract-depth advanced --json

# Save to file
tvly extract "https://example.com/article" -o article.md
```
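When you need just the extracted text out of the `--json` envelope, a small helper can parse it with Python. This is a sketch: `extract_json_text` is a hypothetical name, and the `results` and `raw_content` field names are assumptions about the JSON shape, so check the actual `--json` output on your own URLs first.

```shell
# Sketch: print only the extracted text from tvly's --json output.
# Assumption: the JSON has a "results" array whose items carry "raw_content".
extract_json_text() {
  tvly extract "$@" --json | python3 -c '
import json, sys
for r in json.load(sys.stdin).get("results", []):
    print(r.get("raw_content", ""))'
}
# usage: extract_json_text "https://example.com/article" > article.md
```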
| Option | Description |
|---|---|
| `--query` | Rerank chunks by relevance to this query |
| `--chunks-per-source` | Chunks per URL (1-5, requires `--query`) |
| `--extract-depth` | `basic` (default) or `advanced` (for JS pages) |
| `--format` | `markdown` (default) or `text` |
| `--include-images` | Include image URLs |
| `--timeout` | Max wait time (1-60 seconds) |
| `-o, --output` | Save output to file |
| `--json` | Structured JSON output |
| Depth | When to use |
|---|---|
| `basic` | Simple pages, fast; try this first |
| `advanced` | JS-rendered SPAs, dynamic content, tables |
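The depth guidance above can be automated with a small helper that retries at `advanced` depth when `basic` comes back empty. This is a sketch under assumptions: `extract_md` is a hypothetical name, and treating empty output as "content is missing" is an assumption about how a failed basic extraction presents.

```shell
# Sketch: try basic depth first, retry with advanced only if nothing came back.
# Assumption: a basic extraction that misses content produces empty output.
extract_md() {
  local url="$1" out
  out=$(tvly extract "$url" --extract-depth basic 2>/dev/null)
  [ -n "$out" ] || out=$(tvly extract "$url" --extract-depth advanced 2>/dev/null)
  printf '%s\n' "$out"
}
# usage: extract_md "https://app.example.com" > page.md
```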
- Use `--query` with `--chunks-per-source` to get only the relevant chunks instead of full pages.
- Try `basic` first; fall back to `advanced` if content is missing.
- Raise `--timeout` for slow pages (up to 60s).
- If you already have raw page content from a search (`--include-raw-content`), skip the extract step.
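For bulk runs, the 20-URL cap noted in the description can be respected by slicing a URL file before a single call. This is a sketch: `extract_batch`, the `urls.txt` name, and the one-URL-per-line file layout are all assumptions, not part of the CLI itself.

```shell
# Sketch: pass up to 20 URLs (one per line in a file) to a single extract call.
extract_batch() {
  # word splitting on the command substitution is intended; URLs have no spaces
  tvly extract $(head -n 20 "$1") --json
}
# usage: extract_batch urls.txt > batch.json
```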