Help us improve
Share bugs, ideas, or general feedback.
From ocr-toolkit
Extract text from PDF files using Apple Vision OCR, optimized for Apple Silicon
npx claudepluginhub varunr89/claude-marketplace --plugin ocr-toolkitHow this skill is triggered — by the user, by Claude, or both
Slash command
/ocr-toolkit:pdf-ocrThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Converts PDF pages to images using PyMuPDF, then runs Apple Vision OCR on each page in parallel. Produces Markdown, plain text, or JSONL output.
Converts PDF files to markdown using local GLM-OCR via Ollama. Renders each page to image, runs OCR, assembles clean text output. Use for extracting text from PDFs.
This skill should be used when the user says "process documents", "extract text from PDF", "OCR this document", "convert PDF to markdown", "extract emails from documents", "parse document", "document conversion", "batch OCR", "extract structured data from PDF", "read PDF", "extract tables from PDF", "convert Word document", "convert docx to markdown", or wants to extract, convert, or process documents and scanned images.
Extracts text and structured data from single or batch PDFs using 9 backends with automatic fallback, OCR for scanned docs, and markdown output via CLI.
Share bugs, ideas, or general feedback.
Converts PDF pages to images using PyMuPDF, then runs Apple Vision OCR on each page in parallel. Produces Markdown, plain text, or JSONL output.
Use this skill when the user wants to extract text from a PDF file -- especially scanned PDFs, image-based PDFs, or PDFs where copy-paste produces garbled text.
python3 ${CLAUDE_PLUGIN_ROOT}/scripts/pdf_ocr.py <input.pdf> \
[-o output_file] \
[-f markdown|text|jsonl] \
[--dpi 200] \
[--workers N] \
[--languages en-US] \
[--fast] \
[--keep-images] [--images-dir <dir>] \
[--stdout]
| Argument | Default | Description |
|---|---|---|
pdf (positional) | required | Input PDF file path |
-o, --output | <pdf_name>.md | Output file path |
-f, --format | markdown | Output format: markdown, text, or jsonl |
--dpi | 200 | Resolution for rendering PDF pages (higher = better quality, slower) |
--workers | CPU count | Number of parallel OCR workers |
--languages | en-US | Comma-separated recognition languages |
--fast | false | Use faster, less accurate recognition |
--keep-images | false | Keep the extracted page images after OCR |
--images-dir | temp dir | Directory to save page images (requires --keep-images) |
--stdout | false | Print extracted text to stdout instead of writing to file |
## Page N section with the OCR text=== Page N === separatorspage, text, and backend fields