Help us improve
Share bugs, ideas, or general feedback.
From mineru
Parses PDF, Office, and image files into structured Markdown using the MinerU API. Supports OCR, formula/table recognition, batch processing, and multi-format export (DOCX/HTML/LaTeX).
npx claudepluginhub nebutra/mineru-skillHow this skill is triggered — by the user, by Claude, or both
Slash command
/mineru:mineruThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Parse PDF, Office, and image documents into structured Markdown via the MinerU API.
references/api_reference.mdreferences/comparison.mdreferences/integrations.mdscripts/__init__.pyscripts/chunking.pyscripts/local_engine.pyscripts/mineru.pyscripts/mineru_mcp.pyscripts/sinks/__init__.pyscripts/sinks/_http.pyscripts/sinks/_md.pyscripts/sinks/airtable.pyscripts/sinks/base.pyscripts/sinks/coda.pyscripts/sinks/confluence.pyscripts/sinks/dingtalk.pyscripts/sinks/feishu.pyscripts/sinks/linear.pyscripts/sinks/local.pyscripts/sinks/notion.pyConverts local or remote documents (PDF, DOCX, images, web pages) to Markdown using MinerU OCR. Supports light-weight no-login mode and standard API with token. Includes token self-check and archive.
Parses local files (PDF, DOCX, XLSX, HTML, etc.) into clean markdown on disk. Offers AI summaries and Q&A over document content.
This skill should be used when the user says "process documents", "extract text from PDF", "OCR this document", "convert PDF to markdown", "extract emails from documents", "parse document", "document conversion", "batch OCR", "extract structured data from PDF", "read PDF", "extract tables from PDF", "convert Word document", "convert docx to markdown", or wants to extract, convert, or process documents and scanned images.
Share bugs, ideas, or general feedback.
Parse PDF, Office, and image documents into structured Markdown via the MinerU API.
# Zero-config: no token, no install (free Agent API)
python3 "${CLAUDE_PLUGIN_ROOT}/scripts/mineru.py" ./document.pdf --output ./output/
# Pipe Markdown back to an agent
python3 "${CLAUDE_PLUGIN_ROOT}/scripts/mineru.py" ./document.pdf --stdout
# Power mode: token unlocks large files / batch / extra formats
export MINERU_TOKEN="..." # https://mineru.net/apiManage/token
python3 "${CLAUDE_PLUGIN_ROOT}/scripts/mineru.py" ./pdfs/ --output ./output/ --workers 8 --resume
--ocr with language selection (--lang)--stdout (Markdown) and --json (machine status)--resumeA token is optional — the Agent API works without one. Set a token to unlock the Standard API (≤ 200 MB / ≤ 200 pages, batch, DOCX/HTML/LaTeX):
export MINERU_TOKEN="your-token-here" # https://mineru.net/apiManage/token
Official API docs: https://mineru.net/apiManage/docs