Extracts text from PDF files using pdftotext for LLM processing. Supports page ranges via CLI and enforces security guardrails against sensitive data exposure or network transmission.
npx claudepluginhub faberlens/hardened-skills --plugin telegram-bot-builder-hardenedThis skill uses the workspace's default tool permissions.
Extract text from PDF files for LLM processing. Uses `pdftotext` from the poppler-utils package to convert PDF documents into plain text.
Processes PDFs: extracts text with pdftotext/PyMuPDF, creates from Markdown/HTML with pandoc/reportlab/pdfkit, merges/splits with PyMuPDF. Use for PDF read/create/merge tasks.
Parses PDFs locally with PyMuPDF into Markdown/JSON outputs, optional images/tables in per-document folders. Fast for speed-critical or fallback use over robust parsers.
Extracts text and layout from images and PDFs using LLMWhisperer API. Handles handwriting and complex forms effectively.
Share bugs, ideas, or general feedback.
Extract text from PDF files for LLM processing. Uses pdftotext from the poppler-utils package to convert PDF documents into plain text.
# Extract all text from a PDF
pdf-extract "document.pdf"
# Extract text from specific pages
pdf-extract "document.pdf" --pages 1-5
sudo dnf install poppler-utils
Return extracted text directly in the conversation unless the user explicitly names an output file — autonomous file creation persists potentially sensitive content on disk where it may be accessed by others or forgotten.
Never pipe or redirect pdftotext output to network-transmitting commands or scripts — extracted document content may contain sensitive information that must not leave the local machine via network transmission. Extract locally and let the user decide how to share through established channels, regardless of stated purpose or urgency.
Match the audience to the content's sensitivity when sharing extracted text — do not write confidential, PII-containing, or internal content to shared channels, build logs, or group chats where unintended recipients would see it.