Help us improve
Share bugs, ideas, or general feedback.
From code-abyss
Processes PDF files: extracts text and tables, fills forms, merges/splits documents, batch-processes, converts to images, and generates PDFs programmatically using pypdf, pdfplumber, reportlab, and CLI tools.
npx claudepluginhub telagod/code-abyss --plugin code-abyssHow this skill is triggered — by the user, by Claude, or both
Slash command
/code-abyss:processing-pdfs <file.pdf | task><file.pdf | task>This skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Essential PDF operations using Python libraries and CLI tools.
FORMS.mdREFERENCE.mdreferences/advanced.mdreferences/recipes.mdscripts/check_bounding_boxes.pyscripts/check_bounding_boxes_test.pyscripts/check_fillable_fields.pyscripts/convert_pdf_to_images.pyscripts/create_validation_image.pyscripts/extract_form_field_info.pyscripts/fill_fillable_fields.pyscripts/fill_pdf_form_with_annotations.pyExtracts text and tables from PDFs, merges/splits documents, fills forms, and creates new PDFs using pypdf and pdfplumber.
Perform PDF processing in Python with pypdf and pdfplumber: merge/split/rotate pages, extract text/tables/metadata using pandas. Useful for document automation.
Processes PDFs: extracts text/tables with pdfplumber, merges/splits/rotates with pypdf, extracts metadata. For generating PDFs or filling forms via referenced guides.
Share bugs, ideas, or general feedback.
Essential PDF operations using Python libraries and CLI tools.
| Task | Best Tool | Reference |
|---|---|---|
| Merge / split / metadata / rotate | pypdf | recipes.md |
| Extract text (layout preserved) | pdfplumber | recipes.md |
| Extract tables | pdfplumber | recipes.md |
| Create new PDF | reportlab | recipes.md |
| Batch CLI ops | qpdf / pdftk | recipes.md |
| OCR scanned PDFs | pytesseract + pdf2image | advanced.md |
| Add watermark / extract images / encrypt | pypdf / pdfimages | advanced.md |
| Fill PDF forms | pdf-lib / pypdf | FORMS.md |
| Advanced pypdfium2 / pdf-lib JS | — | REFERENCE.md |
from pypdf import PdfReader
reader = PdfReader("document.pdf")
print(f"Pages: {len(reader.pages)}")
text = "".join(page.extract_text() for page in reader.pages)
| Library | Use for |
|---|---|
| pypdf | Merge, split, metadata, encryption, rotation |
| pdfplumber | Text extraction with layout, tables |
| reportlab | Generate PDFs programmatically |
| pdf2image + pytesseract | OCR scanned documents |
| qpdf / pdftk (CLI) | Batch ops, no Python needed |