Skill

processing-pdfs

Processes PDF files: extracts text and tables, fills forms, merges/splits documents, batch-processes, converts to images, and generates PDFs programmatically using pypdf, pdfplumber, reportlab, and CLI tools.

Python

developer-tools

npx claudepluginhub telagod/code-abyss --plugin code-abyss

Popularity

Stars

217

Forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/code-abyss:processing-pdfs <file.pdf | task>

Not user invocable

Model invocable

Inline context

Default effort

Argument hint<file.pdf | task>

Tool Access

This skill is limited to the following tools:

BashReadWriteEditGlob

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Essential PDF operations using Python libraries and CLI tools.

Supporting Files

FORMS.mdREFERENCE.mdreferences/advanced.mdreferences/recipes.mdscripts/check_bounding_boxes.pyscripts/check_bounding_boxes_test.pyscripts/check_fillable_fields.pyscripts/convert_pdf_to_images.pyscripts/create_validation_image.pyscripts/extract_form_field_info.pyscripts/fill_fillable_fields.pyscripts/fill_pdf_form_with_annotations.py

SKILL.md

52 lines · ~532 tokens

Similar Skills

pdf

Extracts text and tables from PDFs, merges/splits documents, fills forms, and creates new PDFs using pypdf and pdfplumber.

11 files

superpowers

pdf-official

39.3k

Perform PDF processing in Python with pypdf and pdfplumber: merge/split/rotate pages, extract text/tables/metadata using pandas. Useful for document automation.

11 files

antigravity-awesome-skills

pdf

2.9k

Processes PDFs: extracts text/tables with pdfplumber, merges/splits/rotates with pypdf, extracts metadata. For generating PDFs or filling forms via referenced guides.

11 files

all-skills

Stats

LanguageJavaScript

Stars217

Forks29

MaintenanceExcellent

Last CommitMay 31, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

Stats

Actions

Help us improve

Share bugs, ideas, or general feedback.

PDF Processing

Essential PDF operations using Python libraries and CLI tools.

Decision Matrix

Task

Best Tool

Reference

Merge / split / metadata / rotate

pypdf

recipes.md

Extract text (layout preserved)

pdfplumber

recipes.md

Extract tables

pdfplumber

recipes.md

Create new PDF

reportlab

recipes.md

Batch CLI ops

qpdf / pdftk

recipes.md

OCR scanned PDFs

pytesseract + pdf2image

advanced.md

Add watermark / extract images / encrypt

pypdf / pdfimages

advanced.md

Fill PDF forms

pdf-lib / pypdf

FORMS.md

Advanced pypdfium2 / pdf-lib JS

—

REFERENCE.md

Quick Start

from pypdf import PdfReader reader = PdfReader("document.pdf") print(f"Pages: {len(reader.pages)}") text = "".join(page.extract_text() for page in reader.pages)

Workflow

Identify task — text extraction? table? creation? form? Pick row from matrix above.

Load reference — recipes.md covers 90% of tasks; advanced.md for OCR / encrypt; FORMS.md for forms.

Implement — copy-adapt recipe; verify output.

Validate — open in a viewer or grep extracted text.

Library Selection

Library

Use for

pypdf

Merge, split, metadata, encryption, rotation

pdfplumber

Text extraction with layout, tables

reportlab

Generate PDFs programmatically

pdf2image + pytesseract

OCR scanned documents

qpdf / pdftk (CLI)

Batch ops, no Python needed

PDF Processing

Essential PDF operations using Python libraries and CLI tools.

Decision Matrix

Task	Best Tool	Reference
Merge / split / metadata / rotate	pypdf	recipes.md
Extract text (layout preserved)	pdfplumber	recipes.md
Extract tables	pdfplumber	recipes.md
Create new PDF	reportlab	recipes.md
Batch CLI ops	qpdf / pdftk	recipes.md
OCR scanned PDFs	pytesseract + pdf2image	advanced.md
Add watermark / extract images / encrypt	pypdf / pdfimages	advanced.md
Fill PDF forms	pdf-lib / pypdf	FORMS.md
Advanced pypdfium2 / pdf-lib JS	—	REFERENCE.md

Quick Start

from pypdf import PdfReader
reader = PdfReader("document.pdf")
print(f"Pages: {len(reader.pages)}")
text = "".join(page.extract_text() for page in reader.pages)

Workflow

Identify task — text extraction? table? creation? form? Pick row from matrix above.
Load reference — recipes.md covers 90% of tasks; advanced.md for OCR / encrypt; FORMS.md for forms.
Implement — copy-adapt recipe; verify output.
Validate — open in a viewer or grep extracted text.

Library Selection

Library	Use for
pypdf	Merge, split, metadata, encryption, rotation
pdfplumber	Text extraction with layout, tables
reportlab	Generate PDFs programmatically
pdf2image + pytesseract	OCR scanned documents
qpdf / pdftk (CLI)	Batch ops, no Python needed

processing-pdfs

Popularity

Invocation

Tool Access

Context Preview

Supporting Files

SKILL.md

Similar Skills

Help us improve

Help us improve

Find plugins for your project

processing-pdfs

Popularity

Invocation

Tool Access

Context Preview

Supporting Files

SKILL.md

PDF Processing

Decision Matrix

Quick Start

Workflow

Library Selection

Similar Skills

Help us improve

PDF Processing

Decision Matrix

Quick Start

Workflow

Library Selection