document-conversion | interpretive-orchestration-cowork

Skill

document-conversion

From interpretive-orchestration-cowork

Use this skill when a user needs to convert PDFs (especially ones with tables or figures), mentions 'convert', 'PDF', or 'document processing', has complex academic papers to import, or asks about document conversion options.

$ npx claudepluginhub linxule/interpretive-orchestration --plugin interpretive-orchestration-cowork

Popularity

Parent stars

6

Parent forks

1

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/interpretive-orchestration-cowork:skills/document-conversion

User invocable

Model invocable

Inline context

Default effort

SKILL.md

156 lines · ~1.1k tokens

Stats

Language: JavaScript

Parent stars: 6

Parent forks: 1

Maintenance: Excellent

Last Commit: Feb 6, 2026

document-conversion

Robust PDF and document conversion with intelligent tool selection. Chooses the best available conversion method based on document complexity and MCP availability.

When to Use

Use this skill when:

User needs to convert PDFs, especially with tables or figures
User mentions "convert", "PDF", "document processing"
User has complex academic papers to import
User asks about document conversion options

Conversion Options

Feature	MinerU (Optional)	Manual Conversion
API Key Required	Yes (MINERU_API_KEY)	No
PDF Accuracy	90%+ (VLM mode)	Varies
Table Extraction	Excellent	Manual cleanup
Figure Handling	Extracts + describes	Manual description
Formula Recognition	Yes	Limited
Multi-column	Excellent	Manual formatting
Cost	Pay per page	Free

When to Use Which

Use MinerU When:

PDF has complex tables with merged cells
Document has multi-column layouts
Figures/charts need extraction
Mathematical formulas present
Academic paper with structured formatting
Accuracy is critical

Use Manual Conversion When:

Simple text-based documents
MinerU API key not available
Cost is a concern
Document is straightforward

Tool Selection Logic

Is the document a PDF with tables/figures?
├── Yes, complex tables
│   └── MinerU available?
│       ├── Yes → Use MinerU (vlm mode)
│       └── No → Manual conversion + review
├── Yes, simple formatting
│   └── Manual conversion or external tool
└── No, other format
    └── Is it audio?
        ├── Yes → External transcription service
        └── No → Manual conversion
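
The decision tree above can be sketched as a small function. This is a minimal illustration, not the skill's actual implementation: the `kind` and `hasComplexTables` fields and the returned labels are hypothetical names chosen for the example.

```javascript
// Sketch of the tool-selection tree. Field names (`kind`, `hasComplexTables`)
// and the returned labels are hypothetical; they are not part of the skill or of MinerU.
function selectTool(doc, mineruAvailable) {
  if (doc.kind === "audio") {
    return "external-transcription";
  }
  if (doc.kind !== "pdf") {
    return "manual";
  }
  if (doc.hasComplexTables) {
    // Complex tables: prefer MinerU in vlm mode, otherwise convert by hand and review.
    return mineruAvailable ? "mineru-vlm" : "manual-with-review";
  }
  // PDF with simple formatting.
  return "manual-or-external-tool";
}

console.log(selectTool({ kind: "pdf", hasComplexTables: true }, true));
```

Keeping the availability check as a separate argument means the same function covers both the "MinerU configured" and "no API key" cases.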

Usage Examples

MinerU (Complex PDF)

Use mineru_parse to convert this academic paper:
- URL: https://example.com/paper.pdf
- Model: vlm (for 90%+ accuracy)
- Enable: formula, table recognition

Manual Conversion (Simple Document)

For simple PDFs without MinerU:
1. Use Adobe Acrobat to export to Word/text
2. Or open in Google Docs for auto-OCR
3. Review and clean up formatting

Batch Processing

For multiple PDFs:
1. Check which PDFs have complex tables
2. Process the simple ones with manual conversion
3. Queue the complex ones for a MinerU batch if an API key is available; otherwise convert them manually and review the output
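
One way to picture the batch steps above is as a planner that sorts documents into queues. This is a hedged sketch: `planBatch`, the `hasComplexTables` flag, and the queue names are assumptions made for illustration.

```javascript
// Hypothetical batch planner: splits PDFs into a MinerU queue and manual queues.
// `hasComplexTables` is an assumed flag recorded during the initial check.
function planBatch(docs, hasApiKey) {
  const plan = { mineru: [], manual: [], manualWithReview: [] };
  for (const doc of docs) {
    if (!doc.hasComplexTables) {
      plan.manual.push(doc.url);
    } else if (hasApiKey) {
      plan.mineru.push(doc.url);
    } else {
      // No API key: complex documents fall back to manual conversion plus review.
      plan.manualWithReview.push(doc.url);
    }
  }
  return plan;
}

const plan = planBatch(
  [
    { url: "url1", hasComplexTables: true },
    { url: "url2", hasComplexTables: false },
  ],
  true
);
console.log(plan.mineru.length, plan.manual.length);
```

The `plan.mineru` list would then be the natural candidate for the `urls` argument of a batch request.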

MinerU Specific Features

VLM vs Pipeline Mode

VLM Mode: Uses vision-language model, 90%+ accuracy, slower
Pipeline Mode: Traditional parsing, faster, lower accuracy

Page Selection

Parse only specific pages:
mineru_parse({
  url: "https://...",
  pages: "1-10,15,20-25"
})
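
Before sending a page selection, it can help to validate the string locally. The helper below is illustrative only. It assumes the comma-and-hyphen format shown in the example above, and does not confirm what other syntax MinerU accepts.

```javascript
// Expands a page spec such as "1-10,15,20-25" into a sorted list of page numbers.
// Throws on malformed segments or reversed ranges. Illustrative helper, not a MinerU API.
function expandPages(spec) {
  const pages = new Set();
  for (const part of spec.split(",")) {
    const match = /^\s*(\d+)(?:\s*-\s*(\d+))?\s*$/.exec(part);
    if (!match) {
      throw new Error(`Invalid page spec segment: "${part}"`);
    }
    const start = Number(match[1]);
    const end = match[2] === undefined ? start : Number(match[2]);
    if (start < 1 || end < start) {
      throw new Error(`Invalid page range: "${part.trim()}"`);
    }
    for (let page = start; page <= end; page += 1) {
      pages.add(page);
    }
  }
  return [...pages].sort((a, b) => a - b);
}

console.log(expandPages("1-3,5").join(",")); // 1,2,3,5
```

Expanding the spec also gives a page count up front, which is useful when cost is charged per page.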

Batch Processing

Process multiple documents:
mineru_batch({
  urls: ["url1", "url2", "url3"],
  model: "vlm"
})

Output Quality Checklist

After conversion, verify:

Text is accurately extracted
Tables maintain structure
Headers/sections are correct
Figures have descriptions (if MinerU)
Formulas are readable (if MinerU)
No garbled text from OCR errors
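
Parts of this checklist can be approximated automatically before a human review. The sketch below assumes the converted output is Markdown with pipe-delimited tables. The function name and the specific heuristics are illustrative, and they do not replace reading the result.

```javascript
// Heuristic checks on converted Markdown. Assumes pipe-delimited tables;
// escaped pipes inside cells are not handled. Illustrative only.
function checkConversion(markdown) {
  const issues = [];
  if (markdown.trim().length === 0) {
    issues.push("empty output");
  }
  // U+FFFD is the Unicode replacement character, a common sign of decoding damage.
  if (markdown.includes("\uFFFD")) {
    issues.push("replacement characters found");
  }
  if (!/^#{1,6}\s+\S/m.test(markdown)) {
    issues.push("no headings found");
  }
  // Consecutive table rows should all have the same number of cells.
  let expectedCells = null;
  for (const line of markdown.split("\n")) {
    const row = line.trim();
    if (row.length > 1 && row.startsWith("|") && row.endsWith("|")) {
      const cells = row.slice(1, -1).split("|").length;
      if (expectedCells !== null && cells !== expectedCells) {
        issues.push("table rows with inconsistent column counts");
        break;
      }
      expectedCells = cells;
    } else {
      expectedCells = null;
    }
  }
  return issues;
}

console.log(checkConversion("# Title\n\n| a | b |\n|---|---|\n| 1 | 2 |\n").length); // 0
```

An empty result only means none of these heuristics fired. Figure descriptions and formula readability still need a manual look.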

Integration with Research Workflow

For Literature (Stream A)

Identify papers to convert
Complex papers → MinerU
Simple papers → Manual conversion
Store in stream-a-theoretical/papers/

For Data Documents (Stream B)

Interview recordings → External transcription service (Otter.ai, Rev.com)
PDF field notes → MinerU or manual conversion
Store in appropriate stage folder

Fallback Options

If MinerU unavailable:

Adobe Acrobat - Export to Word
Google Docs - Open PDF for auto-OCR
Tesseract OCR - Command-line tool
Manual transcription - Last resort

Audio Transcription

For audio files, use external services:

Otter.ai - Automated transcription service
Rev.com - Professional transcription
YouTube - Upload as unlisted video for auto-captions

Related

MCPs: MinerU (optional)
Skills: interview-ingest for audio, literature-sweep for papers
Configuration: .mcp.json defines MCP availability
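
Since MinerU availability depends on configuration, a quick check might look like the sketch below. It assumes the server is registered under the key `mineru` in the `mcpServers` map of `.mcp.json` (that key name is a guess), and that the API key is supplied either in the server's `env` block or in the process environment. Placeholder values such as `${MINERU_API_KEY}` are not resolved here.

```javascript
// Decides whether MinerU can be used, given a parsed .mcp.json object and an environment map.
// The "mineru" server key is hypothetical; adjust it to match the actual configuration.
function isMineruAvailable(mcpConfig, env) {
  const servers = (mcpConfig && mcpConfig.mcpServers) || {};
  const server = servers.mineru;
  if (!server) {
    return false;
  }
  const key = (server.env && server.env.MINERU_API_KEY) || env.MINERU_API_KEY;
  return typeof key === "string" && key.trim().length > 0;
}

// Possible usage (requires a .mcp.json in the working directory):
//   const fs = require("node:fs");
//   isMineruAvailable(JSON.parse(fs.readFileSync(".mcp.json", "utf8")), process.env);
console.log(isMineruAvailable({ mcpServers: {} }, {})); // false
```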