From claude-mods
Converts local documents (PDF, DOCX, XLSX, PPTX, images via OCR, audio) to Markdown using Microsoft's markitdown CLI over Bash. Triggers on file parsing requests.
npx claudepluginhub 0xdarkmatter/claude-modsThis skill is limited to using the following tools:
Convert local documents to clean Markdown. One tool for PDF, Word, Excel, PowerPoint, images, and more.
Creates isolated Git worktrees for feature branches with prioritized directory selection, gitignore safety checks, auto project setup for Node/Python/Rust/Go, and baseline verification.
Executes implementation plans in current session by dispatching fresh subagents per independent task, with two-stage reviews: spec compliance then code quality.
Dispatches parallel agents to independently tackle 2+ tasks like separate test failures or subsystems without shared state or dependencies.
Convert local documents to clean Markdown. One tool for PDF, Word, Excel, PowerPoint, images, and more.
| Use Case | Recommendation |
|---|---|
| Local files (PDF, Word, Excel) | ✅ Use markitdown - unique capability |
| Web pages | ❌ Use Jina (r.jina.ai/) - 5x faster |
| Blocked/anti-bot sites | ❌ Use Firecrawl |
| OCR on images | ✅ Use markitdown |
| Audio transcription | ✅ Use markitdown |
# Local files (primary use case)
markitdown document.pdf
markitdown report.docx
markitdown data.xlsx
markitdown slides.pptx
markitdown screenshot.png # OCR
# URLs (works, but Jina is faster)
markitdown https://example.com
# Save output
markitdown document.pdf > document.md
| Format | Extensions | Notes |
|---|---|---|
.pdf | Text extraction, tables | |
| Word | .docx | Formatting preserved |
| Excel | .xlsx | Tables to markdown |
| PowerPoint | .pptx | Slides as sections |
| Images | .jpg, .png | OCR text extraction |
| HTML | .html | Clean conversion |
| Audio | .mp3, .wav | Speech-to-text |
| Text | .txt, .csv, .json, .xml | Pass-through/structure |
| URLs | https://... | Works but slower than Jina |
| Tool | Avg Speed | Success Rate |
|---|---|---|
| Jina | 0.5s | 10/10 |
| markitdown | 2.5s | 9/10 |
| Firecrawl | 4.5s | 10/10 |
Verdict: For URLs, use Jina. For local files, markitdown is the only option.
# PDF to markdown (primary use case)
markitdown report.pdf > report.md
# Excel spreadsheet
markitdown financials.xlsx
# Image with text (OCR)
markitdown screenshot.png
# PowerPoint deck
markitdown presentation.pptx > slides.md
# Audio transcription
markitdown meeting.mp3 > transcript.md
| Task | markitdown | Alternative |
|---|---|---|
| PDF text | markitdown file.pdf | PyMuPDF, pdfplumber |
| Word docs | markitdown file.docx | python-docx |
| Excel | markitdown file.xlsx | pandas, openpyxl |
| OCR | markitdown image.png | Tesseract |
| Web pages | Use Jina instead | r.jina.ai/URL (5x faster) |
markitdown's advantage: One CLI for all local document formats. No code needed.