From se7en-skills
Converts PDF, Word, PowerPoint (PPTX, PPT), and TXT documents to Markdown, preserving headings, lists, tables, overall structure, and per-slide sections. Adds front matter, annotates images, and suggests an output path for archiving or import.
npx claudepluginhub yiliqi78/se7en-skills

This skill uses the workspace's default tool permissions.
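| Format | Extension | Handling |
|---|---|---|
| PDF | .pdf | Extract text, preserving structure |
| Word | .docx, .doc | Extract text and formatting |
| PowerPoint | .pptx, .ppt | Extract slide by slide, preserving hierarchy |
| Plain text | .txt | Convert directly |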
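
The table above amounts to a lookup from file extension to handling strategy. A minimal sketch of that lookup follows, assuming the strategy is chosen purely from the extension; `HANDLERS`, `pick_handler`, and the strategy labels are hypothetical names used for illustration, not part of the skill.

```python
from pathlib import Path

# Extension -> strategy, following the table. The strategy labels are
# hypothetical; the skill itself does not define these identifiers.
HANDLERS = {
    ".pdf": "extract_text_keep_structure",
    ".docx": "extract_text_and_formatting",
    ".doc": "extract_text_and_formatting",
    ".pptx": "extract_per_slide",
    ".ppt": "extract_per_slide",
    ".txt": "convert_directly",
}


def pick_handler(filename: str) -> str:
    """Return the handling strategy for a file based on its extension."""
    ext = Path(filename).suffix.lower()  # case-insensitive: ".PDF" -> ".pdf"
    if ext not in HANDLERS:
        raise ValueError(f"Unsupported format: {ext or '(no extension)'}")
    return HANDLERS[ext]
```

For example, `pick_handler("deck.PPTX")` returns `"extract_per_slide"`, while a file such as `data.xlsx` raises `ValueError` because the table does not list it.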
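1. Read the document uploaded by the user and identify its format and structure.
2. Convert the document content to Markdown, keeping headings, lists, and tables, and annotating each image as `[Image: description]`.
3. Ask the user to confirm the output path:

   > Conversion complete. Suggested file name: `[suggested-filename].md`. Where should it be saved? (Give a path, or I'll save it to the current directory.)

4. Write the converted content to the confirmed location.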
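
The path-confirmation and write steps above can be sketched as follows. This is only an illustration under stated assumptions: the suggested name is derived from the source file's stem, an answer that does not end in `.md` is treated as a directory, and an empty answer falls back to the current directory. The function names are hypothetical.

```python
from pathlib import Path
from typing import Optional


def suggest_filename(source_name: str) -> str:
    """Suggest a Markdown file name from the source name (assumed rule)."""
    return Path(source_name).stem + ".md"


def resolve_output_path(source_name: str, user_path: Optional[str] = None) -> Path:
    """Combine the user's answer with the suggested file name."""
    suggested = suggest_filename(source_name)
    if not user_path:
        # No answer: save to the current directory.
        return Path.cwd() / suggested
    target = Path(user_path)
    if target.suffix.lower() != ".md":
        # Assumption: anything not ending in .md is a directory.
        return target / suggested
    return target


def save_markdown(markdown: str, destination: Path) -> Path:
    """Write the converted text, creating parent directories if needed."""
    destination.parent.mkdir(parents=True, exist_ok=True)
    destination.write_text(markdown, encoding="utf-8")
    return destination
```

Keeping path resolution separate from writing means the proposed location can be shown to the user before anything touches the disk.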
# Level 1 heading (document title)
## Level 2 heading (section)
### Level 3 heading (subsection)
---
title: Document title
source: original-filename.pdf
converted: 2026-02-03
tags: [tag1, tag2]
---
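
A front matter block like the one above could be generated with a small helper. The sketch below assumes the four fields shown (`title`, `source`, `converted`, `tags`) and an ISO date. `build_frontmatter` is a hypothetical name, and the helper does no YAML escaping, so values containing characters such as `:` would need quoting.

```python
from datetime import date
from typing import List, Optional


def build_frontmatter(title: str, source: str, tags: List[str],
                      converted: Optional[date] = None) -> str:
    """Render the front matter block; defaults to today's date."""
    converted = converted or date.today()
    lines = [
        "---",
        f"title: {title}",
        f"source: {source}",
        f"converted: {converted.isoformat()}",  # YYYY-MM-DD
        f"tags: [{', '.join(tags)}]",
        "---",
    ]
    return "\n".join(lines) + "\n"
```

Calling `build_frontmatter("Document title", "original.pdf", ["tag1", "tag2"], date(2026, 2, 3))` yields the same layout as the template, with `converted: 2026-02-03`.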
---
## Slide 1: [Title]
[Content]
---
## Slide 2: [Title]
[Content]
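
The per-slide layout above can be produced from a list of `(title, content)` pairs. This is a minimal sketch, assuming slides have already been extracted into that shape; `slides_to_markdown` and its `label` parameter are hypothetical. Unlike the template, it puts blank lines around each `---`, since in CommonMark a `---` directly under a line of text is read as a heading underline rather than a separator.

```python
from typing import List, Tuple


def slides_to_markdown(slides: List[Tuple[str, str]], label: str = "Slide") -> str:
    """Render slides as numbered sections separated by horizontal rules."""
    sections = []
    for number, (title, body) in enumerate(slides, start=1):
        # Blank lines keep '---' a thematic break, not a setext underline.
        sections.append(f"---\n\n## {label} {number}: {title}\n\n{body.strip()}\n")
    return "\n".join(sections)
```

Passing `label="幻灯片"` reproduces the original Chinese section headings.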