From Obsidian Vault
Extracts and ingests document content (PDF, Office) into an Obsidian vault using MinerU. Supports direct extraction or a Zotero-linked pipeline for literature notes.
npx claudepluginhub luffysolution-svg/obsidian-vault-mcp
How this skill is triggered (by the user, by Claude, or both):
Slash command
/obsidian-vault:obsidian-mineru
The summary Claude sees in its skill listing, used to decide when to auto-load this skill:
Use this skill when the request is about full-document parsing and source-note ingestion rather than simple literature metadata import.
Use this skill when the request is about full-document parsing and source-note ingestion rather than simple literature metadata import. Prefer it for full-text extraction, PDF parsing, and source-note ingestion.
- obsidian_pipeline_ingest_item(parse_with_mineru=true) or obsidian_pipeline_parse_with_mineru.
- obsidian_pipeline_rename_mineru_images to normalize images and regenerate images-index.md.
- obsidian_write_file.
- obsidian_pipeline_doctor before pipeline extraction, or mineru-open-api --version before direct CLI extraction.
- attachments/mineru/<zoteroKey>/paper.md.
- fig-01-process-flow-diagram.png.
- attachments/mineru/<zoteroKey>/images-index.md.
- obsidian_pipeline_ingest_item.
- parse_with_mineru=true, or later call obsidian_pipeline_parse_with_mineru(zotero_key=...).
- Reading Notes, and AI Summary. To generate an empty ## AI Summary after parsing, pass write_ai_summary=true to obsidian_pipeline_parse_with_mineru.
- mineru.net, mineru.oss-cn-shanghai.aliyuncs.com, cdn-mineru.openxlab.org.cn, and *.openxlab.org.cn.
To parse a PDF directly with MinerU without going through the Zotero pipeline:
- Bash → mineru-open-api --version (or check MINERU_CLI_COMMAND env var).
mineru-open-api --files "C:\path\to\paper.pdf" --output-dir "C:\vault\mineru-output" --method auto
- paper.md: extracted Markdown
- images/: extracted figures
- obsidian_read_file on the generated .md to review.
- obsidian_write_file to copy the content into the vault's literature folder.
To extract a PDF and immediately create a literature note:
- obsidian_write_file, including frontmatter and links to extracted images.
- obsidian_pipeline_rename_mineru_images (MCP) to rename extracted images to semantic English slugs.
To process all PDFs in a folder:
Get-ChildItem "C:\zotero-exports" -Filter "*.pdf" | ForEach-Object {
$out = "C:\vault\mineru-batch\$($_.BaseName)"
mineru-open-api --files $_.FullName --output-dir $out --method auto
}
Then ingest each output folder individually following the "Extract and Ingest" workflow above.
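Where PowerShell is not available, the same loop can be sketched in Python. This is a minimal sketch, assuming the CLI accepts exactly the flags used above (--files, --output-dir, --method auto). The function names build_batch_commands and run_batch, and the dry-run default, are illustrative and not part of the plugin.

```python
from pathlib import Path
import subprocess


def build_batch_commands(pdf_dir, out_root, cli="mineru-open-api"):
    """Return one argv list per PDF in pdf_dir, mirroring the PowerShell loop.

    Each PDF gets its own output folder named after the file's base name.
    """
    commands = []
    for pdf in sorted(Path(pdf_dir).glob("*.pdf")):
        out_dir = Path(out_root) / pdf.stem
        commands.append([
            cli,
            "--files", str(pdf),
            "--output-dir", str(out_dir),
            "--method", "auto",
        ])
    return commands


def run_batch(pdf_dir, out_root, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for argv in build_batch_commands(pdf_dir, out_root):
        print(" ".join(argv))
        if not dry_run:
            subprocess.run(argv, check=True)
```

Calling run_batch with the default dry_run=True only prints the commands, which makes it easy to review the output folders before any extraction starts.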
To extract text from a Zotero-managed PDF without full MinerU parsing:
- obsidian_zotero_list_pdf_attachments (MCP).
import sys
import pypdf
reader = pypdf.PdfReader(sys.argv[1])
text = "\n".join(page.extract_text() or "" for page in reader.pages)
print(text[:5000])
Run: python extract_text.py "C:\Zotero\storage\KEY\paper.pdf"
Use when the user asks a specific question about a figure, chart, or table in a parsed paper.
Read attachments/mineru/<zoteroKey>/images-index.md with obsidian_read_file.
The index lists every figure with its semantic slug filename and the original caption context, e.g.:
- fig-01-process-flow-diagram.png (was: image-a.png)
Caption context: "Figure 1 Process flow diagram showing…"
Run obsidian_search using the slug filename (e.g. fig-01-process-flow-diagram) as the query to locate the surrounding paragraph in paper.md. The search snippet will include the figure's Markdown image tag and adjacent text.
Read the relevant section of paper.md with obsidian_read_file if the search snippet is insufficient.
Typical budget: 2–3 tool calls (read index → search → answer, or read index → read section → answer).
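Once the text of images-index.md has been read, the entries can be turned into structured records before choosing a slug to search for. The sketch below assumes the index follows the two-line layout shown in the example above (a "- <file> (was: <original>)" line followed by a "Caption context:" line). The real file may differ, and parse_images_index and find_figures are hypothetical helper names.

```python
import re

# Assumed entry layout, taken from the example above:
#   - fig-01-process-flow-diagram.png (was: image-a.png)
#     Caption context: "Figure 1 Process flow diagram showing…"
ENTRY_RE = re.compile(r"^\s*-\s+(?P<file>\S+)\s+\(was:\s*(?P<original>[^)]+)\)\s*$")
CAPTION_RE = re.compile(r'^\s*Caption context:\s*"?(?P<caption>.*?)"?\s*$')


def parse_images_index(text):
    """Parse index text into dicts with 'file', 'original', and 'caption'."""
    entries = []
    for line in text.splitlines():
        entry = ENTRY_RE.match(line)
        if entry:
            entries.append({
                "file": entry["file"],
                "original": entry["original"].strip(),
                "caption": "",
            })
            continue
        caption = CAPTION_RE.match(line)
        if caption and entries:
            # A caption line belongs to the most recent entry line.
            entries[-1]["caption"] = caption["caption"]
    return entries


def find_figures(entries, keyword):
    """Return entries whose filename or caption contains keyword (case-insensitive)."""
    needle = keyword.lower()
    return [
        e for e in entries
        if needle in e["file"].lower() or needle in e["caption"].lower()
    ]
```

The filename of a matching entry, minus its extension, is the slug to pass to obsidian_search in the next step.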
- obsidian_pipeline_parse_with_mineru(write_ai_summary=true). Must preserve literature-note YAML, ## Reading Notes, and non-empty ## AI Summary.
- obsidian_pipeline_rename_mineru_images, then report renamed files and cleanup candidates. Must treat MinerU assets as machine-generated and avoid editing the literature note except tool-managed links.
- images-index.md, locate the semantic slug/caption, then search/read nearby paper.md context. Must answer from extracted text rather than image bytes.
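The first requirement above (literature-note YAML, ## Reading Notes, and a non-empty ## AI Summary must survive) can be checked mechanically after a parse. This is a rough sketch, not the plugin's own validation: check_literature_note is a hypothetical helper, and it assumes the note starts with a ----delimited YAML block and uses exactly those two level-2 headings.

```python
import re


def check_literature_note(markdown):
    """Return a list of problems; an empty list means the note passes."""
    problems = []

    # YAML frontmatter: a '---' line at the very top, closed by another '---' line.
    if not re.match(r"\A---\r?\n.*?\r?\n---[ \t]*(\r?\n|\Z)", markdown, re.DOTALL):
        problems.append("missing YAML frontmatter")

    if not re.search(r"^## Reading Notes\s*$", markdown, re.MULTILINE):
        problems.append("missing ## Reading Notes")

    # Capture everything under '## AI Summary' up to the next level-2 heading.
    summary = re.search(
        r"^## AI Summary[ \t]*(?:\r?\n|\Z)(.*?)(?=^## |\Z)",
        markdown,
        re.MULTILINE | re.DOTALL,
    )
    if summary is None:
        problems.append("missing ## AI Summary")
    elif not summary.group(1).strip():
        problems.append("empty ## AI Summary")

    return problems
```

Running the check on the note text before and after the tool call, and comparing the two results, shows whether parsing removed any of the protected sections.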