From nx
Use when PDF documents need to be indexed into nx store for semantic search
npx claudepluginhub hellblazer/nexus --plugin nxThis skill uses the workspace's default tool permissions.
- When PDFs need to be indexed for semantic search
Indexes PDF documents with LightRAG, extracts text via PyMuPDF, builds embeddings and knowledge graphs, enables hybrid semantic searches with citations for document Q&A.
Ingests PDF datasheets or reference manuals into the embedded docs search index via ingest_docs tool. Reports chunks ingested and tables found.
Indexes registered code libraries from libraries.yaml by chunking files, embedding with FastEmbed, and storing in local Qdrant vector DB for semantic search.
Share bugs, ideas, or general feedback.
For a single PDF, run directly without spawning an agent:
# Index a PDF into T3 (docs__ collection)
nx index pdf /path/to/file.pdf --corpus {corpus-name} --monitor
# Index into knowledge__ collection (for reference material)
nx index pdf /path/to/file.pdf --collection knowledge__{name} --monitor
# Dry-run preview (extract and embed locally without storing)
nx index pdf /path/to/file.pdf --corpus {corpus-name} --dry-run
# Force re-index (bypass staleness check)
nx index pdf /path/to/file.pdf --corpus {corpus-name} --force --monitor
# Verify indexing — use search tool: query="representative query", corpus="docs__{corpus-name}", limit=3
Corpus naming: Use author-year-short-title pattern. The --corpus flag auto-prepends docs__ — do NOT include the prefix.
Delegates to the pdf-chromadb-processor agent (haiku) for:
## Relay: pdf-chromadb-processor
**Task**: [what needs to be done]
**Bead**: [ID] or 'none'
### Input Artifacts
- Files: [PDF paths or URLs]
### Deliverable
Indexed PDF content in T3 via `nx index pdf`, with chunk counts and searchability verified
### Quality Criteria
- [ ] All PDFs indexed via `nx index pdf`
- [ ] Content searchable via search tool
- [ ] Processing report with chunk counts
For full relay structure and optional fields, see RELAY_TEMPLATE.md.
nx index pdf (not manual store_put tool)author-year-short-title conventionnx index pdf native pipeline)Outputs generated by the pdf-chromadb-processor agent:
nx index pdf pipeline (Docling extraction, context-safe chunking, Voyage embeddings, atomic storage)Session Scratch (T1): Agent uses scratch tool for ephemeral working notes during the session. Flagged items auto-promote to T2 at session end.