By Ian-q
Maps, connects, and remembers your documentation. Structural scanning, LLM semantic audit, Qdrant embedding, and semantic search.
Bootstrap a Carta knowledge-graph environment in the current repository.
Audit repository documentation for structural and semantic issues, assign stable AUDIT-NNN IDs, and update docs/AUDIT_REPORT.md and docs/BACKLOG/TRIAGE.md.
Embed documents into the Carta knowledge graph and enrich newly embedded files with spec summaries.
Search the Carta knowledge graph with a natural language query and present results with source citations.
This plugin requires configuration values that are prompted when the plugin is enabled. Sensitive values are stored in your system keychain.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
ollama_urlOllama URL for proactive recall (default: http://localhost:11434 — leave blank to disable)
${user_config.ollama_url}qdrant_urlQdrant URL (default: http://localhost:6333 — leave blank to use default)
${user_config.qdrant_url}
Maps, connects, and remembers your documentation.
Carta is a Claude Code plugin that keeps your project docs honest — auditing for contradictions, embedding reference material into a searchable knowledge base, and surfacing the right context exactly when you need it.
Fast-moving projects accumulate documentation debt quietly. You write a spec. An AI agent writes a dozen more files based on it. The spec changes. Three weeks later, four different documents describe the same API endpoint four different ways, and nobody — human or AI — knows which one is right.
This problem gets worse the more you lean on AI agents to help you work. Agents are only as good as the context they can see, and when your docs/ folder is a fog of contradictions and stale frontmatter, you're giving your agent a map that leads off a cliff.
Carta started as a happy accident. While working through a project with a lot of PDFs, datasheets, and fast-changing markdown — the kind of repo where the hardware changes on Thursday and the docs are still describing Wednesday — we built a small structural scanner to flag stale and broken cross-references. Then we added a semantic pass. Then a vector store. Then a /doc-search skill so Claude could query the embedded knowledge directly.
At some point we looked at what we had and realized: this is a thing. It works. It's small, it runs locally, it requires no new services beyond what an LLM-augmented developer already has running. So we generalized it.
Three things, tightly integrated:
A two-pass system that runs on a schedule or on demand:
related: links, homeless markdown files, and orphaned content. Runs fast, runs often.docs/AUDIT_REPORT.md with stable AUDIT-NNN issue IDs that persist across runs.Ingests your reference material — PDFs, datasheets, manuals, audio transcripts — into a local Qdrant vector store via Ollama. Generates spec_summary blocks for dense documents so the audit agent can cross-reference them without re-reading 200 pages.
Natural language recall over everything that's been embedded. Ask Claude what the docs say about rate limiting, authentication flows, power supply constraints, sample naming conventions — whatever's in your knowledge base — and get cited answers back.
Search is hybrid (dense + BM25 with Reciprocal Rank Fusion) by default, with an optional
ColPali visual layer for image-heavy PDF pages. Measured on a real technical-docs corpus
(~160 markdown docs + 214 datasheet PDFs, local models — nomic-embed-text + Qdrant/bm25):
Text retrieval — markdown eval, 20 queries:
| Pipeline | recall@5 | MRR |
|---|---|---|
| Dense only (cosine) | 0.550 | 0.402 |
| Hybrid (BM25 + dense, RRF) | 0.700 | 0.546 |
On an expanded 62-query set over the same corpus (adds datasheet, supplier, and patent
reference docs): hybrid alone scores 0.790 / 0.641, and the LLM reranker (qwen3.5:9b,
candidate pool 40) lifts it to 0.871 / 0.778 — with rerank: applied on 61/62 queries
confirming the reranker actually ran on every scored query but one.
Visual retrieval — datasheet eval, 14 queries:
| Pipeline | recall@5 | MRR |
|---|---|---|
| Text / OCR only | 0.500 | 0.429 |
| + ColPali visual (two-pass) | 0.857 | 0.589 |
The datasheet set includes 6 "visual-only" queries whose answer lives on a diagram, package drawing, or derating curve that text search structurally can't reach — ColPali lifts those from 0/6 to 5/6. Text and visual hits are fused by rank (RRF), so the visual layer never crowds out text results.
These are one project's eval sets, not a public benchmark — they show the delta each layer adds on real technical docs, not an absolute SOTA claim.
When search.rerank.enabled is true, carta eval also prints rerank: applied on N/M queries
— and fails (exit 1) if the reranker ran on zero queries, so a silent fail-open (wrong model
name, Ollama down, reasoning-model misconfig) can never masquerade as a reranked result.
npx claudepluginhub ian-q/carta --plugin carta-ccMake your AI agent code with your project's architecture, rules, and decisions.
Claude + Obsidian knowledge companion. Sets up a persistent, compounding wiki vault (Karpathy's LLM Wiki pattern). v1.7 "Compound Vault" + v1.8 methodology modes close 5 of 5 priority gaps from the May 2026 compass artifact. Ships: substrate alignment with kepano/obsidian-skills, default Obsidian CLI transport, hybrid retrieval (contextual prefix + BM25 + cosine rerank per Anthropic's Sept 2024 research), per-file advisory locking for multi-writer safety, pre-commit verifier agent, AND methodology modes (LYT / PARA / Zettelkasten / Generic) for first-class organizational support no other Claude+Obsidian competitor offers. v1.7.x audit closure: every BLOCKER + HIGH + MEDIUM + LOW finding from the v1.7.0 audit is CLOSED or DEFERRED-with-rationale. Optional DragonScale Memory extension (log folds, deterministic addresses, semantic tiling lint, boundary-first autoresearch).
Complete AI coding workflow system. Self-correcting memory + persistent FTS5-indexed research wikis + auto-research loop + multi-LLM council on a single SQLite store. 33 skills, 8 agents, 22 commands, 37 hook scripts across 24 events. Cross-agent via SkillKit.
Access official Microsoft documentation, API references, and code samples for Azure, .NET, Windows, and more.
Upstash Context7 MCP server for up-to-date documentation lookup. Pull version-specific documentation and code examples directly from source repositories into your LLM context.
Connect to Atlassian products including Jira and Confluence. Search and create issues, access documentation, manage sprints, and integrate your development workflow with Atlassian's collaboration tools.