By ItMeDiaTech
Build and query a local RAG knowledge base from project docs using semantic vector search and hybrid retrieval, delivering AI answers with citations in Claude Code sessions. Auto-index files, enhance prompts automatically via hooks, configure embedded multi-agent framework, enable/disable RAG, all running locally with Python scripts and no API keys.
Update config to disable automatic RAG enhancement.
Update config to enable automatic RAG enhancement.
Configure embedded Multi-Agent Framework (MAF) features for RAG-CLI.
Analyze current project and index relevant documentation.
Execute the retrieval script and return results only.
Matches all tools
Hooks run on every tool call, not just specific ones
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Just a heads-up, turns out Anthropic / Claude does not like it when you avoid token usage cost by routing traffic to the CLI tool from them. This shadow banned me from their platform when I was on their $200 a month plan. They refuse to respond after months of submitting an appeal, etc, and no project I worked on violated any aspect of their Terms. After research, I see many people have been banned on similar cases. You have been warned.
Local Retrieval-Augmented Generation system for Claude Code with Multi-Agent Framework integration.
A production-ready Claude Code plugin that combines ChromaDB vector embeddings with intelligent document retrieval and Multi-Agent Framework (MAF) orchestration for context-aware development assistance.
Current Version: 2.0.0 Status: Production Ready (with known limitations documented in KNOWN_ISSUES.md)
Key Features:
Alternative Project: For a standalone CLI experience with extended features, see dt-cli. Both projects are actively maintained and can be used together.
RAG-CLI is a production-ready local Retrieval-Augmented Generation system that enhances your development workflow by providing instant access to your project documentation, codebase context, and external resources. It works seamlessly with Claude Code as a native plugin, eliminating the need for external API calls while processing documents locally with enterprise-grade security and performance.
RAG-CLI runs efficiently on:
The easiest way to get RAG-CLI as a Claude Code plugin:
# In Claude Code terminal
/plugin marketplace add https://github.com/ItMeDiaTech/rag-cli.git
/plugin install rag-cli
Then restart Claude Code. The plugin will activate automatically with zero configuration.
Benefits:
/plugin update rag-cliFor development, testing, or custom configuration:
# Clone the repository
git clone https://github.com/ItMeDiaTech/rag-cli.git
cd rag-cli
# Create virtual environment (recommended)
python -m venv venv
source venv/bin/activate # Windows: venv\Scripts\activate
# Install dependencies
pip install -r requirements.txt
# Verify installation
python -c "from rag_cli.core import embeddings; print('Installation successful!')"
For contributing to RAG-CLI:
# Clone and install in editable mode
git clone https://github.com/ItMeDiaTech/rag-cli.git
cd rag-cli
# Create virtual environment
python -m venv venv
source venv/bin/activate
# Install with development dependencies
pip install -e ".[dev]"
Local RAG plugin with Multi-Agent Framework orchestration for context-aware development assistance
npx claudepluginhub itmediatech/rag-cli --plugin rag-cliA powerful code indexing tool with multi-platform support
Document search with hybrid BM25/semantic retrieval, GraphRAG knowledge graphs, and pluggable providers for Claude Code. Index documentation and code, then search using keyword matching, semantic similarity, graph relationships, or comprehensive multi-mode fusion.
Commands for loading context and priming Claude for specific tasks
Build Retrieval-Augmented Generation pipelines
Tooling and developer productivity experts - CLI tools, documentation, README generation, and DX optimization
LLM application development with RAG, embeddings, LangChain, and prompt engineering