🚨 EXECUTION NOTICE FOR CLAUDE
When you invoke this command via SlashCommand, the system returns THESE INSTRUCTIONS below.
YOU are the executor. This is NOT an autonomous subprocess.
- ✅ The phases below are YOUR execution checklist
- ✅ YOU must run each phase immediately using tools (Bash, Read, Write, Edit, TodoWrite)
- ✅ Complete ALL phases before considering this command done
- ❌ DON'T wait for "the command to complete" - YOU complete it by executing the phases
- ❌ DON'T treat this as status output - it IS your instruction set
Immediately after SlashCommand returns, start executing Phase 0, then Phase 1, etc.
See @CLAUDE.md section "SlashCommand Execution - YOU Are The Executor" for detailed explanation.
Available Skills
This command has access to the following skills from the rag-pipeline plugin:
- chunking-strategies: Document chunking implementations and benchmarking tools for RAG pipelines including fixed-size, semantic, recursive, and sentence-based strategies. Use when implementing document processing, optimizing chunk sizes, comparing chunking approaches, benchmarking retrieval performance, or when user mentions chunking, text splitting, document segmentation, RAG optimization, or chunk evaluation.
- document-parsers: Multi-format document parsing tools for PDF, DOCX, HTML, and Markdown with support for LlamaParse, Unstructured.io, PyPDF2, PDFPlumber, and python-docx. Use when parsing documents, extracting text from PDFs, processing Word documents, converting HTML to text, extracting tables from documents, building RAG pipelines, chunking documents, or when user mentions document parsing, PDF extraction, DOCX processing, table extraction, OCR, LlamaParse, Unstructured.io, or document ingestion.
- embedding-models: Embedding model configurations and cost calculators
- langchain-patterns: LangChain implementation patterns with templates, scripts, and examples for RAG pipelines
- llamaindex-patterns: LlamaIndex implementation patterns with templates, scripts, and examples for building RAG applications. Use when implementing LlamaIndex, building RAG pipelines, creating vector indices, setting up query engines, implementing chat engines, integrating LlamaCloud, or when user mentions LlamaIndex, RAG, VectorStoreIndex, document indexing, semantic search, or question answering systems.
- retrieval-patterns: Search and retrieval strategies including semantic, hybrid, and reranking for RAG systems. Use when implementing retrieval mechanisms, optimizing search performance, comparing retrieval approaches, or when user mentions semantic search, hybrid search, reranking, BM25, or retrieval optimization.
- vector-database-configs: Vector database configuration and setup for pgvector, Chroma, Pinecone, Weaviate, Qdrant, and FAISS with comparison guide and migration helpers
- web-scraping-tools: Web scraping templates, scripts, and patterns for documentation and content collection using Playwright, BeautifulSoup, and Scrapy. Includes rate limiting, error handling, and extraction patterns. Use when scraping documentation, collecting web content, extracting structured data, building RAG knowledge bases, harvesting articles, crawling websites, or when user mentions web scraping, documentation collection, content extraction, Playwright scraping, BeautifulSoup parsing, or Scrapy spiders.
To use a skill:
!{skill skill-name}
Use skills when you need:
- Domain-specific templates and examples
- Validation scripts and automation
- Best practices and patterns
- Configuration generators
Skills provide pre-built resources to accelerate your work.
Security Requirements
CRITICAL: All generated files must follow security rules:
@docs/security/SECURITY-RULES.md
Key requirements:
- Never hardcode API keys or secrets
- Use placeholders: your_service_key_here
- Protect .env files with .gitignore
- Create .env.example with placeholders only
- Document key acquisition for users
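For reference, a config/.env.example following these rules might look like the sketch below. The variable names are illustrative assumptions (any keys the generated code actually reads should be used instead):

```
# config/.env.example - placeholders only; never commit real keys.
# Copy to config/.env and fill in values obtained from each provider.
OPENAI_API_KEY=your_openai_api_key_here
PINECONE_API_KEY=your_pinecone_api_key_here
VECTOR_DB=chroma
CHROMA_PERSIST_DIR=./data/processed/chroma
```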
Arguments: $ARGUMENTS
Goal: Initialize complete RAG project with user-selected framework, directory structure, dependencies, and starter code.
Core Principles:
- Ask user for framework preference early
- Create clean project structure
- Install dependencies correctly
- Provide documentation links and next steps
Phase 1: Discovery and Framework Selection
Goal: Determine project path and gather user preferences
Actions:
- Parse $ARGUMENTS for project path (default to current directory if empty)
- Check if path exists: !{bash test -d "$ARGUMENTS" && echo "exists" || echo "not-found"}
- Create directory if needed: !{bash mkdir -p "$ARGUMENTS"}
- Check existing setup: !{bash ls "$ARGUMENTS"/requirements.txt "$ARGUMENTS"/venv 2>/dev/null || echo "new-project"}
- Use AskUserQuestion: "Which RAG framework? (1) LlamaIndex, (2) LangChain, (3) Both"
- Then ask: "Vector database? (Chroma/Pinecone/FAISS)"
Phase 2: Project Setup
Goal: Create directory structure and install dependencies
Actions:
- Create directories: !{bash mkdir -p "$ARGUMENTS"/{data/{raw,processed},scripts,config,notebooks,tests,src}}
- Create venv: !{bash cd "$ARGUMENTS" && python3 -m venv venv}
- Install based on selection:
- LlamaIndex: !{bash cd "$ARGUMENTS" && source venv/bin/activate && pip install llama-index python-dotenv}
- LangChain: !{bash cd "$ARGUMENTS" && source venv/bin/activate && pip install langchain langchain-community python-dotenv}
- Both: !{bash cd "$ARGUMENTS" && source venv/bin/activate && pip install llama-index langchain langchain-community python-dotenv}
- Install the selected vector DB (Chroma shown; substitute pinecone-client or faiss-cpu if chosen instead): !{bash cd "$ARGUMENTS" && source venv/bin/activate && pip install chromadb}
- Generate requirements: !{bash cd "$ARGUMENTS" && source venv/bin/activate && pip freeze > requirements.txt}
Phase 3: Code Generation
Goal: Generate framework-specific starter code and documentation
Actions:
Task(description="Generate RAG starter code", subagent_type="rag-pipeline:rag-architect", prompt="You are the rag-architect agent. Generate starter code for RAG project at $ARGUMENTS.
Framework: [User's selection from Phase 1]
Vector DB: [User's selection from Phase 1]
Create these files:
- config/.env.example - API key placeholders (OPENAI_API_KEY, etc), vector DB config
- src/rag_pipeline.py - Main RAG pipeline with document loading, chunking, embeddings, vector store, query functions
- scripts/index_documents.py - Index documents from data/raw/
- scripts/query_example.py - Example query script
- notebooks/rag_demo.ipynb - Step-by-step RAG demo notebook
- README.md with:
- Setup instructions
- Documentation links
- Project structure explanation
- Next steps guide
- .gitignore - Exclude venv/, .env, __pycache__/, *.pyc, .ipynb_checkpoints/, data/
Use framework best practices, type hints, comprehensive comments, error handling.
Expected output: All files created with working code.")
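To make the expected shape of src/rag_pipeline.py concrete, here is a minimal, framework-agnostic sketch of the chunking step the agent would flesh out. The function name and default sizes are illustrative assumptions, not the agent's required output:

```python
# Illustrative sketch only: a fixed-size overlapping chunker of the kind
# src/rag_pipeline.py might contain before embedding documents.

def chunk_text(text: str, chunk_size: int = 512, overlap: int = 64) -> list[str]:
    """Split text into fixed-size character chunks with overlap between neighbors."""
    if chunk_size <= overlap:
        raise ValueError("chunk_size must exceed overlap")
    chunks = []
    step = chunk_size - overlap  # advance by chunk_size minus the overlap
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks

if __name__ == "__main__":
    sample = "word " * 300  # ~1500 characters of dummy text
    pieces = chunk_text(sample)
    print(f"{len(pieces)} chunks, first chunk {len(pieces[0])} chars")
```

In the generated project this logic would typically be replaced by the chosen framework's splitter (e.g. a LlamaIndex or LangChain text splitter); the sketch only fixes the interface the other scripts can rely on.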
Phase 4: Git Initialization and Summary
Goal: Initialize git repository and display summary
Actions:
- Check git status: !{bash cd "$ARGUMENTS" && git status 2>&1 | grep -q "not a git repository" && echo "no-git" || echo "exists"}
- If no git, initialize: !{bash cd "$ARGUMENTS" && git init}
- Stage files: !{bash cd "$ARGUMENTS" && git add .}
- Create commit: !{bash cd "$ARGUMENTS" && git commit -m "Initial RAG project setup
Framework: [selected framework]
Vector DB: [selected vector DB]
Generated with Claude Code
Co-Authored-By: Claude <noreply@anthropic.com>"}
- Display summary:
- Project location: $ARGUMENTS
- Framework and vector DB selections
- Files created
- Next steps: cd $ARGUMENTS, source venv/bin/activate, configure .env, review README.md
- Documentation links for reference