Skill

rag

Guides building RAG systems for Q&A, chatbots, knowledge bases, covering embedding models, chunking strategies, vector stores, ingestion pipelines, retrieval optimization.

Python

OpenAI

PostgreSQL

ai-ml

npx claudepluginhub arbazkhan971/godmode

Popularity

Stars

Forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/godmode:rag

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

- `/godmode:rag`, "build RAG system", "knowledge base"

SKILL.md

173 lines · ~1.2k tokens

Similar Skills

rag-implementation

Build RAG systems for LLM apps using vector databases, embeddings, and retrieval strategies. Use for document Q&A, grounded chatbots, and semantic search.

llm-application-dev

rag-implementation

40.2k

Guides RAG implementation from requirements to LLM integration, covering embedding selection, vector DB setup, chunking strategies, and retrieval optimization.

antigravity-awesome-skills

RAG Implementation

faos-data-ai-architect

Stats

LanguageShell

Stars18

Forks8

MaintenanceExcellent

Last CommitApr 25, 2026

Actions

View Source View Plugin View on GitHub View README

Help us improve

Share bugs, ideas, or general feedback.

Stats

Actions

Help us improve

Share bugs, ideas, or general feedback.

Use case: <questions the system must answer> Data sources: <docs, wiki, DB, PDFs, code> Corpus: <N documents, N tokens, N MB> Update frequency: static|daily|real-time Query patterns: Factual lookup (single-hop retrieval) Analytical (multi-document retrieval) Conversational (multi-turn Q&A) Structured (metadata filtering + retrieval)

| Model | Dims | MTEB | Cost | | text-embedding-3-large | 3072 | 64.6 | $0.13/1M | | text-embedding-3-small | 1536 | 62.3 | $0.02/1M | | Cohere embed-v3 | 1024 | 64.5 | $0.10/1M | | Voyage voyage-3 | 1024 | 67.1 | $0.06/1M | | BGE-large-en-v1.5 | 1024 | 64.2 | Free* |

Hybrid search (RECOMMENDED for production): Dense (vector): semantic similarity Sparse (BM25): keyword/exact matching Fusion: Reciprocal Rank Fusion (RRF) Top-K: 5-20 chunks (start with 10) Reranker: cross-encoder on top-20 results (highest-impact single optimization)

Context window budget: System prompt: <N tokens> Retrieved context: <N tokens> Conversation history: <N tokens> Output reservation: <N tokens> Total < model context limit Assembly: rank by relevance, include until budget. Format with source attribution.

Retrieval metrics: Hit rate @ K: % queries with answer in top-K MRR: average 1/rank of first correct result Generation metrics: Faithfulness: grounded in retrieved context Hallucination rate: answers without evidence Targets: Recall@10 >= 80%, MRR >= 0.7 Faithfulness >= 90%, Hallucination < 5%

Failure	Action
Low recall < 70%	Increase overlap, add BM25, reranker
High hallucination	Add "only use context", reduce chunks
High latency	Cache frequent queries, reduce top-K

Failure

Action

Low recall < 70%

Increase overlap, add BM25, reranker

High hallucination

Add "only use context", reduce chunks

High latency

Cache frequent queries, reduce top-K

Failure	Action
Low recall < 70%	Increase overlap, add BM25, reranker
High hallucination	Add "only use context", reduce chunks
High latency	Cache frequent queries, reduce top-K

Failure

Action

Low recall < 70%

Increase overlap, add BM25, reranker

High hallucination

Add "only use context", reduce chunks

High latency

Cache frequent queries, reduce top-K

rag

Popularity

Invocation

Context Preview

SKILL.md

Similar Skills

Help us improve

Help us improve

Find plugins for your project

rag

Popularity

Invocation

Context Preview

SKILL.md

Activate When

Workflow

1. Requirements

2. Embedding Model Selection

3. Chunking Strategy

4. Vector Store

5. Ingestion Pipeline

6. Retrieval Optimization

7. Context Assembly

8. Evaluation

Hard Rules

TSV Logging

Keep/Discard

Stop Conditions

Autonomous Operation

Error Recovery

Similar Skills

Help us improve

Activate When

Workflow

1. Requirements

2. Embedding Model Selection

3. Chunking Strategy

4. Vector Store

5. Ingestion Pipeline

6. Retrieval Optimization

7. Context Assembly

8. Evaluation

Hard Rules

TSV Logging

Keep/Discard

Stop Conditions

Autonomous Operation

Error Recovery