From llm-application-dev
Selects and optimizes embedding models for semantic search and RAG. Compares dimensions, token limits, and domain fit across Voyage, OpenAI, and open-source models.
How this skill is triggered — by the user, by Claude, or both
Slash command
/llm-application-dev:embedding-strategiesThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Guide to selecting and optimizing embedding models for vector search applications.
Guide to selecting and optimizing embedding models for vector search applications.
| Model | Dimensions | Max Tokens | Best For |
|---|---|---|---|
| voyage-3-large | 1024 | 32000 | Claude apps (Anthropic recommended) |
| voyage-3 | 1024 | 32000 | Claude apps, cost-effective |
| voyage-code-3 | 1024 | 32000 | Code search |
| voyage-finance-2 | 1024 | 32000 | Financial documents |
| voyage-law-2 | 1024 | 32000 | Legal documents |
| text-embedding-3-large | 3072 | 8191 | OpenAI apps, high accuracy |
| text-embedding-3-small | 1536 | 8191 | OpenAI apps, cost-effective |
| bge-large-en-v1.5 | 1024 | 512 | Open source, local deployment |
| all-MiniLM-L6-v2 | 384 | 256 | Fast, lightweight |
| multilingual-e5-large | 1024 | 512 | Multi-language |
Document → Chunking → Preprocessing → Embedding Model → Vector
↓
[Overlap, Size] [Clean, Normalize] [API/Local]
Full template library and detailed worked examples live in references/details.md. Read that file when you need the concrete templates.
npx claudepluginhub yo-steven/agents-exploration-20260523 --plugin llm-application-devSelects and optimizes embedding models for semantic search and RAG. Covers model comparison, chunking strategies, and pipeline design.
Guides selection and optimization of embedding models for vector search and RAG, including model comparisons, chunking strategies, dimension reduction, and Python templates for OpenAI and local models.