By the-focus-ai
Hybrid document search using embeddings and grep for semantic + literal matching with metadata extraction and filtering
npx claudepluginhub the-focus-ai/claude-marketplace --plugin embeddings-search-skillA Claude Code plugin that provides hybrid document search combining grep (literal matching) with semantic embeddings for comprehensive search across project documents.
Part of The-Focus-AI/claude-marketplace.
.embeddings/index.json file for fast loadingInstall via Claude Code plugin marketplace or add directly:
claude --plugin-dir /path/to/embeddings-search-skill
The skill activates automatically when searching for documents by meaning:
# Index a directory (default: sliding window strategy)
npx tsx src/cli.ts index .
# Index with specific strategy
npx tsx src/cli.ts index . --strategy paragraph
npx tsx src/cli.ts index . --strategy sentence
npx tsx src/cli.ts index . --strategy sliding --window 500 --overlap 150
# Search documents
npx tsx src/cli.ts search "budget concerns"
# Search with filters
npx tsx src/cli.ts search "love" --collection "jane austin"
npx tsx src/cli.ts search "meeting notes" --type report
npx tsx src/cli.ts search "quarterly review" --after 2024-01-01 --before 2024-04-01
# View discovered taxonomy
npx tsx src/cli.ts taxonomy .
# List files matching filters
npx tsx src/cli.ts files --collection "jane austin" --type novel
# View index statistics
npx tsx src/cli.ts stats .
The indexer automatically extracts metadata from:
--collection <name> - Filter by collection (derived from directory names)--type <type> - Filter by document type (novel, letter, report, essay, documentation, notes)--author <name> - Filter by author--topic <topic> - Filter by topic (love, marriage, society, money, etc.)--path <pattern> - Filter by file path pattern--after <date> - Only documents modified after date--before <date> - Only documents modified before dateDirectly indexed:
.txt - Plain text.md - MarkdownRequires text extraction first:
.pdf - PDF documents.doc, .docx - Word documents.ppt, .pptx - PowerPoint presentationsFor binary files, create a .txt sidecar with extracted text (e.g., report.pdf.txt).
Index is stored in .embeddings/index.json containing:
@xenova/transformers - Local embedding modelfast-glob - File discoverytsx - TypeScript executionMIT
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Manus-style persistent markdown files for planning, progress tracking, and knowledge storage. Works with Claude Code, Kiro, Clawd CLI, Gemini CLI, Cursor, Continue, Hermes, and 17+ AI coding assistants. Now with Arabic, German, Spanish, and Chinese (Simplified & Traditional) support.
Payload Development plugin - covers collections, fields, hooks, access control, plugins, and database adapters.
Write SQL, explore datasets, and generate insights faster. Build visualizations and dashboards, and turn raw data into clear stories for stakeholders.
Intelligent draw.io diagramming plugin with AI-powered diagram generation, multi-platform embedding (GitHub, Confluence, Azure DevOps, Notion, Teams, Harness), conditional formatting, live data binding, and MCP server integration for programmatic diagram creation and management.