From glean-pack
Optimizes Glean costs by filtering irrelevant content, pruning stale docs, enabling incremental indexing, and consolidating datasources. For reducing enterprise search indexing volume.
npx claudepluginhub jeremylongshore/claude-code-plugins-plus-skills --plugin glean-packThis skill is limited to using the following tools:
Glean pricing scales with indexed content volume and user count. Reduce costs by indexing only relevant content, pruning stale data, and using incremental indexing.
Optimizes Glean search relevance and indexing throughput using batch sizing, datasource configs, content quality, and incremental/bulk strategies.
Audits Algolia usage, optimizes costs via virtual replicas, batching/caching, and Analytics API monitoring. For high bills on records/searches.
Searches and fetches Glean developer docs via MCP tools for APIs, SDKs (Python/JS), MCP config, authentication, indexing, and integrations.
Share bugs, ideas, or general feedback.
Glean pricing scales with indexed content volume and user count. Reduce costs by indexing only relevant content, pruning stale data, and using incremental indexing.
| Strategy | Savings | Implementation |
|---|---|---|
| Filter irrelevant content | 20-40% | Skip drafts, templates, archived pages |
| Prune stale documents | 10-20% | Delete docs not updated in 12+ months |
| Use incremental indexing | Compute savings | Index only changed docs, not full corpus |
| Consolidate datasources | Admin savings | Fewer connectors to maintain |
| Set content size limits | Storage savings | Truncate body to ~50KB per doc |
function shouldIndex(doc: SourceDocument): boolean {
if (doc.status === 'draft' || doc.status === 'archived') return false;
if (doc.updatedAt < oneYearAgo) return false;
if (doc.title.startsWith('[Template]')) return false;
if (doc.content.length < 50) return false; // Skip near-empty pages
return true;
}
const filtered = allDocs.filter(shouldIndex);
// Typically reduces corpus by 30-50%