From clay-pack
Optimizes Clay table enrichment throughput by reordering columns, adding conditional run rules, and pre-processing input data to cut processing time and boost hit rates.
npx claudepluginhub jeremylongshore/claude-code-plugins-plus-skills --plugin clay-pack
Optimize Clay table processing speed, enrichment hit rates, and credit efficiency. Clay processes enrichment columns sequentially per row, and each enrichment column makes external API calls. Performance tuning focuses on reducing wasted enrichments, ordering columns optimally, and managing table auto-run behavior.
Clay runs enrichment columns left-to-right. Place fast columns first:
| Column Type | Typical Speed | Position |
|---|---|---|
| Company lookup (Clearbit) | ~100ms | First (fastest) |
| Email finder (single provider) | ~200ms | Second |
| Email waterfall (multi-provider) | 1-10s | Middle |
| Claygent AI research | 5-30s | Later |
| AI text generation | 2-5s | After Claygent |
| HTTP API (outbound call) | Variable | Last |
Why order matters: Fast columns populate data that slow columns may need as input (e.g., company name feeds into Claygent research prompt).
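As a planning aid, the ordering rule can be sketched as a small sort that places every dependency first and otherwise prefers faster columns. This is only an illustration: the `ColumnPlan` type, the column names, and the latency figures are assumptions taken loosely from the table above, not anything Clay exposes.

```typescript
// Hypothetical planning type; Clay has no such API. Latencies are rough estimates.
interface ColumnPlan {
  name: string;
  typicalMs: number;   // assumed typical latency per row
  dependsOn: string[]; // columns whose output this column reads
}

// Place dependencies first; among runnable columns, pick the fastest.
function planColumnOrder(columns: ColumnPlan[]): string[] {
  const remaining = [...columns];
  const placed = new Set<string>();
  const result: string[] = [];

  while (remaining.length > 0) {
    const runnable = remaining
      .filter((c) => c.dependsOn.every((d) => placed.has(d)))
      .sort((a, b) => a.typicalMs - b.typicalMs);
    if (runnable.length === 0) {
      throw new Error('Circular or missing dependency between columns');
    }
    const next = runnable[0];
    result.push(next.name);
    placed.add(next.name);
    remaining.splice(remaining.indexOf(next), 1);
  }
  return result;
}

const order = planColumnOrder([
  { name: 'claygent_research', typicalMs: 15000, dependsOn: ['company_lookup'] },
  { name: 'ai_text', typicalMs: 3000, dependsOn: ['claygent_research'] },
  { name: 'email_waterfall', typicalMs: 5000, dependsOn: [] },
  { name: 'company_lookup', typicalMs: 100, dependsOn: [] },
  { name: 'email_finder', typicalMs: 200, dependsOn: [] },
]);
console.log(order.join(' -> '));
// company_lookup -> email_finder -> email_waterfall -> claygent_research -> ai_text
```

The resulting order is then applied by hand when arranging columns in the table.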
Prevent enrichments from running on rows that won't yield results:
# In Clay column settings > "Only run if" condition:
# Email waterfall: only run if we have enough input data
ISNOTEMPTY(domain) AND ISNOTEMPTY(first_name) AND ISNOTEMPTY(last_name)
# Claygent: only run for high-value prospects
ICP Score >= 60 AND ISNOTEMPTY(Company Name)
# CRM push: only run for enriched, qualified leads
ICP Score >= 70 AND ISNOTEMPTY(Work Email)
This prevents credits and processing time from being spent on rows that lack the inputs an enrichment needs.
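Before importing, the run conditions above can be mirrored as plain predicates to estimate how many rows each column would actually process. The sketch below is an approximation: the `LeadRow` shape and its field names are assumptions chosen to resemble the column names in the conditions, and the sample rows are made up.

```typescript
// Hypothetical row shape; field names only approximate the Clay column names.
interface LeadRow {
  domain?: string;
  first_name?: string;
  last_name?: string;
  company_name?: string;
  work_email?: string;
  icp_score?: number;
}

const isNotEmpty = (v?: string): boolean => v !== undefined && v.trim() !== '';

// One predicate per "Only run if" condition.
const runRules: Record<string, (r: LeadRow) => boolean> = {
  email_waterfall: (r) =>
    isNotEmpty(r.domain) && isNotEmpty(r.first_name) && isNotEmpty(r.last_name),
  claygent: (r) => (r.icp_score ?? 0) >= 60 && isNotEmpty(r.company_name),
  crm_push: (r) => (r.icp_score ?? 0) >= 70 && isNotEmpty(r.work_email),
};

// Count how many rows each column would run on.
function countRuns(rows: LeadRow[]): Record<string, number> {
  const counts: Record<string, number> = {};
  for (const [column, rule] of Object.entries(runRules)) {
    counts[column] = rows.filter((r) => rule(r)).length;
  }
  return counts;
}

const sampleRows: LeadRow[] = [
  { domain: 'acme.com', first_name: 'Ada', last_name: 'Lovelace', company_name: 'Acme', icp_score: 82, work_email: 'ada@acme.com' },
  { domain: 'acme.com', first_name: 'Bob', last_name: '', company_name: 'Acme', icp_score: 65 },
  { domain: '', first_name: 'Cy', last_name: 'Young', icp_score: 40 },
];
const runCounts = countRuns(sampleRows);
console.log(runCounts); // { email_waterfall: 1, claygent: 2, crm_push: 1 }
```

Comparing these counts with the total row count gives a rough idea of how many enrichment runs the conditions skip.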
// src/clay/pre-process.ts — clean data before sending to Clay
interface RawLead {
domain?: string;
email?: string;
first_name?: string;
last_name?: string;
}
function preProcessForClay(rows: RawLead[]): {
ready: RawLead[];
filtered: { row: RawLead; reason: string }[];
stats: { total: number; ready: number; filtered: number; deduped: number };
} {
const personalDomains = new Set([
'gmail.com', 'yahoo.com', 'hotmail.com', 'outlook.com',
'icloud.com', 'aol.com', 'protonmail.com', 'mail.com',
]);
const seen = new Set<string>();
const ready: RawLead[] = [];
const filtered: { row: RawLead; reason: string }[] = [];
let deduped = 0;
for (const row of rows) {
// Normalize domain
const domain = row.domain?.toLowerCase().trim().replace(/^(https?:\/\/)?(www\.)?/, '').replace(/\/.*$/, '');
// Filter invalid
if (!domain || !domain.includes('.')) {
filtered.push({ row, reason: 'invalid domain' });
continue;
}
if (personalDomains.has(domain)) {
filtered.push({ row, reason: 'personal email domain' });
continue;
}
if (!row.first_name?.trim() || !row.last_name?.trim()) {
filtered.push({ row, reason: 'missing name' });
continue;
}
// Deduplicate
// Trim names so " Ada" and "Ada" are treated as the same person
const key = `${domain}:${row.first_name.trim().toLowerCase()}:${row.last_name.trim().toLowerCase()}`;
if (seen.has(key)) {
deduped++;
continue;
}
seen.add(key);
ready.push({ ...row, domain });
}
return {
ready,
filtered,
stats: {
total: rows.length,
ready: ready.length,
filtered: filtered.length,
deduped,
},
};
}
// Usage
const { ready, stats } = preProcessForClay(rawLeads);
console.log(`Pre-processing: ${stats.total} total -> ${stats.ready} ready (${stats.filtered} filtered, ${stats.deduped} deduped)`);
// Typical result: 30-50% of rows filtered, saving that many credits
Each additional waterfall provider adds 1-5 seconds per row, and on rows where the earlier providers find nothing, every further attempt can consume more credits:
# Before: 5-provider waterfall (slow, expensive)
# Each provider: ~2 credits, ~2s
# Worst case: 10 credits, 10s per row
waterfall_deep:
providers: [apollo, hunter, prospeo, dropcontact, findymail]
max_time_per_row: "~10s"
max_credits_per_row: 10
# After: 2-provider waterfall (fast, cheap)
# Covers 80%+ of findable emails with 2 providers
waterfall_optimized:
providers: [apollo, hunter]
max_time_per_row: "~4s"
max_credits_per_row: 4
coverage_loss: "~5-10%"
Rule of thumb: Apollo + one backup provider covers 80-85% of findable work emails. Adding more providers gives diminishing returns.
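The diminishing returns can be made concrete with a little expected-value arithmetic. The sketch below assumes each attempted provider costs credits and time (consistent with the worst-case figures above) and that a row only reaches a provider when all earlier ones missed. The hit rates are illustrative placeholders, not measured values.

```typescript
interface Provider {
  name: string;
  hitRate: number; // assumed chance of a hit, given earlier providers missed
  credits: number; // assumed credits per attempt
  seconds: number; // assumed latency per attempt
}

// Expected coverage, credits, and time per row for a waterfall.
function expectedWaterfall(providers: Provider[]) {
  let pReach = 1; // probability that a row reaches the current provider
  let credits = 0;
  let seconds = 0;
  for (const p of providers) {
    credits += pReach * p.credits;
    seconds += pReach * p.seconds;
    pReach *= 1 - p.hitRate;
  }
  return { coverage: 1 - pReach, credits, seconds };
}

// Placeholder hit rates: 75% for the first provider, 25% for each backup.
const mk = (name: string, hitRate: number): Provider => ({ name, hitRate, credits: 2, seconds: 2 });
const twoProviders = expectedWaterfall([mk('apollo', 0.75), mk('hunter', 0.25)]);
const fiveProviders = expectedWaterfall([
  mk('apollo', 0.75), mk('hunter', 0.25), mk('prospeo', 0.25),
  mk('dropcontact', 0.25), mk('findymail', 0.25),
]);
console.log(twoProviders);  // coverage 0.8125, credits 2.5, seconds 2.5
console.log(fiveProviders); // higher coverage, but more credits and time per row
```

With real hit rates measured from past runs, the same function shows how much coverage each extra provider buys for its added cost.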
# Table Settings in Clay UI:
table_auto_update: ON # Parent switch: if OFF, nothing auto-runs
column_settings:
company_lookup:
auto_run: ON # Runs on every new row
email_waterfall:
auto_run: ON # Runs on every new row (if condition met)
condition: "ISNOTEMPTY(domain)"
claygent_research:
auto_run: OFF # Manual trigger only (expensive)
crm_push:
auto_run: ON # Auto-push qualified leads
condition: "ICP Score >= 70"
Clay's enrichment providers may respond faster during off-peak hours (US nighttime), so large batches can be deferred to that window:
// src/clay/scheduler.ts
function shouldProcessNow(rowCount: number): { proceed: boolean; reason: string } {
const hour = new Date().getUTCHours();
const isOffPeak = hour >= 2 && hour < 8; // 02:00-07:59 UTC, matching the 02:00-08:00 window in the message below
if (rowCount < 100) {
return { proceed: true, reason: 'Small batch — process anytime' };
}
if (rowCount >= 1000 && !isOffPeak) {
return {
proceed: false,
reason: `Large batch (${rowCount} rows). Schedule for 02:00-08:00 UTC for faster provider responses.`,
};
}
return { proceed: true, reason: isOffPeak ? 'Off-peak — optimal time' : 'Medium batch — acceptable' };
}
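When a large batch is deferred, it helps to know how long to wait. A minimal sketch, assuming the window runs from 02:00 up to (but not including) 08:00 UTC as the scheduling message states; the function and constant names are hypothetical:

```typescript
const OFF_PEAK_START_UTC = 2; // assumed window start (hour, UTC)
const OFF_PEAK_END_UTC = 8;   // assumed window end (hour, UTC, exclusive)

// Milliseconds until the next off-peak window opens; 0 if already inside it.
function msUntilOffPeak(now: Date): number {
  const hour = now.getUTCHours();
  if (hour >= OFF_PEAK_START_UTC && hour < OFF_PEAK_END_UTC) {
    return 0;
  }
  const next = new Date(now.getTime());
  next.setUTCHours(OFF_PEAK_START_UTC, 0, 0, 0);
  if (next.getTime() <= now.getTime()) {
    // Today's window start has already passed; use tomorrow's.
    next.setUTCDate(next.getUTCDate() + 1);
  }
  return next.getTime() - now.getTime();
}

const waitMs = msUntilOffPeak(new Date('2024-01-01T12:00:00Z'));
console.log(`Wait ${waitMs / 3_600_000} hours`); // Wait 14 hours
```

The returned delay could be passed to `setTimeout` or to whatever job scheduler triggers the import.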
| Issue | Cause | Solution |
|---|---|---|
| Table stuck processing | Provider rate limit hit | Wait for reset or reduce concurrency |
| Slow enrichment (>10s/row) | Deep waterfall (5+ providers) | Reduce to 2-3 providers |
| Low hit rate (<40%) | Bad input data | Pre-validate and filter before import |
| Credits burning with no results | No conditional run rules | Add "Only run if" conditions to columns |
| Enrichment re-runs on edit | Table auto-update triggered | Turn off auto-update during bulk edits |
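To check for the low-hit-rate case in the table, the share of non-empty results can be computed from exported rows. This is a rough sketch: the `Work Email` column name and the row shape are assumptions about how the export looks.

```typescript
type ExportedRow = Record<string, string | undefined>;

// Share of rows where the given column came back non-empty (0 for an empty table).
function hitRate(rows: ExportedRow[], column: string): number {
  if (rows.length === 0) return 0;
  const hits = rows.filter((r) => (r[column] ?? '').trim() !== '').length;
  return hits / rows.length;
}

// Made-up sample export for illustration.
const exported: ExportedRow[] = [
  { 'Work Email': 'ada@acme.com' },
  { 'Work Email': '' },
  { 'Work Email': undefined },
  {},
];
const rate = hitRate(exported, 'Work Email');
if (rate < 0.4) {
  console.log(`Hit rate ${(rate * 100).toFixed(0)}% is below 40%: review input data quality`);
}
```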
For cost optimization, see clay-cost-tuning.