From claude-data-analyst
Scan a dataset for significant anomalies — outliers, distribution shifts, impossible values, and unusual groupings. Use when the user wants a first-pass integrity and anomaly sweep of a CSV/Parquet/Excel file before deeper analysis.
How this skill is triggered — by the user, by Claude, or both
Slash command
/claude-data-analyst:anomaly-analysisThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Identify significant anomalies in a dataset across three layers: value-level, distribution-level, and relational.
Identify significant anomalies in a dataset across three layers: value-level, distribution-level, and relational.
duckdb — percentile, z-score, and windowed queries.uv run --with pandas --with scikit-learn python -c '...' — IsolationForest and LOF for multivariate anomalies.csvstat (csvkit) — quick min/max/null sanity check.For each column:
For each numeric column:
For categorical columns:
Write <dataset>-anomalies.md:
Be specific — "17 rows have negative order_total" is useful; "there are some outliers" is not.
npx claudepluginhub danielrosehill/claude-code-plugins --plugin claude-data-analystSearches MemPalace before answering questions about past work, people, projects, or prior decisions. Returns verbatim stored content instead of guessing from model memory.
Scans the codebase for `ponytail:` comments and compiles a debt ledger of deliberate shortcuts and deferrals, flagging entries with no upgrade path.