From claude-data-analyst
Scan a dataset for significant anomalies — outliers, distribution shifts, impossible values, and unusual groupings. Use when the user wants a first-pass integrity and anomaly sweep of a CSV/Parquet/Excel file before deeper analysis.
npx claudepluginhub danielrosehill/claude-code-plugins --plugin claude-data-analystThis skill uses the workspace's default tool permissions.
Identify significant anomalies in a dataset across three layers: value-level, distribution-level, and relational.
Conducts multi-round deep research on GitHub repos via API and web searches, generating markdown reports with executive summaries, timelines, metrics, and Mermaid diagrams.
Share bugs, ideas, or general feedback.
Identify significant anomalies in a dataset across three layers: value-level, distribution-level, and relational.
duckdb — percentile, z-score, and windowed queries.uv run --with pandas --with scikit-learn python -c '...' — IsolationForest and LOF for multivariate anomalies.csvstat (csvkit) — quick min/max/null sanity check.For each column:
For each numeric column:
For categorical columns:
Write <dataset>-anomalies.md:
Be specific — "17 rows have negative order_total" is useful; "there are some outliers" is not.