From Claude-Data-Wrangler
Audit and standardise a dataset's header row against a naming convention (snake_case, camelCase, Title Case, kebab-case) and verify consistency with an existing or forthcoming data dictionary. Use when preparing a dataset for SQL loading, publishing, or when header inconsistency (mixed casing, spaces, punctuation, abbreviation drift) is blocking downstream work.
npx claudepluginhub danielrosehill/claude-code-plugins --plugin Claude-Data-WranglerThis skill uses the workspace's default tool permissions.
Check and normalise the header row of a flat dataset so every column name follows one convention, and reconcile against a data dictionary if one exists.
Conducts multi-round deep research on GitHub repos via API and web searches, generating markdown reports with executive summaries, timelines, metrics, and Mermaid diagrams.
Share bugs, ideas, or general feedback.
Check and normalise the header row of a flat dataset so every column name follows one convention, and reconcile against a data dictionary if one exists.
snake_case — lowercase, underscores (SQL-friendly, default recommendation)camelCase — first word lowercase, subsequent capitalisedPascalCase — all words capitalised, no separatorkebab-case — lowercase, hyphens (avoid for SQL)Title Case — human-friendly, space-separatedSCREAMING_SNAKE_CASE — constants/enumsnrows=0 or CSV sniff). Do not load the full dataset unless the user asks for a rename-and-save in the same pass._/-, leading digits, reserved SQL keywords, duplicates (case-insensitive), non-ASCII characters, unit suffixes inconsistently placed (price_usd vs USDPrice).snake_case for anything destined for SQL or Parquet.data_dictionary.{md,csv,yaml,json} exists alongside the dataset:
customer_id in data vs CustomerID in dictionary).original → standardised) for user confirmation before touching the file.<name>_headers-std.csv by default); do not overwrite the source unless the user explicitly asks.update-data-dictionary skill).pip install pandas
Customer ID and customer_id both becoming customer_id) — stop and ask the user how to disambiguate; do not silently suffix with _1, _2.localization-headers.price_usd, weight_kg) — preserve unit suffixes; do not strip them.2024_revenue) — SQL-unsafe; prefix with _ or yr_ and flag.