From Claude-Data-Wrangler
Scan a dataset for columns whose values could be standardised to an ISO standard (countries → ISO 3166, currencies → ISO 4217, languages → ISO 639, dates → ISO 8601, subdivisions → ISO 3166-2, units → ISO 80000, MIME → IANA, etc.). Reports non-compliance, proposes a canonical form, and optionally refactors existing values to the standard. Use when the user wants to audit a dataset for standards-compliance or bring ad-hoc values in line with formal standards.
npx claudepluginhub danielrosehill/claude-code-plugins --plugin Claude-Data-WranglerThis skill uses the workspace's default tool permissions.
Audit a dataset for ISO-standardisable fields and optionally refactor to the standard.
Conducts multi-round deep research on GitHub repos via API and web searches, generating markdown reports with executive summaries, timelines, metrics, and Mermaid diagrams.
Share bugs, ideas, or general feedback.
Audit a dataset for ISO-standardisable fields and optionally refactor to the standard.
| Standard | Scope | Detection heuristics |
|---|---|---|
| ISO 3166-1 (countries) | alpha-2 (US), alpha-3 (USA), numeric (840) | Column names country, nation, country_code; values matching country names or 2/3-letter codes |
| ISO 3166-2 (subdivisions) | e.g. US-CA, GB-ENG | State / province / region columns |
| ISO 3166-3 (formerly-used) | Historical country codes | USSR, DDR, YUG values |
| ISO 4217 (currencies) | USD, EUR, JPY | Currency columns, symbols in amounts |
| ISO 639 (languages) | 639-1 (en), 639-2/3 (eng) | Language columns, free-text language labels |
| ISO 8601 (dates/times) | YYYY-MM-DD, YYYY-MM-DDTHH:MM:SSZ, durations | Date-like columns in non-ISO formats |
| ISO 80000 / SI units | SI units and prefixes | Unit columns, mixed unit systems |
| ISO 3166-1 alpha-2 in emails / URLs (TLDs) | — | Domain / email columns with ccTLDs |
| IANA time zones (not ISO but standard) | Europe/London | Timezone columns |
| IANA media types / MIME | application/json | Content-type / format columns |
| ISO 10962 (CFI) | Financial instrument classification | Instrument columns |
| ISO 6166 (ISIN) | Securities identifiers | Security ID columns; checksum validation |
| ISO 17442 (LEI) | Legal Entity Identifier | Entity ID columns; 20-char format |
pycountry, iso4217, langcodes, etc.).01/02/2024 that could be UK or US ordering).iso_review.md:
standardise-country-names, add-iso3166, enrich-with-currency.pip install pandas pycountry langcodes iso4217 python-stdnum python-dateutil babel
country column mixing alpha-2 and full names). Report and pick a target form with the user.01/02/2024) — never guess. Require user confirmation; prefer source metadata or data dictionary for the answer.