Data cleaning, enrichment, restructuring, and packaging skills for tabular and JSON datasets. Excludes data visualisation.
Add or update a CHANGELOG.md in a data repository, recording dataset versions, schema changes, row-count deltas, enrichments applied, and re-publications. Follows Keep-a-Changelog conventions adapted for datasets. Use when the user wants versioned documentation of how a dataset has evolved over time.
Create a data dictionary for a dataset (CSV, JSON, JSONL, Parquet, Excel) that documents every column/field — name, type, description, units, example values, nulls allowed, source. Use when a dataset has no accompanying documentation and the user wants one generated.
Add ISO 3166 country codes (alpha-2, alpha-3, numeric) to a dataset that references countries by name but lacks standardised codes. Use when the user has a CSV/JSON/Parquet/Excel dataset with country names and wants ISO 3166 codes added as new columns/fields.
Prepare or refactor a dataset for upload into a REST API or MCP server — mapping dataset columns to API request fields, handling batching, pagination, rate limits, authentication, idempotency, and error retries. Works from an OpenAPI spec the user provides, a user-pointed MCP tool schema, or documentation for a well-known API (Salesforce, HubSpot, Airtable, Notion, Stripe, Shopify, Pipedrive, etc.). Generates a loader script plus a dry-run preview before executing.
Convert between CSV and JSON formats — CSV to JSON array, CSV to JSONL, JSON to CSV, JSONL to CSV. Handles type inference, header/record mapping, nested structure flattening, and encoding issues. Use when the user wants to reformat tabular data between row-oriented CSV and object-oriented JSON forms.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Data cleaning, enrichment, restructuring, and packaging skills for tabular and JSON datasets. Data visualisation is out of scope (handled by a separate plugin).
| Skill | Purpose |
|---|---|
data-cleanliness-scan | Scan flat files (CSV/Parquet/JSON/Excel) and flag columns likely to fail SQL ingestion or analysis |
standardise-country-names | Normalise inconsistent country names ("USA" vs "United States of America") |
text-to-numeric | Parse formatted strings like $4.27, 1,234.56, €1.2M, (500) into numeric columns |
unicode-consistency | Detect and fix mixed Unicode normalisation, mojibake, invisible chars, confusables |
date-wrangling | Convert dates/times between ISO 8601, epoch (s/ms/µs/ns), with/without timezone, fiscal, week-date |
iso-review | Audit the dataset for fields that could be standardised to an ISO standard (3166, 4217, 639, 8601, LEI, ISIN, …) and optionally refactor |
| Skill | Purpose |
|---|---|
add-iso3166 | Add ISO 3166 country codes (alpha-2/3, numeric) to datasets referencing countries |
enrich-with-currency | Map ISO 3166 codes to ISO 4217 currency codes (plus name / symbol) |
data-enrichment | Brainstorm and rank enrichment opportunities (temporal, geo, entity, FX, embeddings, holidays …) |
| Skill | Purpose |
|---|---|
add-data-dictionary | Generate a data dictionary (Markdown / YAML / JSON / CSV) for a dataset |
update-data-dictionary | Keep an existing data dictionary in sync after schema changes |
data-dictionary-export | Export a data dictionary to a polished PDF via Typst |
data-to-document | Render a dataset (or a filtered slice) to PDF via Typst, with layout auto-chosen from data shape, selectable fields, and custom column labels |
add-changelog | Maintain a dataset-focused CHANGELOG.md (Keep-a-Changelog, SemVer-adapted) |
| Skill | Purpose |
|---|---|
csv-to-json | Bidirectional CSV ↔ JSON / JSONL conversion |
json-restructure | Reshape JSON — flatten, nest, group-by, explode arrays, promote/demote fields |
data-shape | Propose a normalised SQL schema (tables, keys, relationships) from a flat source |
data-comparability | Align multiple datasets — reconcile headers, types, vocabularies, units — for merge/union |
| Skill | Purpose |
|---|---|
pii-flag | Detect PII (names, emails, IDs, cards, coords, …) at cell-level with confidence scores |
synthetic-data-overlay | Replace PII with realistic synthetic substitutes preserving shape and referential integrity |
| Skill | Purpose |
|---|---|
database-guide | Recommend a database backend (relational / analytical / document / graph / vector / time-series) |
parquet-jsonl-package | Package a dataset as Parquet and/or JSONL with compression and partitioning |
sql-load | Load a flat dataset into SQL (Postgres / MySQL / SQLite / MSSQL / DuckDB) with schema validation |
graph-database | Reshape tabular/JSON data into nodes + edges, emit Cypher / GraphML / CSV bulk loads |
vector-upsert | Embed text fields and upsert into a vector DB (Pinecone / Qdrant / Weaviate / pgvector / Chroma / Milvus) |
hf-dataset-push | Publish a packaged dataset to Hugging Face Hub with dataset card |
api-loader | Prepare and push data into a REST API or MCP server, from an OpenAPI spec or well-known SDK |
geodata-formatter | Convert CSV / tabular geodata into GeoJSON (or NDGeoJSON) with CRS reprojection and geometry inference |
divergent-data-pipe | Build an incremental sync from a canonical upstream into a downstream project that has diverged (renames / enrichments), preserving the divergence |
Every skill follows the safety and data-layout rules in CONVENTIONS.md. Highlights:
_iso3166, _numeric, _synthetic); overwrite only on explicit user request.npx claudepluginhub danielrosehill/claude-code-plugins --plugin Claude-Data-WranglerClaude Code plugin for image editing, batch operations, format conversion, and filesystem organisation of image libraries — bucket by resolution, aspect ratio, orientation, format, EXIF capture time, camera, and dedupe duplicates / near-duplicates.
First-pass data analysis toolkit: correlations, PII flagging, anomalies, hypothesis tests, data dictionaries, and trend analysis on a dataset in a folder.
Claude Code plugin: ideation and planning workflow — capture, evaluate, rank, simulate, and plan ideas, with ideation/single-idea-eval/multi-idea-ranking/feature-ideas/simulation/idea-capture variants.
Manage a Proxmox VE host via SSH and the Proxmox API — guided first-run onboarding, VM/CT lifecycle, storage and ZFS inspection, log review, and update workflows. Per-host config is stored outside the plugin so the same install works across multiple Proxmox environments.
Personal budgeting workflow primitives — transaction logging, categorization, budget creation, spending analysis, goal tracking, and forecasting — plus a provisioning skill for a fresh budgeting workspace.
Agent Skills for AI/ML tasks including dataset creation, model training, evaluation, and research paper publishing on Hugging Face Hub
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
A growing collection of Claude-compatible academic workflow bundles. Covers scientific figures, manuscript writing and polishing, reviewer assessment, citation retrieval, data availability, paper reading, literature search, response letters, paper-to-PPTX conversion, and evidence-grounded Chinese invention patent drafting. Rules are organized as reusable skill folders with explicit workflows and quality checks.
Intelligent draw.io diagramming plugin with AI-powered diagram generation, multi-platform embedding (GitHub, Confluence, Azure DevOps, Notion, Teams, Harness), conditional formatting, live data binding, and MCP server integration for programmatic diagram creation and management.
Persistent file-based planning for AI coding agents. Crash-proof markdown plans (task_plan.md, findings.md, progress.md) that survive context loss and /clear, with an opt-in completion gate and multi-agent shared state. Manus-style. Works with Claude Code, Codex CLI, Cursor, Kiro, OpenCode and 60+ agents via the SKILL.md standard. Includes Arabic, German, Spanish, and Chinese (Simplified and Traditional).
Complete creative writing suite with 10 specialized agents covering the full writing process: research gathering, character development, story architecture, world-building, dialogue coaching, editing/review, outlining, content strategy, believability auditing, and prose style/voice analysis. Includes genre-specific guides, templates, and quality checklists.