Auto-discovered marketplace from datafusion-contrib/datafusion-skills
npx claudepluginhub datafusion-contrib/datafusion-skillsDataFusion-powered skills for Claude Code: query data with SQL, read Parquet/CSV/JSON/Arrow files, search DataFusion docs, create external tables, build materialized views, and visualize query plans.
Directory of popular Claude Code extensions including development tools, productivity plugins, and MCP integrations
Curated collection of 141 specialized Claude Code subagents organized into 10 focused categories
No description available.
Share bugs, ideas, or general feedback.
A Claude Code plugin that adds Apache DataFusion-powered skills for data exploration, querying, and materialized views.
Add the repository as a plugin source and install:
/plugin marketplace add datafusion-contrib/datafusion-skills
/plugin install datafusion-skills@datafusion-skills
This registers the GitHub repo as a marketplace and installs the plugin. Skills will be available as /datafusion-skills:<skill-name> in all future sessions.
/plugin marketplace update datafusion-skills
/plugin update datafusion-skills@datafusion-skills
queryRun SQL queries against registered tables or ad-hoc against files. Accepts raw SQL or natural language questions. Supports Parquet, CSV, JSON, Arrow IPC, and Avro.
/datafusion-skills:query SELECT * FROM 'trades.parquet' WHERE symbol = 'AAPL' LIMIT 10
/datafusion-skills:query "what are the top 5 symbols by volume?"
/datafusion-skills:query FROM sales WHERE amount > 100
read-fileRead and explore any data file — Parquet, CSV, JSON, Arrow IPC, Avro — locally or from S3/GCS. Auto-detects format by extension.
/datafusion-skills:read-file trades.parquet what columns does it have?
/datafusion-skills:read-file s3://my-bucket/data.parquet describe the schema
/datafusion-skills:read-file metrics.csv how many rows?
create-tableRegister a data file as a persistent external table. Explores the schema and persists the registration so all other skills can access the table automatically.
/datafusion-skills:create-table trades.parquet
/datafusion-skills:create-table data.csv --name sales --format csv
materialized-viewCreate and manage materialized views — persist SQL query results as Parquet files for fast repeated access. Track source dependencies and refresh when data changes.
/datafusion-skills:materialized-view "create a daily summary of trades grouped by symbol"
/datafusion-skills:materialized-view refresh trades_daily
/datafusion-skills:materialized-view status
/datafusion-skills:materialized-view list
explain-planVisualize and analyze query execution plans. Identifies performance bottlenecks and suggests optimizations.
/datafusion-skills:explain-plan SELECT * FROM trades WHERE date > '2024-01-01'
/datafusion-skills:explain-plan --analyze SELECT COUNT(*) FROM large_table GROUP BY category
datafusion-docsSearch Apache DataFusion documentation — user guide, SQL reference, and API docs. Returns relevant documentation for a question or keyword.
/datafusion-skills:datafusion-docs window functions
/datafusion-skills:datafusion-docs "how do I create an external table?"
/datafusion-skills:datafusion-docs APPROX_PERCENTILE_CONT
install-datafusionInstall or update datafusion-cli. Supports Homebrew, cargo install, and pre-built binaries.
/datafusion-skills:install-datafusion
/datafusion-skills:install-datafusion --update
All skills share a single state.sql file per project — a plain SQL file containing CREATE EXTERNAL TABLE statements and configuration. When state is first needed, you'll be asked where to store it:
.datafusion-skills/state.sql) — colocated with the project, optionally gitignored~/.datafusion-skills/<project>/state.sql) — keeps the repo cleanAny skill restores the session via datafusion-cli --file state.sql.
Skills reference each other where it makes sense:
read-file suggests query for follow-up exploration and create-table for persisting dataquery uses session state from create-table automaticallymaterialized-view creates persistent Parquet files registered via create-tableexplain-plan helps optimize queries from querydatafusion-docs to troubleshoot DataFusion errors automaticallyApache DataFusion is a fast, extensible query engine built in Rust on top of Apache Arrow. It offers:
# Clone the repo
git clone https://github.com/datafusion-contrib/datafusion-skills.git
cd datafusion-skills
# Launch Claude Code with the local plugin directory
claude --plugin-dir .
Test individual skills: