Query local or cloud (S3/GCS) Parquet, CSV, JSON, Arrow, and Avro files with SQL using DataFusion in Claude Code sessions. Register persistent external tables, create and refresh materialized views, visualize and optimize execution plans, inspect schemas, and search DataFusion documentation.
npx claudepluginhub datafusion-contrib/datafusion-skills --plugin datafusion-skillsRegister a data file as a persistent external table in the DataFusion session. Supports Parquet, CSV, JSON, Arrow IPC, and Avro files. Explores the schema and writes to the session state file for reuse across skills.
Search Apache DataFusion documentation, user guide, and API reference. Returns relevant documentation for a question or keyword. Searches the official DataFusion repository and website.
Visualize and analyze DataFusion query execution plans. Shows logical and physical plans, identifies performance bottlenecks, and suggests optimizations. Supports EXPLAIN and EXPLAIN ANALYZE.
Install or update datafusion-cli. Supports installation via cargo install, Homebrew, or pre-built binaries. Checks the current version and offers to upgrade if outdated.
Create and manage materialized views using DataFusion. Persist SQL query results as Parquet files for fast repeated access. Track source dependencies and refresh when data changes. Powered by datafusion-cli's COPY TO.
Run SQL queries against registered tables or ad-hoc against files using datafusion-cli. Accepts raw SQL or natural language questions. Supports Parquet, CSV, JSON, and Arrow IPC files.
Read and explore data files (Parquet, CSV, JSON, Arrow IPC, Avro) locally or from S3/GCS. Auto-detects format by extension. Uses datafusion-cli for schema inspection and data preview.
DuckDB-powered skills for Claude Code: read any data file, attach and query DuckDB databases, search DuckDB/DuckLake docs, search past session logs, and install/update DuckDB extensions.
Share bugs, ideas, or general feedback.
Agent skill for the sq CLI: SLQ and native SQL, sources and handles, output formats, inspect/diff/tbl, and per-driver notes for databases and file formats.
Blazingly fast tabular data wrangling with 51 qsv skill-based commands for CSV, TSV, Excel, JSONL, and Parquet files
Build ClickHouse tables with sub-second queries, 10x compression, and zero full table scans
Semantic SQL compiler — compile .view.yml schema definitions into dialect-specific SQL. Unix-philosophy CLI designed as a tool-use interface for LLMs.
Quick insights from dlt pipeline data. Connect to a pipeline, profile tables, plan charts, and assemble marimo dashboards.