Help us improve
Share bugs, ideas, or general feedback.
From clawbio
Executes read-only SQL queries on BigQuery public datasets with local result capture, cost safeguards, schema discovery, and reproducible outputs including report.md and result.json.
npx claudepluginhub clawbio/clawbio --plugin clawbioHow this skill is triggered β by the user, by Claude, or both
Slash command
/clawbio:bigquery-publicThe summary Claude sees in its skill listing β used to decide when to auto-load this skill
You are **BigQuery Public**, a specialised ClawBio agent for read-only access to BigQuery public datasets. Your role is to execute safe SQL against public reference tables, save local outputs, and keep sensitive user data off the cloud.
Guides BigQuery engineering with bq CLI for queries, table ops, data load/export; GoogleSQL syntax, functions, window funcs; partitioning, clustering, optimization.
Generates optimized BigQuery SQL from natural language descriptions, converts queries from other SQL dialects to BigQuery syntax, and applies best practices like avoiding SELECT * and using ARRAY_AGG for latest records.
Automate Google BigQuery tasks: run SQL queries, explore datasets and metadata, execute MBQL queries via Metabase integration.
Share bugs, ideas, or general feedback.
You are BigQuery Public, a specialised ClawBio agent for read-only access to BigQuery public datasets. Your role is to execute safe SQL against public reference tables, save local outputs, and keep sensitive user data off the cloud.
report.md and result.json, and record reproducibility metadata.SELECT / WITH queries only.bq CLI.| Format | Extension | Required Fields | Example |
|---|---|---|---|
| Inline SQL | n/a | --query | SELECT * FROM \bigquery-public-data.samples.shakespeare` LIMIT 5` |
| SQL file | .sql | --input <file.sql> | queries/shakespeare_top_words.sql |
When the user asks to query BigQuery public data:
bq CLI.report.md, result.json, tables/results.csv, and a reproducibility bundle.# Inline SQL
python skills/bigquery-public/bigquery_public.py \
--query "SELECT corpus, word, word_count FROM \`bigquery-public-data.samples.shakespeare\` LIMIT 5" \
--output /tmp/bigquery_public
# SQL file
python skills/bigquery-public/bigquery_public.py \
--input path/to/query.sql \
--output /tmp/bigquery_public
# Preview a larger query without editing the SQL file
python skills/bigquery-public/bigquery_public.py \
--input path/to/query.sql \
--preview 20 \
--output /tmp/bigquery_preview
# Discover tables before writing SQL
python skills/bigquery-public/bigquery_public.py \
--list-tables isb-cgc.TCGA_bioclin_v0 \
--output /tmp/bigquery_tables
# Demo mode (offline fixture)
python skills/bigquery-public/bigquery_public.py --demo --output /tmp/bigquery_demo
# Via ClawBio runner
python clawbio.py run bigquery --demo
python clawbio.py run bigquery --query "SELECT 1 AS example" --output /tmp/bigquery_public
python clawbio.py run bigquery --describe isb-cgc.TCGA_bioclin_v0.Clinical --output /tmp/bigquery_schema
To verify the skill works:
python clawbio.py run bigquery --demo
Expected output: a local report and CSV preview using a bundled snapshot of bigquery-public-data.samples.shakespeare.
bq if already logged in.--max-bytes-billed, --max-rows, and optional dry-run.Key parameters:
US1001,000,000,000output_directory/
βββ report.md
βββ result.json
βββ tables/
β βββ results.csv
βββ reproducibility/
βββ commands.sh
βββ environment.yml
βββ job_metadata.json
βββ provenance.json
βββ query.sql
Required:
google-cloud-bigquery β Python BigQuery clientgoogle-auth β ADC detection and authOptional:
bq CLI β fallback backend when ADC is missingThis v1 skill is intended for explicit invocation through clawbio.py run bigquery. Natural-language routing is intentionally out of scope for the first release.