Help us improve
Share bugs, ideas, or general feedback.
From godmode
Designs data pipelines and ETL processes covering extraction, transformation, loading, data quality checks, orchestration, and patterns for batch, streaming, CDC, ELT. Useful for building pipelines, data flows, syncing, or moving data between systems.
npx claudepluginhub arbazkhan971/godmodeHow this skill is triggered — by the user, by Claude, or both
Slash command
/godmode:pipelineThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
- `/godmode:pipeline`, "build a data pipeline", "ETL"
Design batch and streaming data pipelines. Plan ingestion, transformation, quality checks, and failure recovery. Use when building ETL/ELT systems or data infrastructure.
Builds ETL/ELT data pipelines with extraction, transformation, loading, error handling, scheduling, and monitoring. Activates for build ETL, data pipeline, move data from X to Y, or sync data requests.
Designs scalable data pipelines for batch and streaming processing with Airflow, Prefect, dbt, Kafka, Spark, Delta Lake, and Great Expectations. Guides architecture, ingestion, orchestration, transformation, quality, and monitoring.
Share bugs, ideas, or general feedback.
/godmode:pipeline, "build a data pipeline", "ETL"ls dags/ dbt_project.yml dagster.yaml 2>/dev/null
grep -r "airflow\|dagster\|prefect\|kafka" \
requirements.txt package.json 2>/dev/null
Name: <pipeline>
Type: batch | streaming | micro-batch | CDC
Schedule: cron | event-triggered | continuous
SLA: <max latency>
Sources: <name>: <type> (<format>, <volume/day>)
Transforms: 1. <step> (input -> output)
Destinations: <target>: <type> (<write method>)
Idempotent: yes/no
Error handling: skip | fail | dead-letter | retry
IF data changes hourly: batch with cron. IF sub-second latency needed: streaming (Kafka). IF already using PostgreSQL: CDC with Debezium.
Extraction: track watermarks, retry with backoff, log metrics. Patterns: API pagination with rate limit, DB incremental by updated_at, file dedup.
Transformation: pure functions only -- no DB calls,
no side effects. Composable via .pipe().
Loading strategies:
Every pipeline needs these (not optional):
IF quality < 95%: alert and investigate. IF count change > 50%: block load and alert.
Structured logging at every stage. Metrics: duration_seconds, rows_processed/rejected, last_success, data_freshness, quality_score. Alert: failure, 2x duration, quality < 95%, no data > 2 hours.
DLQ for bad records. Retry with exponential backoff. Checkpoint and resume for large batches. Circuit breaker if source fails N times.
Append .godmode/pipeline-results.tsv:
timestamp stage source target records_in records_out rejected quality_pct status
KEEP if: pipeline runs end-to-end AND quality checks
pass AND SLA met.
DISCARD if: quality fails OR errors OR SLA exceeded.
STOP when ALL of:
- Pipeline runs end-to-end with zero errors
- Quality checks validate all stages
- SLA met
- Backfill tested
On failure: git reset --hard HEAD~1. Never pause.
| Failure | Action |
|---|---|
| Schema changed | Fail loudly, update contract |
| Duplicates | Use upsert, add dedup step |
| DLQ growing | Investigate rejection reason |
| Exceeds SLA | Profile stages, parallelize |
| Connection timeout | Retry with backoff, check pool |