Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
By ambushdata
Complete collection of battle-tested Claude Code configs from an Anthropic hackathon winner - agents, skills, hooks, commands, and rules evolved over 10+ months of intensive daily use
npx claudepluginhub joshuarweaver/cascade-code-general-misc-1 --plugin ambushdata-ai-setupIncrementally fix Python, PySpark, and pipeline errors:
Create or verify a checkpoint in your data pipeline development workflow.
Comprehensive security and quality review of uncommitted data pipeline changes:
Generate and run end-to-end pipeline tests. Creates complete pipeline test scenarios, validates data flow through components, and verifies output quality.
Manage eval-driven development workflow for data pipelines.
Atlantis framework architecture specialist. Reviews Blackboard pattern implementation, component design, SCD patterns, metadata model structure, and monorepo organization. Ensures adherence to Atlantis principles and Azure Synapse best practices.
Data engineering code review specialist. Reviews pipeline code, transformations, SQL, and data processing logic for correctness, readability, performance, and data quality. Use after writing or modifying data pipelines.
Pandera data quality specialist. Authors schema-based validation using Pandera, integrates with Atlantis framework, validates Pydantic metadata models, and ensures data quality throughout Azure Synapse pipelines.
Dependency analysis specialist. Maps data lineage, identifies downstream consumers, and warns about breaking changes before they happen.
Atlantis framework developer. Writes composable ETL building blocks using OOP patterns, Blackboard architecture, Pydantic metadata models, and Azure Synapse. Implements SCD2/SCD1 patterns and reusable pipeline components.
Backend architecture patterns, API design, database optimization, and server-side best practices for Node.js, Express, and Next.js API routes.
ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.
Universal coding standards, best practices, and patterns for TypeScript, JavaScript, React, and Node.js development.
Automatically extract reusable patterns from Claude Code sessions and save them as learned skills for future use.
A formal evaluation framework for Claude Code sessions, implementing eval-driven development (EDD) principles.
Uses power tools
Uses Bash, Write, or Edit tools
Share bugs, ideas, or general feedback.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Data engineering agents providing expertise in ETL pipelines, streaming, and data warehousing
Data engineering and ETL tools. Includes 3 specialized agents, 4 commands, and 19 skills.
Claude Code skill pack for Databricks (24 skills)
Agents for data engineering, machine learning, and AI development
Data engineering plugin - warehouse exploration, pipeline authoring, Airflow integration
Editorial "Data Engineering" bundle for Claude Code from Antigravity Awesome Skills.
No model invocation
Executes directly as bash, bypassing the AI model
No model invocation
Executes directly as bash, bypassing the AI model
Share bugs, ideas, or general feedback.
AI assistant framework for Atlantis - OOP-based composable ETL framework with Blackboard pattern, Pydantic metadata, and Pandera data quality.
Specialized agents for building reusable Atlantis components (SCD2Reader, SCD1Writer, etc.), Azure Synapse pipelines, Pydantic metadata models, and Pandera schema validation.
The Atlantis framework is an object-oriented approach to ETL development:
from atlantis import AtlantisPipeline, Blackboard
from atlantis.components import SCD2Reader, CustomerProcessor, SCD1Writer
# Components don't know about each other - only the blackboard
pipeline = AtlantisPipeline(
config=process_metadata, # Pydantic ProcessMetadata model
components=[
SCD2Reader(source_metadata),
CustomerProcessor(transformation_rules),
SCD1Writer(target_metadata)
]
)
result = pipeline.execute() # Components execute via blackboard
This repository has been transformed from a general software development framework to a specialized data engineering AI framework. See DATA_ENGINEERING_TRANSFORMATION.md for full details.
Tier 1 (Must-Have): Developer, Unit Test, Code Review, Refactor & Cleaner, Schema & Contract
Tier 2 (High-Leverage): Data Quality, SQL Optimization, Dependency Impact, Documentation
Tier 3 (Advanced): Architecture Guard, Security Scan, Metadata
See DATA_ENGINEERING_AGENTS.md for complete agent documentation.
Install Prerequisites
Clone and Install
git clone https://github.com/yourusername/ai-data-engineering.git
cd ai-data-engineering
# Install dependencies
pip install -r requirements.txt
# For development (includes pytest, black, flake8)
pip install -r requirements-dev.txt
python scripts/setup_package_manager.py
# Write a data pipeline
@developer.md Write a PySpark pipeline to aggregate customer orders
# Create tests
@unit-test.md Create pytest tests for the aggregation function
# Review code
@code-reviewer.md Review this pipeline for idempotency and partitioning
See QUICKSTART.md for detailed setup instructions including:
This framework fully supports Windows, macOS, and Linux. All hooks and scripts are written in Python for maximum compatibility.
The system automatically detects your preferred Python package manager (pip, poetry, pipenv, or uv) with the following priority:
COPILOT_PACKAGE_MANAGER.github-copilot/package-manager.json~/.github-copilot/package-manager.jsonTo set your preferred package manager:
# Via environment variable
export COPILOT_PACKAGE_MANAGER=poetry
# Via setup script - global
python scripts/setup_package_manager.py --global poetry
# Via setup script - project
python scripts/setup_package_manager.py --project poetry
# Detect current setting
python scripts/setup_package_manager.py --detect
Or use the /setup-pm command in VS Code.
This repo is a VS Code GitHub Copilot configuration - use it directly or copy components as needed.