From data-preprocessing-pipeline
Process automate data cleaning, transformation, and validation for ML tasks. Use when requesting "preprocess data", "clean data", "ETL pipeline", or "data transformation". Trigger with relevant phrases based on skill purpose.
How this skill is triggered — by the user, by Claude, or both
Slash command
/data-preprocessing-pipeline:preprocessing-data-with-automated-pipelinesThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Construct and execute automated data preprocessing pipelines for cleaning, transforming, and validating ML-ready datasets.
Construct and execute automated data preprocessing pipelines for cleaning, transforming, and validating ML-ready datasets.
construct and execute automated data preprocessing pipelines, ensuring data quality and readiness for machine learning. It streamlines the data preparation process by automating common tasks such as data cleaning, transformation, and validation.
This skill activates when you need to:
User request: "Preprocess the customer data from the CSV file to remove duplicates and handle missing values."
The skill will:
User request: "Create an ETL pipeline to transform the sensor data from the database into a format suitable for time series analysis."
The skill will:
This skill can be integrated with other Claude Code skills for data analysis, model training, and deployment. It provides a standardized way to prepare data for these tasks, ensuring consistency and reliability.
The skill produces structured output relevant to the task.
npx claudepluginhub flight505/skill-forge --plugin data-preprocessing-pipelineSearches MemPalace before answering questions about past work, people, projects, or prior decisions. Returns verbatim stored content instead of guessing from model memory.
Guides Payload CMS config (payload.config.ts), collections, fields, hooks, access control, APIs. Debugs validation errors, security, relationships, queries, transactions, hook behavior.
Implements vector databases with Pinecone, Weaviate, Qdrant, Milvus, pgvector for semantic search, RAG, recommendations, and similarity systems. Optimizes embeddings, indexing, and hybrid search.