Split datasets into training, validation, and testing sets for ML model development. Use when requesting "split dataset", "train-test split", or "data partitioning".
/plugin marketplace add jeremylongshore/claude-code-plugins-plus/plugin install dataset-splitter@claude-code-plugins-plusThis skill is limited to using the following tools:
assets/README.mdassets/dataset_schema.jsonassets/example_dataset.csvassets/split_data_config.yamlreferences/README.mdscripts/README.mdscripts/split_data.pyThis skill automates the process of dividing a dataset into subsets for training, validating, and testing machine learning models. It ensures proper data preparation and facilitates robust model evaluation.
This skill activates when you need to:
User request: "Split the data in 'my_data.csv' into 70% training, 15% validation, and 15% testing sets."
The skill will:
User request: "Create a train-test split of 'large_dataset.csv' with an 80/20 ratio."
The skill will:
This skill can be integrated with other data processing and model training tools within the Claude Code ecosystem to create a complete machine learning workflow.
Use when working with Payload CMS projects (payload.config.ts, collections, fields, hooks, access control, Payload API). Use when debugging validation errors, security issues, relationship queries, transactions, or hook behavior.