From synthetic-data
Initialize a synthetic data workspace directory with standard folders, Python requirements, and venv setup.
npx claudepluginhub danielrosehill/claude-code-plugins --plugin synthetic-dataThis skill uses the workspace's default tool permissions.
Initialize a workspace for synthetic data generation tasks. This creates folder structure, a Python requirements file, and optionally a virtual environment.
Conducts multi-round deep research on GitHub repos via API and web searches, generating markdown reports with executive summaries, timelines, metrics, and Mermaid diagrams.
Share bugs, ideas, or general feedback.
Initialize a workspace for synthetic data generation tasks. This creates folder structure, a Python requirements file, and optionally a virtual environment.
./synthetic-data-workspace/ in current directory)Create folder structure:
mkdir -p ./synthetic-data-workspace/{inputs,outputs,reports,configs}
Create a README.md describing the workspace layout:
# Synthetic Data Workspace
- `inputs/` — Real source datasets (CSV, Parquet, JSON)
- `outputs/` — Generated synthetic datasets
- `reports/` — Quality evaluation reports
- `configs/` — YAML/JSON configuration files (schemas, synthesis params)
## Quick start
1. Activate venv: `source venv/bin/activate` (if created)
2. Place real data in `inputs/`
3. Define schema or config in `configs/`
4. Run synthesis skill (e.g. `tabular-from-real`)
5. Check outputs in `outputs/` and reports in `reports/`
Create requirements.txt with common tools:
sdv>=1.10.0
sdmetrics>=0.11.0
faker>=20.0.0
mimesis>=7.0.0
synthcity>=0.2.0
pandas>=2.0.0
numpy>=1.24.0
pyarrow>=12.0.0
scikit-learn>=1.3.0
umap-learn>=0.5.0
(User can edit to remove unnecessary packages.)
Check Python version:
python3 --version # Must be 3.10 or higher
Offer venv setup:
cd ./synthetic-data-workspace
python3 -m venv venv
source venv/bin/activate
pip install --upgrade pip
pip install -r ../requirements.txt
Record workspace path for other skills (e.g. in a .workspace-config file or environment variable):
echo "SYNTHETIC_DATA_WORKSPACE=$(pwd)" > .workspace-config
./synthetic-data-workspace/{inputs,outputs,reports,configs}/requirements.txt created and ready to customizeREADME.md with usage guideinputs/ and outputs/ to .gitignore)