From experiment-tracker
Compares multiple ML experiment runs side-by-side using experiment records, building comparison tables, analyzing parameter sensitivity, generating visualizations, and identifying the best configuration.
How this command is triggered — by the user, by Claude, or both
Slash command
/experiment-tracker:compareThe summary Claude sees in its command listing — used to decide when to auto-load this command
Compare multiple ML experiment runs side-by-side to identify the best configuration. ## Steps 1. Load experiment records from the tracking store. 2. Select experiments to compare: 3. Build a comparison table: 4. Analyze parameter sensitivity: 5. Generate visualizations: 6. Identify the winning configuration: 7. Recommend next experiments to try. ## Format ## Rules - Only compare experiments with the same dataset version. - Use consistent metrics across all compared runs. - Statistical significance matters; do not draw conclusions from single runs.
Compare multiple ML experiment runs side-by-side to identify the best configuration.
Comparison: <N> experiments
Best Run: <experiment name>
Key Findings:
- <parameter X> has <impact> on <metric Y>
2plugins reuse this command
First indexed Mar 30, 2026
npx claudepluginhub costrict-plugins-repo/github-trending-plan --plugin experiment-tracker/compareCompares multiple ML experiment runs side-by-side using experiment records, building comparison tables, analyzing parameter sensitivity, generating visualizations, and identifying the best configuration.
/compare-modelsCompares multiple ML models on a shared dataset, computing accuracy, latency, memory, and cost metrics, then generates a ranked recommendation report.
/experimentDesigns an ML experiment with hypothesis, split strategy, leakage checks, and evaluation plan. Searches prior work, detects type (supervised, unsupervised, time-series, anomaly), and tailors workflow.
/reportGenerates formatted terminal report of AI experiments with stats, top tools, rating distribution, tags, recent activity, and latest details. Supports optional tool, tag, days, rating filters.