Assesses ML pipeline stage and applies patterns for data pipelines, model training, serving, MLOps, evaluation, and debugging with validations like schema checks, drift detection, and skew guards.
npx claudepluginhub gadaalabs/claude-code-on-steroidsThis skill uses the workspace's default tool permissions.
**GRADIENT** — *In ML, the gradient is the directional signal that tells you exactly how to improve.*
Creates isolated Git worktrees for feature branches with prioritized directory selection, gitignore safety checks, auto project setup for Node/Python/Rust/Go, and baseline verification.
Executes implementation plans in current session by dispatching fresh subagents per independent task, with two-stage reviews: spec compliance then code quality.
Dispatches parallel agents to independently tackle 2+ tasks like separate test failures or subsystems without shared state or dependencies.
GRADIENT — In ML, the gradient is the directional signal that tells you exactly how to improve. When invoked: assesses pipeline stage (data / training / serving / MLOps), loads the relevant pattern file, and applies ML-specific validation — schema checks, drift detection, training-serving skew guards, latency budgets.
Core principle: ML systems have unique failure modes — data drift, training-serving skew, silent degradation. Test data and models, not just code.
Announce at start: "Running GRADIENT for ML-specific patterns."
STAGE ASSESSMENT:
"What stage are you at?"
A) Data collection / ingestion
B) Feature engineering / preprocessing
C) Model training / experimentation
D) Model evaluation / validation
E) Model serving / inference
F) Production monitoring / MLOps
G) Debugging an ML failure
Stage → Section mapping:
hunter first, then return with evidenceAfter identifying stage, ask: "What's the primary constraint — accuracy, latency, cost, or reliability?"
Load patterns: patterns/data-pipeline.md
Key tests to implement:
Rule: Write pipeline tests before writing model code.
Load patterns: patterns/model-training.md
Before training complex models:
Load patterns: patterns/model-serving.md
Tests required before deployment:
Load patterns: patterns/mlops.md
Required components:
| Task | Metrics |
|---|---|
| Classification | accuracy, precision, recall, F1, AUC-ROC, AUC-PR |
| Regression | MAE, MSE, RMSE, R², MAPE |
| Ranking | NDCG, MAP, MRR |
Cost/latency budgets (set before training, enforce in CI):
Never:
Always:
| Skill | Integration |
|---|---|
forge | Write data tests before pipeline, model tests before training |
hunter | Use for training failures, accuracy drops |
sentinel | Verify metrics before claiming model works |
chronicle | Store patterns from failed experiments |