Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
ML Workbench for Claude Code. Full ML lifecycle: search papers across 7 academic sources, discover and download datasets from 5 repositories, explore and clean data, engineer features, train models (Naive Bayes, KNN, LDA/QDA, SVM, Decision Trees, Ensembles, GLM, Gaussian Process, Neural Networks), run autonomous experiments, build AI apps with LLMs and RAG, build MCP servers, deploy models with Docker and CI/CD, detect drift, explain predictions with SHAP, generate podcasts from papers, manage notebooks, extract YouTube content, and learn ML interactively with 3 university-grade courses (CS229, Applied ML, ML Engineering). 11 agents, 16 skills, 3 CLI tools (mlx-exp, mlx-search, mlx-status), 1 MCP server, 3 output styles, Python LSP via pyright.
npx claudepluginhub damionrashford/mlx --plugin mlxBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Builds AI-powered applications using pre-trained models, LLM APIs, embeddings, RAG pipelines, and agent architectures. Knows the Claude Agent SDK, OpenAI Agents SDK, Vercel AI SDK, and DSPy — and fetches their live docs before scaffolding agent code. Use proactively when the user wants to build an AI application, set up a RAG system, do prompt engineering, integrate LLM APIs, build an agent with any framework, work with embeddings/vector stores, optimize prompts with DSPy, or evaluate LLM outputs.
Answers business questions with data through descriptive statistics, hypothesis testing, segmentation, trend analysis, and visualization. Use proactively when the user wants to understand what happened in the data, compare groups, find trends, create charts or dashboards, run A/B test analysis, segment customers, calculate KPIs, or build data reports for stakeholders.
Builds and maintains data pipelines, warehouses, and lakehouses. Use proactively when the user needs to build an ETL/ELT pipeline, set up dbt transformations, implement incremental loading, orchestrate workflows with Airflow or Prefect, process data at scale with Spark, Polars, or DuckDB, design a data lakehouse (Delta Lake, Iceberg, Hudi), validate data quality with Great Expectations or Soda, or set up production data infrastructure feeding ML systems. Distinct from data-scientist (exploratory modeling) and data-analyst (BI and reporting).
Full-pipeline data science agent: dataset discovery, EDA, cleaning, feature engineering, training, and evaluation. Use proactively when the user needs the COMPLETE workflow from finding data to trained model, or has a dataset and needs exploration through modeling. Always starts with data understanding.
Specializes in neural network architecture design, training dynamics, and GPU optimization. Use proactively when the user needs to design or debug a neural network architecture (CNNs, Transformers, RNNs, SSMs, diffusion models), troubleshoot loss curves or gradient pathologies (vanishing gradients, exploding gradients, dead ReLU), implement distributed training (DDP, FSDP, DeepSpeed), optimize GPU memory and throughput (mixed precision, gradient checkpointing, torch.compile), or run systematic architecture search experiments. Distinct from ml-engineer (tabular/classical ML) and ai-engineer (pre-trained model integration).
Statistical analysis, hypothesis testing, A/B testing, cohort analysis, segmentation, trend detection, business metrics, pre-delivery validation, and data visualization. Use when the user asks to "analyze this data", "run a statistical test", "compare groups", "find trends", "do A/B test analysis", "segment customers", "calculate KPIs", "validate this analysis", "check my work", "sanity check", "review my numbers", "make a chart", "create a dashboard", "plot the data", "visualize results", or mentions hypothesis testing, cohort analysis, business analytics, data validation, bar charts, line charts, heatmaps, scatter plots, or data storytelling.
Autonomous time-budget experiment loop. Modify a training script, train for a fixed wall-clock budget, evaluate, record, repeat. Inspired by karpathy/autoresearch. Use for overnight architecture search, systematic hyperparameter sweeps, or any iterative model improvement workflow.
Context engineering for building production LLM applications: context window management, degradation patterns, optimization strategies, memory system selection, multi-agent architecture, filesystem context patterns, and tool design principles. Use when building LLM apps, RAG pipelines, AI agents, multi-agent systems, or when designing memory, tool APIs, or context strategies for any language model application.
Explore, clean, and engineer datasets end-to-end: statistical profiling, distribution checks, missing value analysis, duplicate detection, outlier removal, type fixing, encoding, create features, encode categories, transform columns, add rolling windows, build interaction terms, and feature engineering. Supports pandas, polars, and PySpark. Use when the user wants to explore data, profile columns, understand a dataset, clean data, handle missing values, remove duplicates, fix data types, preprocess a dataset before modeling, create features, encode categories, transform columns, add rolling windows, build interaction terms, or do feature engineering.
Detect data drift, concept drift, and model performance degradation in production. Uses PSI, KS-test, and chi-squared for statistical drift, plus evidently and nannyml for automated reports. Use when monitoring a deployed model or comparing training vs production data distributions.
Data science and ML workflow tools. 9 agents, 8 commands, 19 skills, 9 templates for problem framing, preprocessing, validation, EDA, experimentation, review, deployment, and knowledge compounding.
Train and optimize machine learning models with automated workflows
Skills to support Machine Learning experimentation using the Python ecosystem.
ML engineering plugin: Give your AI coding agent ML engineering superpowers.
Agent skill and CLI helpers for using RivalSearchMCP research tools from Claude Code. Includes workflows, command references, and a standalone CLI for the hosted MCP server.
9 research tools: 5-engine web search, 9 social platforms, 5 news sources, 5 academic databases, GitHub, website mapping, document analysis, and research topic synthesis. No API keys required. Deterministic outputs for agent chaining.
The Media OS for Claude Code with routed modes dispatch. 96 production-grade skills + 13 modes (routed playbooks) + 7 orchestrator agents (architect, probe, qc, hdr, encoder, live, delivery) + 5 safety + audit hooks + workflow monitors + a PATH-level CLI toolbelt, covering the entire professional media stack — FFmpeg complete (transcode, filters, streaming, HDR, color, broadcast MXF/IMF, DRM, 360°, VapourSynth), professional companion CLIs (yt-dlp, MKVToolNix, GPAC, Shaka, HandBrake, MoviePy, MediaInfo, PySceneDetect, ffsubsync, ffmpeg-normalize, ImageMagick, ExifTool, SoX, GNU parallel, cloud upload), OBS Studio full stack, streaming frameworks (GStreamer, MediaMTX), broadcast IP (NDI SDK, OpenTimelineIO, HDR dynamic metadata via dovi_tool + hdr10plus_tool, Blackmagic DeckLink, gphoto2), low-level control (MIDI 1.0 + 2.0 UMP, OSC, DMX512/Art-Net/sACN, VISCA + ONVIF PTZ), system audio routing (PipeWire, JACK, Core Audio, WASAPI), VFX (USD, OpenEXR, OpenImageIO), computer vision (OpenCV, MediaPipe), WebRTC (W3C spec, Pion/mediasoup/LiveKit SFUs), and 2026 open-source AI media (Real-ESRGAN/SwinIR/HAT upscale, RIFE/FILM interpolation, rembg/BiRefNet/RVM matting, Kokoro/OpenVoice/Piper TTS, Depth-Anything/MiDaS depth, ComfyUI/FLUX-schnell/Kolors image gen, LTX-Video/CogVideoX video gen, LivePortrait/LatentSync lipsync, PaddleOCR/Tesseract 5 OCR, DeepFilterNet/RNNoise audio denoise, CLIP/SigLIP/BLIP-2/LLaVA tagging). Strict OSI-open + commercial-safe license filter on every AI model.
Critical thinking platform for decision analysis, proposal review, debate preparation, and due diligence. Three agents (advocate, adversary, judge) with a structured argument graph, sequential thought chain, and 10 reasoning traditions.
Elite research intelligence plugin powered by RivalSearchMCP. 10 research tools, 5 workflow skills, 6 specialist agents, portable hooks, zero API keys. Your AI research team.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimMatches all tools
Hooks run on every tool call, not just specific ones
Admin access level
Server config contains admin-level keywords
Executes bash commands
Hook triggers when Bash tool is used
Modifies files
Hook triggers on file write and edit operations
External network access
Connects to servers outside your machine
Uses power tools
Uses Bash, Write, or Edit tools
Share bugs, ideas, or general feedback.
ML experiment tracking with metrics logging and run comparison
Self-documenting, self-improving framework for analytical repositories