ml-intern - Claude Code Plugin | ClaudePluginHub

Plugin

ml-intern

Name: ml-intern
Author: infiniV

ultra-instinct ML engineering intern for Claude Code. Reads papers, audits datasets, ships SFT/DPO/LoRA runs to Hugging Face. Built on the procedural knowledge from huggingface/ml-intern, wired into Claude Code's native agentic harness.

npx claudepluginhub infiniv/ultra-ml-intern --plugin ml-intern

Popularity

Stars

Above avg

Med: 0·Avg: 267

Installs

Top 5%

Med: 0·Avg: 1

What's Inside

Slash Commands (6)

Security: $ARGUMENTS is untrusted user input

/ml-audit

Audit an HF dataset — schema, sample rows, anomalies, recommended training method.

/ml-intern

Kick off the full ml-intern workflow on an ML task — research → audit dataset → architect training job → submit. Loads the ml-intern skill and dispatches the right subagents.

Security: $ARGUMENTS is untrusted user input

/ml-preflight

Pre-flight a training script before submitting it to HF Jobs — checks for the 8 expensive mistakes.

/ml-research-ultra

Deep literature crawl. 6–10 query angles, 2-hop citation graph BFS, 30–50 full-paper reads in parallel subagents, cross-paper synthesis with gap analysis.

/ml-research

Run a literature review for an ML task — finds landmark paper, crawls citation graph, extracts recipe.
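
The "2-hop citation graph BFS" that /ml-research-ultra describes can be pictured as a depth-limited breadth-first search. The following is only a minimal sketch under stated assumptions: the function name `crawl_citations`, the in-memory `graph` dictionary, and the paper IDs are all hypothetical stand-ins, and the plugin's actual crawler and data source are not shown in this listing.

```python
from collections import deque


def crawl_citations(graph, seeds, max_hops=2):
    """Breadth-first crawl that stops max_hops away from the seed papers.

    `graph` maps a paper ID to the IDs of papers that cite it. It is a
    hypothetical in-memory stand-in for a real citation API.
    Returns {paper_id: hop_distance} for every paper reached.
    """
    distance = {seed: 0 for seed in seeds}
    queue = deque(seeds)
    while queue:
        paper = queue.popleft()
        if distance[paper] == max_hops:
            continue  # do not expand papers already at the hop limit
        for citing in graph.get(paper, []):
            if citing not in distance:  # first visit in BFS is the shortest path
                distance[citing] = distance[paper] + 1
                queue.append(citing)
    return distance


# Illustrative toy graph (made-up IDs).
graph = {
    "anchor": ["a1", "a2"],
    "a1": ["b1"],
    "a2": ["b1", "b2"],
    "b1": ["c1"],  # c1 is three hops from the anchor, so it is never reached
}
reached = crawl_citations(graph, ["anchor"])
```

Tracking the hop distance per paper, rather than only a visited set, lets a caller rank the results so that papers closer to the anchor are read first.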

Agents (4)

dataset-auditor

/dataset-auditor

Dataset quality auditor for HF datasets. Use before committing to a dataset for fine-tuning. Returns schema, row counts, sample rows, distributions, anomalies (class imbalance, duplicates, missing values, format issues), and a recommended training method based on column shape. Isolates 10k+ tokens of dataset metadata + sample rows from the main thread.

ml-paper-reader

/ml-paper-reader

Single-paper deep reader. Reads ONE paper end-to-end (abstract → intro → method → experiments → results → limitations → future work) and returns a structured ~800-word digest where every factual claim is backed by a verbatim quote with §section reference. Designed for parallel fan-out from `/ml-research-ultra` — each invocation isolates 50k+ tokens of paper HTML from the main thread. Use when the orchestrator needs the full content of a paper, not just the recipe.

ml-paper-researcher

/ml-paper-researcher

ML literature crawler. Use when the main task needs a methodology-grounded recipe drawn from multiple papers — e.g., "find the best recipe for math reasoning fine-tuning", "what dataset and method does the GRPO follow-up work use", "literature review for sparse-attention long-context training". Returns a structured ≤800-word report with anchor papers, extracted recipes, citation-graph descendants, and working code-example URLs. Isolates 50k+ tokens of paper text from the main thread.

training-job-architect

/training-job-architect

Designs and reviews ML training submissions for both local execution and HF Jobs. Use after the recipe is chosen and the dataset is audited — produces a complete training script + the exact run command, sized to hardware, with all required fields (push_to_hub, hub_model_id, disable_tqdm, Trackio, timeout, package installs). Detects compute mode automatically and asks the user when both local and Jobs are viable. Catches the "model lost" / "30m timeout" / "missing flash-attn" mistakes before they cost real money.
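
The anomaly checks named in the dataset-auditor description (missing values, duplicates, class imbalance) can be illustrated with a small standard-library sketch. This is an assumption-laden illustration, not the agent's implementation: `audit_rows`, its parameters, and the report keys are hypothetical, and the rows are made-up sample data rather than a real Hub dataset.

```python
from collections import Counter


def audit_rows(rows, label_key):
    """Report missing values, exact duplicate rows, and label imbalance.

    `rows` is a list of dicts. Hypothetical helper for illustration only.
    """
    missing = Counter()
    labels = Counter()
    seen = set()
    duplicates = 0
    for row in rows:
        # Count empty or None fields per column.
        for key, value in row.items():
            if value is None or value == "":
                missing[key] += 1
        # An order-independent fingerprint detects exact duplicate rows.
        fingerprint = tuple(sorted((k, repr(v)) for k, v in row.items()))
        if fingerprint in seen:
            duplicates += 1
        else:
            seen.add(fingerprint)
        label = row.get(label_key)
        if label is not None:
            labels[label] += 1
    # Ratio of most to least frequent label; 1.0 means perfectly balanced.
    ratio = max(labels.values()) / min(labels.values()) if labels else None
    return {
        "rows": len(rows),
        "missing": dict(missing),
        "duplicates": duplicates,
        "label_counts": dict(labels),
        "imbalance_ratio": ratio,
    }


rows = [
    {"text": "good movie", "label": "pos"},
    {"text": "good movie", "label": "pos"},  # exact duplicate
    {"text": "", "label": "pos"},            # missing text
    {"text": "bad movie", "label": "neg"},
]
report = audit_rows(rows, label_key="label")
```

A real audit would stream rows from the Hub and sample rather than hold everything in memory; the sketch only shows the shape of the checks.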

Skills (2)

ml-intern

/ml-intern

Use when the user asks to fine-tune, train, evaluate, audit, or ship a machine-learning model on the Hugging Face ecosystem — SFT, DPO, GRPO, RLHF, LoRA/QLoRA, post-training, dataset auditing, paper-driven research, hf jobs submission, Trackio monitoring, push-to-Hub. Triggers include "fine-tune", "train a model", "SFT", "DPO", "GRPO", "RLHF", "post-training", "audit this dataset", "literature review for X task", "submit hf job", "find a dataset for X", "best recipe for X", "hyperparameter sweep", "OOM during training", "push to Hub". Replicates the workflow of huggingface/ml-intern inside Claude Code with zero new dependencies.

model-provenance

/model-provenance

Harvest the canonical training/inference code and papers for a specific ML model (e.g. DINOv3, SAM 2, Whisper, Qwen2-VL) and archive everything locally for accurate, grounded coding. Use when the user names a model and wants to find its real/official code, training recipe, or papers; wants to "store the model's code and papers locally", build a local reference archive for a model, or ensure future coding against a model is grounded in its actual source. Verifies which repo is canonical (not a fork/lookalike), clones it, extracts the key train/inference files, downloads paper PDFs with metadata, writes a synthesis report, and saves a persistent memory that mandates reading the archived code before writing code for that model. Triggers include "find the real code for this model", "archive the model's training/inference code and papers", "harvest DINOv3", "set up a local source-of-truth for a model".

MCP Servers (1)

huggingface

External

Stats

Version: 0.4.0

Language: Shell

Stars: 3

Copy clicks: 2

Maintenance: Excellent

License: MIT

Last Commit: Jun 9, 2026

Added: Apr 26, 2026

Available In

ultra-ml-intern (3)

Safety Signals

Caution

External network access

Connects to servers outside your machine

Uses power tools

Uses Bash, Write, or Edit tools

README

ultra-ml-intern: ML engineering intern for Claude Code

ultra-instinct ML engineering intern for Claude Code. Reads papers, audits datasets, ships SFT/DPO/LoRA runs to Hugging Face.

ultra-ml-intern is a Claude Code plugin that gives Claude the workflow of an ML engineering intern. It researches ML papers, audits Hugging Face datasets, designs fine-tuning recipes (SFT, DPO, GRPO, LoRA, QLoRA, RLHF), and submits training jobs to HF Jobs with Trackio monitoring.

The procedural knowledge comes from huggingface/ml-intern, HF's standalone Python harness around the Claude API. This repo wires the same intelligence into Claude Code, Anthropic's official agentic harness for Claude. Same model, a more capable loop, and you bring your own Claude (Max subscription or API key) instead of paying for a second harness on top.

Works in any Claude Code surface: terminal CLI, IDE extensions, and the web app.

Install

# In any Claude Code session:
/plugin marketplace add infiniV/ultra-ml-intern
/plugin install ml-intern@ultra-ml-intern

Restart Claude Code, then verify with /plugin and /agents. The slash commands (/ml-intern, /ml-research, …) keep their short names; the ultra- prefix is just the package wrapper.

What you get:

2 skills: ml-intern (the workflow) and model-provenance (archive a model's real code + papers locally)
6 slash commands: /ml-intern, /ml-research, /ml-research-ultra, /ml-audit, /ml-preflight, /ml-train
4 subagents: ml-paper-researcher, ml-paper-reader, dataset-auditor, training-job-architect
1 MCP server: Hugging Face (activates when HF_TOKEN is set)

Quickstart

> "fine-tune Qwen3-0.5B for math reasoning"

The skill activates automatically and walks the 6-step research-driven workflow:

1. Find the landmark paper for the task
2. Crawl the citation graph for recent SOTA
3. Read methodology sections (3, 4, 5) and extract the recipe
4. Validate the dataset and base model exist on Hub
5. Write a training script grounded in current TRL APIs
6. Pre-flight check → smoke test → full hf jobs run with Trackio monitoring

What it does

| You ask | It does |
| --- | --- |
| "fine-tune X for Y" | Full pipeline: literature review → dataset audit → training-job design → smoke test → full run |
| "what's the best recipe for X" | Dispatches the `ml-paper-researcher` subagent; returns recipe + citations |
| "do a deep literature review on X" | Runs `/ml-research-ultra`: 6–10 query angles, 2-hop citation BFS, 30–50 papers read in parallel `ml-paper-reader` subagents, gap-finding synthesis, optional local PDF/HTML archive |
| "audit dataset Y" | Dispatches the `dataset-auditor`; returns schema, anomalies, GO/NO-GO verdict |
| "preflight train.py" | Catches missing `push_to_hub`, default 30m timeout, bf16 on T4, missing flash-attn install, before you spend cluster hours |
| "submit hf jobs run" | Walks pre-flight → cost estimate → smoke test → full submission → Trackio dashboard URL |

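The pre-flight row above names four concrete mistakes: missing `push_to_hub`, the default 30m timeout, bf16 on T4, and a missing flash-attn install. A rough sketch of how such checks could work on a script's text follows. Everything here is an assumption for illustration: the `preflight` function, its parameters, the string patterns it searches for, and the hardware names in the usage lines are hypothetical, and the plugin's real checks (eight in total, per its description) are not reproduced.

```python
import re


def preflight(script_text, hardware, timeout=None, packages=()):
    """Return a list of warnings for a training script and its job settings.

    Hypothetical helper: `timeout=None` stands for "no timeout was passed,
    so the default applies"; `packages` lists the extra packages to install.
    """
    warnings = []
    if not re.search(r"push_to_hub\s*=\s*True", script_text):
        warnings.append("push_to_hub is not enabled")
    if "hub_model_id" not in script_text:
        warnings.append("hub_model_id is not set")
    if timeout is None:
        warnings.append("no timeout set; the default 30m limit applies")
    # T4 GPUs lack native bf16 support, so flag the combination.
    if re.search(r"bf16\s*=\s*True", script_text) and "t4" in hardware.lower():
        warnings.append("bf16 requested on T4 hardware")
    # Assumes the script selects flash attention via this string.
    if "flash_attention_2" in script_text and "flash-attn" not in packages:
        warnings.append("flash-attn is used but not installed")
    return warnings


risky_script = (
    'config = SFTConfig(bf16=True, output_dir="out")\n'
    'model_kwargs = {"attn_implementation": "flash_attention_2"}\n'
)
issues = preflight(risky_script, hardware="t4-small")

safe_script = 'config = SFTConfig(push_to_hub=True, hub_model_id="user/model")\n'
clean = preflight(safe_script, hardware="a10g-small", timeout="2h")
```

Plain text matching is deliberately naive: it misses values set through variables or config files, so a stricter version might parse the script with `ast` instead.
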
Skills

| Skill | What it does |
| --- | --- |
| ml-intern | The end-to-end ML workflow: find landmark papers, crawl the citation graph, extract the recipe, audit the dataset and base model on Hub, write a TRL-grounded training script, pre-flight, smoke-test, and ship a full hf jobs run with Trackio monitoring. Activates whenever you ask to fine-tune, train, evaluate, or audit a model. |
| model-provenance | Given a specific model (DINOv3, SAM 2, Whisper, Qwen2-VL…), finds and verifies the canonical repo over forks and lookalikes, clones it, extracts the real train/model/inference files, downloads the paper PDFs with metadata, writes a synthesis report, and archives everything to research/models/<slug>/. Registers a mandatory-read memory so future coding against that model is grounded in its actual source, not training-time recall. Cloned code is archived, never executed. |

Commands

View full README on GitHub
