Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
By Borda
Automate end-to-end ML performance investigations: research SOTA papers and architectures, generate phased plans, judge experimental methodologies, profile bottlenecks, run metric-improvement campaigns with atomic git commits, auto-rollback on regressions, and leverage specialist agents for data lifecycle and deep paper analysis.
npx claudepluginhub borda/ai-rig --plugin researchAI/ML researcher for deep paper analysis, hypothesis generation, and experiment design. Use ONLY when the task is rooted in a research paper, ML hypothesis, or experiment — understanding a paper's method, implementing it from a publication, generating testable hypotheses, designing ablations, and validating ML results. NOT for general Python implementation unrelated to a paper (use sw-engineer), NOT for broad SOTA surveys (use /research skill), NOT for fetching library docs or web content (use web-explorer), NOT for dataset acquisition, completeness verification, split validation, or data leakage detection — those belong to data-steward; researcher owns hypothesis generation, experiment design, and implementing methods from papers.
Data lifecycle specialist — acquisition, management, validation, and ML pipeline integrity. Use for collecting datasets from external sources (delegates to foundry:web-explorer for web scraping/search), ensuring data completeness from paginated APIs, versioning datasets (DVC), tracking data lineage, auditing train/val/test splits, detecting data leakage, verifying augmentation pipelines, and configuring DataLoaders. Bridges research:scientist (data needs) and foundry:web-explorer (data fetching). NOT for ML experiment design, hypothesis generation, or implementing methods from research papers (use research:scientist) — data-steward owns data acquisition, pipeline integrity, and split/leakage validation. NOT for DataLoader throughput optimization (use foundry:perf-optimizer), NOT for fetching library docs or API references (use foundry:web-explorer directly).
Research-supervisor review of program.md — validates experimental methodology (hypothesis clarity, measurement validity, control adequacy, scope, strategy fit) and emits APPROVED / NEEDS-REVISION / BLOCKED verdict before the expensive run loop.
Interactive wizard that scans the codebase, proposes a metric/guard/agent config, and writes a program.md run spec. Also runs cProfile on a file path to surface bottlenecks before prompting for optimization goal.
Sustained metric-improvement loop with atomic commits, auto-rollback, and experiment logging. Iterates with specialist agents, commits atomically, auto-rolls back on regression. Accepts a program.md file path. Supports --resume, --team, --colab, --codex, --researcher, --architect, --journal, --hypothesis.
Non-interactive end-to-end pipeline — auto-configure program.md (accept defaults), run judge+refine loop (up to 3 iterations), then run the campaign. Single command from goal to result.
Research State of the Art (SOTA) literature for an Artificial Intelligence / Machine Learning (AI/ML) topic, method, or architecture. Finds relevant papers, builds a comparison table, recommends the best implementation strategy for the current codebase, and optionally produces a phased implementation plan mapped to the codebase. Delegates deep analysis to the research:scientist agent and codebase mapping to foundry:solution-architect.
Uses power tools
Uses Bash, Write, or Edit tools
Share bugs, ideas, or general feedback.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
ML engineering plugin: Give your AI coding agent ML engineering superpowers.
OSS Claude Code config: agents, skills, and hooks for professional AI-assisted development workflows
Set up ML experiment tracking
Autonomous research loops with 10 commands. Generalizes Karpathy's autoresearch loop to any domain with mechanical evaluation, overnight persistence, and zero dependencies.
Agent Skills for AI/ML tasks including dataset creation, model training, evaluation, and research paper publishing on Hugging Face Hub
Evaluate and compare ML model performance metrics
Codebase structural index — scan Python projects once, query the import graph for blast-radius and coupling before touching code — Python projects only
Implementation workflow skills: feature, fix, refactor, plan, debug
OSS Claude Code config: agents, skills, and hooks for professional AI-assisted development workflows
OSS maintenance skills: analyse, resolve, review, release
Personal AI coding assistant configuration for Python/ML OSS development. Version-controlled, opinionated, continuously improved.
Managing AI coding workflows for Python/ML OSS is complex — you need domain-aware agents, not generic chat. This config packages 12 specialist agents and 20+ slash-command skill workflows across four focused plugins, in a version-controlled, continuously benchmarked setup optimized for:
[!NOTE] What this adds over vanilla Claude Code: With defaults, Claude reviews code as a generalist. With this config, it reviews as 6 specialists in parallel, with a Codex pre-pass for unbiased coverage, file-based handoff to prevent context flooding, automatic lint-on-save, and token compression via RTK — all orchestrated by slash commands that chain into complete workflows.
# Install Claude Code
npm install -g @anthropic-ai/claude-code
# 1. Clone (run from the directory that will CONTAIN the clone)
git clone https://github.com/Borda/AI-Rig Borda-AI-Rig
# 2. Register as a local marketplace
claude plugin marketplace add ./Borda-AI-Rig
# 3. Install all four plugins
claude plugin install foundry@borda-ai-rig # base agents + audit, manage, calibrate, brainstorm, …
claude plugin install oss@borda-ai-rig # OSS workflow: analyse, review, resolve, release
claude plugin install develop@borda-ai-rig # development: feature, fix, refactor, plan, debug
claude plugin install research@borda-ai-rig # ML research: topic, plan, judge, run, sweep
[!NOTE] Safe to install alongside any existing Claude Code setup. Plugins live in a private cache (
~/.claude/plugins/cache/<plugin>/) under their own namespace. Your existing~/.claude/agents/,~/.claude/skills/, andsettings.jsonare never modified or overwritten — custom agents and skills you have created remain fully independent. See the Claude Code plugin reference for details.
4. One-time settings merge — run inside Claude Code:
/foundry:init link
link symlinks foundry agents and skills into ~/.claude/ so you can type /audit, /manage, /brainstorm, etc. without a foundry: prefix. OSS, develop, and research skills always use their plugin prefix (/oss:review, /develop:fix, /research:run). Safe to re-run.
[!IMPORTANT] Codex CLI — optional companion; the plugins install Claude Code agents and skills only:
npm install -g @openai/codex cp -r Borda-AI-Rig/.codex/ ~/.codex/ # Codex agents and profiles
→ See Token Savings (RTK) for RTK install details.
A typical maintainer morning — 15 new issues, 3 PRs waiting, a release due:
# 1. Morning triage — what needs attention?
/oss:analyse health # repo overview, duplicate issue clustering, stale PR detection
# 2. Review incoming PRs
/oss:review 55 --reply # 7-agent review + welcoming contributor comment
# — or: full review first, then apply every finding in one automated pass
/oss:review 21 # 7-agent review → saved findings report
/oss:resolve 21 report # Codex reads the report and applies every comment
# 3. Fix the critical bug from overnight
/oss:analyse 42 # understand the issue
/develop:fix 42 # reproduce → regression test → minimal fix → quality stack
# 4. Ship the release
/oss:release prepare v2.1.0 # changelog, notes, migration guide, readiness audit