Karpathy's autoresearch as a Claude Code plugin — autonomous fixed-budget optimization for ML training, code performance, prompt engineering, and more. Apple Silicon (MLX), NVIDIA CUDA, and RunPod cloud. Optimized for Claude Max subscriptions.
npx claudepluginhub flight505/autoresearchStart an autoresearch experiment loop — pre-flight checks, reads program.md, enters autonomous keep-or-revert loop. Specify target: local, server, or auto-detect.
Configure autoresearch hardware targets — Apple Silicon (MLX), NVIDIA server (CUDA), or RunPod cloud GPU. One-time setup stored persistently.
View autoresearch experiment results — shows results.tsv, keep/discard summary, and best metric. Supports local and remote server targets.
Karpathy's autoresearch as a Claude Code plugin. Run autonomous fixed-budget experiments overnight using your Claude Max subscription — zero per-token billing.
The autoresearch pattern: an AI agent iteratively edits one file, runs a fixed-budget experiment (typically 5 minutes), measures a single scalar metric, and keeps the change if it improved. Repeat forever. The agent runs 50-100 experiments overnight while you sleep.
This plugin adds:
| Command | What it does |
|---|---|
/autoresearch:setup | One-time hardware configuration (local Mac, remote server, RunPod) |
/autoresearch:run | Pre-flight checks + start the experiment loop |
/autoresearch:status | View results.tsv and summarize progress |
/autoresearch:advisor | Analyze any project for autoresearch opportunities |
The advisor works in any project — not just ML training. It identifies code performance, pipeline throughput, prompt engineering, build speed, and other optimization targets where the autoresearch pattern applies.
# Add the marketplace (if you haven't already)
claude plugin marketplace add flight505/flight505-marketplace
# Install the plugin
claude plugin install autoresearch@flight505-plugins
# 1. Configure your hardware targets (one-time)
/autoresearch:setup
# 2. cd into your autoresearch repo
cd ~/autoresearch
# 3. Start an experiment loop
/autoresearch:run
# 4. Check results anytime
/autoresearch:status
cd ~/autoresearch
claude --dangerously-skip-permissions \
-p "/autoresearch:run overnight, aim for 50+ experiments"
The --dangerously-skip-permissions flag enables fully autonomous operation. Use only in the autoresearch repo.
The plugin supports three hardware targets. Configure them once with /autoresearch:setup.
For MacBook M-series. Uses MLX — no PyTorch or CUDA needed. Best for quick daytime iteration.
For any NVIDIA GPU accessible via SSH. The recommended repo depends on your GPU:
| GPU | Recommended Repo |
|---|---|
| Consumer (RTX 20/30/40/50 series) | flight505/autoresearch-blackwell |
| Datacenter (H100, A100) | karpathy/autoresearch |
The Blackwell fork works on all consumer NVIDIA GPUs (Turing through Blackwell). Key features:
torch.compile enabled by default (Linux + Triton)--smoke-test flag for quick 10-second validationNo hardware? Rent GPUs on demand with RunPod. Cloud provisioning is coming in a future update — for now, /autoresearch:setup stores your API key so you're ready when it launches.
The advisor skill (/autoresearch:advisor) identifies optimization targets in any project. The pattern works wherever you have:
Examples: API latency, build duration, bundle size, query execution time, prompt accuracy, pipeline throughput, inference speed.
Claude Code authenticates via OAuth with your Claude.ai account. With a Max plan, usage is billed against your subscription's included quota — not per-token API billing. An overnight run of 50-100 five-minute experiments is completely practical on the flat monthly fee.
MIT
Qiushi Skill: methodology skills for AI agents guided by seeking truth from facts, with Claude Code, Cursor, OpenClaw, Codex, OpenCode, and Hermes guidance.
Ultra-compressed communication mode. Cuts ~75% of tokens while keeping full technical accuracy by speaking like a caveman.
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Standalone image generation plugin using Nano Banana MCP server. Generates and edits images, icons, diagrams, patterns, and visual assets via Gemini image models. No Gemini CLI dependency required.