Search everything...

Plugin

autoresearch

Karpathy's autoresearch as a Claude Code plugin — autonomous fixed-budget optimization for ML training, code performance, prompt engineering, and more. Apple Silicon (MLX), NVIDIA CUDA, and RunPod cloud. Optimized for Claude Max subscriptions.

Component Overview

/run, /setup +1

Commands

Agents

advisor

Skills

SessionStart

Hooks

MCP Servers

LSP Servers

Output Styles

Install

npx claudepluginhub flight505/autoresearch

Component Details

Commands (3)

Autoresearch Run

/run

Start an autoresearch experiment loop — pre-flight checks, reads program.md, enters autonomous keep-or-revert loop. Specify target: local, server, or auto-detect.

Autoresearch Setup

/setup

Configure autoresearch hardware targets — Apple Silicon (MLX), NVIDIA server (CUDA), or RunPod cloud GPU. One-time setup stored persistently.

Autoresearch Status

/status

View autoresearch experiment results — shows results.tsv, keep/discard summary, and best metric. Supports local and remote server targets.

Skills (1)

advisor

/advisor

Analyzes any project and suggests where Karpathy's autoresearch pattern could optimize it — not just ML training, but code performance, pipeline throughput, prompt engineering, build speed, and more. References user's configured hardware targets. Use when user mentions autoresearch, autonomous experiments, optimization loops, overnight runs, fixed-budget optimization, or asks 'what could I autoresearch here?' or 'how can I optimize this automatically?'

Hooks (1)

Review workflow modifications before installing

Event Hooks

1 hook across 1 event

README

autoresearch — Claude Code Plugin

Karpathy's autoresearch as a Claude Code plugin. Run autonomous fixed-budget experiments overnight using your Claude Max subscription — zero per-token billing.

What it does

The autoresearch pattern: an AI agent iteratively edits one file, runs a fixed-budget experiment (typically 5 minutes), measures a single scalar metric, and keeps the change if it improved. Repeat forever. The agent runs 50-100 experiments overnight while you sleep.

This plugin adds:

Command	What it does
`/autoresearch:setup`	One-time hardware configuration (local Mac, remote server, RunPod)
`/autoresearch:run`	Pre-flight checks + start the experiment loop
`/autoresearch:status`	View results.tsv and summarize progress
`/autoresearch:advisor`	Analyze any project for autoresearch opportunities

The advisor works in any project — not just ML training. It identifies code performance, pipeline throughput, prompt engineering, build speed, and other optimization targets where the autoresearch pattern applies.

Install

# Add the marketplace (if you haven't already)
claude plugin marketplace add flight505/flight505-marketplace

# Install the plugin
claude plugin install autoresearch@flight505-plugins

Quick start

# 1. Configure your hardware targets (one-time)
/autoresearch:setup

# 2. cd into your autoresearch repo
cd ~/autoresearch

# 3. Start an experiment loop
/autoresearch:run

# 4. Check results anytime
/autoresearch:status

Unattended overnight run

cd ~/autoresearch
claude --dangerously-skip-permissions \
  -p "/autoresearch:run overnight, aim for 50+ experiments"

The --dangerously-skip-permissions flag enables fully autonomous operation. Use only in the autoresearch repo.

Hardware targets

The plugin supports three hardware targets. Configure them once with /autoresearch:setup.

Apple Silicon (MLX)

For MacBook M-series. Uses MLX — no PyTorch or CUDA needed. Best for quick daytime iteration.

NVIDIA CUDA (remote server)

For any NVIDIA GPU accessible via SSH. The recommended repo depends on your GPU:

GPU	Recommended Repo
Consumer (RTX 20/30/40/50 series)	flight505/autoresearch-blackwell
Datacenter (H100, A100)	karpathy/autoresearch

The Blackwell fork works on all consumer NVIDIA GPUs (Turing through Blackwell). Key features:

PyTorch SDPA attention (no Flash Attention 3 dependency)
torch.compile enabled by default (Linux + Triton)
OOM cascade with automatic activation checkpointing
--smoke-test flag for quick 10-second validation
28+ GPU FLOPS entries for accurate MFU reporting

RunPod (cloud GPU)

No hardware? Rent GPUs on demand with RunPod. Cloud provisioning is coming in a future update — for now, /autoresearch:setup stores your API key so you're ready when it launches.

The autoresearch pattern — beyond ML

The advisor skill (/autoresearch:advisor) identifies optimization targets in any project. The pattern works wherever you have:

A single file to edit — the mutable artifact
A scalar metric — computable without human judgment
A fixed time budget — equal cost per experiment
Automated execution — no human in the loop

Examples: API latency, build duration, bundle size, query execution time, prompt accuracy, pipeline throughput, inference speed.

Why Claude Max

Claude Code authenticates via OAuth with your Claude.ai account. With a Max plan, usage is billed against your subscription's included quota — not per-token API billing. An overnight run of 50-100 five-minute experiments is completely practical on the flat monthly fee.

License

MIT

Similar Plugins

qiushi-skill

2.8k

Qiushi Skill: methodology skills for AI agents guided by seeking truth from facts, with Claude Code, Cursor, OpenClaw, Codex, OpenCode, and Hermes guidance.

Stats

Version1.1.0

Stars0

Installs1

MaintenanceGood

LicenseMIT

AddedMar 28, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

autoresearch

Component Overview

/run, /setup +1

Commands

Agents

advisor

Skills

SessionStart

Hooks

MCP Servers

LSP Servers

Output Styles

Install

npx claudepluginhub flight505/autoresearch

Component Details

Commands (3)

Autoresearch Run

/run

Start an autoresearch experiment loop — pre-flight checks, reads program.md, enters autonomous keep-or-revert loop. Specify target: local, server, or auto-detect.

Autoresearch Setup

/setup

Configure autoresearch hardware targets — Apple Silicon (MLX), NVIDIA server (CUDA), or RunPod cloud GPU. One-time setup stored persistently.

Autoresearch Status

/status

View autoresearch experiment results — shows results.tsv, keep/discard summary, and best metric. Supports local and remote server targets.

Skills (1)

advisor

/advisor

Hooks (1)

Review workflow modifications before installing

Event Hooks

1 hook across 1 event

README

autoresearch — Claude Code Plugin

Karpathy's autoresearch as a Claude Code plugin. Run autonomous fixed-budget experiments overnight using your Claude Max subscription — zero per-token billing.

What it does

This plugin adds:

Command	What it does
`/autoresearch:setup`	One-time hardware configuration (local Mac, remote server, RunPod)
`/autoresearch:run`	Pre-flight checks + start the experiment loop
`/autoresearch:status`	View results.tsv and summarize progress
`/autoresearch:advisor`	Analyze any project for autoresearch opportunities

Install

# Add the marketplace (if you haven't already)
claude plugin marketplace add flight505/flight505-marketplace

# Install the plugin
claude plugin install autoresearch@flight505-plugins

Quick start

# 1. Configure your hardware targets (one-time)
/autoresearch:setup

# 2. cd into your autoresearch repo
cd ~/autoresearch

# 3. Start an experiment loop
/autoresearch:run

# 4. Check results anytime
/autoresearch:status

Unattended overnight run

cd ~/autoresearch
claude --dangerously-skip-permissions \
  -p "/autoresearch:run overnight, aim for 50+ experiments"

The --dangerously-skip-permissions flag enables fully autonomous operation. Use only in the autoresearch repo.

Hardware targets

The plugin supports three hardware targets. Configure them once with /autoresearch:setup.

Apple Silicon (MLX)

For MacBook M-series. Uses MLX — no PyTorch or CUDA needed. Best for quick daytime iteration.

NVIDIA CUDA (remote server)

For any NVIDIA GPU accessible via SSH. The recommended repo depends on your GPU:

GPU	Recommended Repo
Consumer (RTX 20/30/40/50 series)	flight505/autoresearch-blackwell
Datacenter (H100, A100)	karpathy/autoresearch

The Blackwell fork works on all consumer NVIDIA GPUs (Turing through Blackwell). Key features:

PyTorch SDPA attention (no Flash Attention 3 dependency)
torch.compile enabled by default (Linux + Triton)
OOM cascade with automatic activation checkpointing
--smoke-test flag for quick 10-second validation
28+ GPU FLOPS entries for accurate MFU reporting

RunPod (cloud GPU)

No hardware? Rent GPUs on demand with RunPod. Cloud provisioning is coming in a future update — for now, /autoresearch:setup stores your API key so you're ready when it launches.

The autoresearch pattern — beyond ML

The advisor skill (/autoresearch:advisor) identifies optimization targets in any project. The pattern works wherever you have:

A single file to edit — the mutable artifact
A scalar metric — computable without human judgment
A fixed time budget — equal cost per experiment
Automated execution — no human in the loop

Examples: API latency, build duration, bundle size, query execution time, prompt accuracy, pipeline throughput, inference speed.

Why Claude Max

License

MIT

Similar Plugins

qiushi-skill

2.8k

Qiushi Skill: methodology skills for AI agents guided by seeking truth from facts, with Claude Code, Cursor, OpenClaw, Codex, OpenCode, and Hermes guidance.

Stats

Version1.1.0

Stars0

Installs1

MaintenanceGood

LicenseMIT

AddedMar 28, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

autoresearch

Component Overview

Install

Component Details

Commands (3)

Skills (1)

Hooks (1)

README

autoresearch — Claude Code Plugin

What it does

Install

Quick start

Unattended overnight run

Hardware targets

Apple Silicon (MLX)

NVIDIA CUDA (remote server)

RunPod (cloud GPU)

The autoresearch pattern — beyond ML

Why Claude Max

License

Similar Plugins

qiushi-skill

autoresearch

Component Overview

Install

Component Details

Commands (3)

Skills (1)

Hooks (1)

README

autoresearch — Claude Code Plugin

What it does

Install

Quick start

Unattended overnight run

Hardware targets

Apple Silicon (MLX)

NVIDIA CUDA (remote server)

RunPod (cloud GPU)

The autoresearch pattern — beyond ML

Why Claude Max

License

Similar Plugins

qiushi-skill

caveman

ui-design

nanobanana