Plugin

auto-research-with-eyes

Name: auto-research-with-eyes
Author: llv22

Autonomous ML research pipeline: idea discovery → experiment → review → paper writing

npx claudepluginhub llv22/autoresearchwitheyes

Component Overview

Commands

Agents

Skills

MCP Servers

Component Details

Commands (5)

autor.auto-review-loop

/autor.auto-review-loop

Autonomous multi-round research review loop. Repeatedly reviews via Codex MCP, implements fixes, and re-reviews until positive assessment or max rounds reached. Use when user says "auto review loop", "review until it passes", or wants autonomous iterative improvement.

autor.idea-discovery

/autor.idea-discovery

Workflow 1: Full idea discovery pipeline. Orchestrates research-lit → idea-creator → novelty-check → research-reviewer to go from a broad research direction to validated, pilot-tested ideas. Use when user says "idea discovery pipeline" or wants the complete idea exploration workflow.

autor.paper-writing

/autor.paper-writing

Workflow 3: Full paper writing pipeline. Orchestrates paper-plan → paper-figure → paper-write → paper-compile → paper-improver to go from a narrative report to a polished, submission-ready PDF. Use when user says "write paper pipeline", "paper writing", or wants the complete paper generation workflow.

autor.research-pipeline

/autor.research-pipeline

Full research pipeline: Workflow 1 (idea discovery) → implementation → Workflow 2 (auto review loop). Goes from a broad research direction to validated, reviewed research. Use when user says "full pipeline", "end-to-end research", or wants the complete autonomous research lifecycle. Does NOT include paper writing (Workflow 3) — invoke /autor.paper-writing separately after this completes.

autor.template

/autor.template

Download and set up venue-specific LaTeX templates. Supports iclr2026, neurips2026, icml2026, emnlp2026, or a custom venue. Use when user says "download template", "setup venue", or wants to configure a new conference template.

Agents (2)

paper-improver

/paper-improver

Use this agent when the paper-writing pipeline needs to iteratively improve a compiled paper. Runs REVIEWER_MODEL xhigh review, implements fixes, and recompiles for MAX_IMPROVEMENT_ROUNDS rounds to polish writing quality, fix theoretical inconsistencies, and soften overclaims.

research-reviewer

/research-reviewer

Use this agent when the idea-discovery pipeline needs external critical feedback on research ideas, papers, or experimental results. Invokes REVIEWER_MODEL via Codex MCP with xhigh reasoning to act as a senior ML reviewer.

Skills (10)

analyze-results

/analyze-results

Analyze ML experiment results, compute statistics, generate comparison tables and insights. Use when user says "analyze results", "compare", or needs to interpret experimental data.

idea-creator

/idea-creator

Generate and rank research ideas given a broad direction. Use when user says "找idea", "brainstorm ideas", "generate research ideas", "what can we work on", or wants to explore a research area for publishable directions.

monitor-experiment

/monitor-experiment

Monitor running experiments, check progress, collect results. Use when user says "check results", "is it done", "monitor", or wants experiment output.

novelty-check

/novelty-check

Verify research idea novelty against recent literature. Use when user says "查新", "novelty check", "有没有人做过", "check novelty", or wants to verify a research idea is novel before implementing.

paper-compile

/paper-compile

Compile LaTeX paper to PDF, fix errors, and verify output. Use when user says "编译论文", "compile paper", "build PDF", "生成PDF", or wants to compile LaTeX into a submission-ready PDF.

paper-figure

/paper-figure

Generate publication-quality figures and tables from experiment results. Use when user says "画图", "作图", "generate figures", "paper figures", or needs plots for a paper.

paper-plan

/paper-plan

Generate a structured paper outline from review conclusions and experiment results. Use when user says "写大纲", "paper outline", "plan the paper", "论文规划", or wants to create a paper plan before writing.

paper-write

/paper-write

Draft LaTeX paper section by section from an outline. Use when user says "写论文", "write paper", "draft LaTeX", "开始写", or wants to generate LaTeX content from a paper plan.

research-lit

/research-lit

Search and analyze research papers, find related work, summarize key ideas. Use when user says "find papers", "related work", "literature review", "what does this paper say", or needs to understand academic papers.

run-experiment

/run-experiment

Deploy and run ML experiments on local or remote GPU servers. Use when user says "run experiment", "deploy to server", "跑实验", or needs to launch training jobs.

MCP Servers (1)

Connects to external services

codex

README

AutoResearchWithEyes

Let Claude Code do research while you sleep. Wake up to find your paper scored, weaknesses identified, experiments run, and narrative rewritten — autonomously.

· Join Community

A Claude Code plugin for autonomous ML research workflows. Orchestrates cross-model collaboration — Claude Code drives the research while an external LLM (via Codex MCP) acts as a critical reviewer. Also supports alternative model combinations (e.g., GLM + GPT, GLM + MiniMax) — no Claude API required.

Why cross-model? A single model reviewing its own output creates blind spots. Two complementary models — Claude Code for fast execution, GPT-5.4 xhigh for rigorous critique — produce better outcomes than either alone. Going from 1 to 2 models is the biggest gain; adding more gives diminishing returns.

Quick Start

# 1. Clone and enter the project
git clone https://github.com/llv23/AutoResearchWithEyes.git
cd AutoResearchWithEyes

# 2. Set up Codex MCP (for cross-model review)
npm install -g @openai/codex
codex auth login
# Codex MCP auto-configures from .mcp.json when running in the project directory

# 3. Launch Claude Code — skills and commands are auto-discovered
claude

# Run any workflow command:
> /autor.idea-discovery "your research direction"      # Plain text → ./autor.idea_discovery/
> /autor.idea-discovery path/to/autor.idea_discovery/task_spec.md    # Spec file  → path/to/autor.idea_discovery/
> /autor.auto-review-loop                              # Review → fix → re-review overnight
> /autor.paper-writing "NARRATIVE_REPORT.md"           # Narrative → polished PDF
> /autor.research-pipeline "your research direction"   # Full end-to-end pipeline

Each command produces a self-contained output folder — the input spec, intermediate results, final reports, and live status tracking are all organized in one place:

path/to/
└── autor.idea_discovery/                  # Self-contained output folder
    ├── task_spec.md                       # Copy of input (portability & reproducibility)
    ├── task_status.md                     # Live status, decisions, linked idea rankings
    ├── LITERATURE_SURVEY.md               # Phase 1: landscape summary with paper table
    ├── IDEA_REPORT.md                     # Phase 5: ranked ideas + experiment plan
    └── REVIEW_<topic>_<date>.md           # Phase 4: external reviewer feedback

task_status.md is updated after each phase — it tracks pipeline progress, key decisions, and includes a ranked idea table where each idea links to its full description in IDEA_REPORT.md.

See click-through/case-0/ for a complete worked example.

Output Folder Conventions

All workflow commands follow the same self-contained output pattern: the output folder includes task_spec.md (input), task_status.md (live status with linked rankings), and all generated results — making it portable, shareable, and reproducible.

Command	Input	Output Folder
`/autor.idea-discovery path/to/spec.md`	`spec.md`	`path/to/autor.idea_discovery/`
`/autor.idea-discovery "plain text"`	(inline)	`./autor.idea_discovery/`
`/autor.idea-discovery spec.md — output: custom/`	`spec.md`	`custom/`

Override the output folder with — output: path/to/output/ appended to any command.

See Setup for full details.

Features

10 composable skills — atomic building blocks: literature search, idea generation, novelty check, experiments, paper writing
4 workflow commands — orchestrate skills + agents into end-to-end pipelines (/autor.idea-discovery, /autor.auto-review-loop, /autor.paper-writing, /autor.research-pipeline)
2 specialized agents — research-reviewer (senior ML reviewer via Codex MCP) and paper-improver (2-round auto-improvement)
Cross-model collaboration — Claude Code executes, GPT-5.4 xhigh reviews. Adversarial, not self-play
Centralized configuration — all constants in CLAUDE.md, override per-invocation with inline arguments
Venue templates — bundled template directories with fallback resolution: TEMPLATE_DIR/VENUE/ → bundled → error with instructions
GPU deployment — auto rsync, screen sessions, multi-GPU parallel experiments, live monitoring
Flexible models — default Claude x GPT-5.4, also supports GLM + GPT, GLM + MiniMax

Architecture

View full README on GitHub

Similar Plugins

omp

251

Oh My Paper research harness: memory system, Codex delegation, and pipeline commands for academic research projects.

Stats

Version2.0.0

Stars0

MaintenanceExcellent

LicenseMIT

AddedMar 25, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

Safety Signals

Caution

Uses power tools

Uses Bash, Write, or Edit tools

Help us improve

Share bugs, ideas, or general feedback.

Back to Plugins

AutoResearchWithEyes

Let Claude Code do research while you sleep. Wake up to find your paper scored, weaknesses identified, experiments run, and narrative rewritten — autonomously.

· Join Community

Why cross-model? A single model reviewing its own output creates blind spots. Two complementary models — Claude Code for fast execution, GPT-5.4 xhigh for rigorous critique — produce better outcomes than either alone. Going from 1 to 2 models is the biggest gain; adding more gives diminishing returns.

Quick Start

# 1. Clone and enter the project
git clone https://github.com/llv23/AutoResearchWithEyes.git
cd AutoResearchWithEyes

# 2. Set up Codex MCP (for cross-model review)
npm install -g @openai/codex
codex auth login
# Codex MCP auto-configures from .mcp.json when running in the project directory

# 3. Launch Claude Code — skills and commands are auto-discovered
claude

# Run any workflow command:
> /autor.idea-discovery "your research direction"      # Plain text → ./autor.idea_discovery/
> /autor.idea-discovery path/to/autor.idea_discovery/task_spec.md    # Spec file  → path/to/autor.idea_discovery/
> /autor.auto-review-loop                              # Review → fix → re-review overnight
> /autor.paper-writing "NARRATIVE_REPORT.md"           # Narrative → polished PDF
> /autor.research-pipeline "your research direction"   # Full end-to-end pipeline

Each command produces a self-contained output folder — the input spec, intermediate results, final reports, and live status tracking are all organized in one place:

path/to/
└── autor.idea_discovery/                  # Self-contained output folder
    ├── task_spec.md                       # Copy of input (portability & reproducibility)
    ├── task_status.md                     # Live status, decisions, linked idea rankings
    ├── LITERATURE_SURVEY.md               # Phase 1: landscape summary with paper table
    ├── IDEA_REPORT.md                     # Phase 5: ranked ideas + experiment plan
    └── REVIEW_<topic>_<date>.md           # Phase 4: external reviewer feedback

task_status.md is updated after each phase — it tracks pipeline progress, key decisions, and includes a ranked idea table where each idea links to its full description in IDEA_REPORT.md.

See click-through/case-0/ for a complete worked example.

Output Folder Conventions

Command	Input	Output Folder
`/autor.idea-discovery path/to/spec.md`	`spec.md`	`path/to/autor.idea_discovery/`
`/autor.idea-discovery "plain text"`	(inline)	`./autor.idea_discovery/`
`/autor.idea-discovery spec.md — output: custom/`	`spec.md`	`custom/`

Override the output folder with — output: path/to/output/ appended to any command.

See Setup for full details.

Features

10 composable skills — atomic building blocks: literature search, idea generation, novelty check, experiments, paper writing
4 workflow commands — orchestrate skills + agents into end-to-end pipelines (/autor.idea-discovery, /autor.auto-review-loop, /autor.paper-writing, /autor.research-pipeline)
2 specialized agents — research-reviewer (senior ML reviewer via Codex MCP) and paper-improver (2-round auto-improvement)
Cross-model collaboration — Claude Code executes, GPT-5.4 xhigh reviews. Adversarial, not self-play
Centralized configuration — all constants in CLAUDE.md, override per-invocation with inline arguments
Venue templates — bundled template directories with fallback resolution: TEMPLATE_DIR/VENUE/ → bundled → error with instructions
GPU deployment — auto rsync, screen sessions, multi-GPU parallel experiments, live monitoring
Flexible models — default Claude x GPT-5.4, also supports GLM + GPT, GLM + MiniMax

auto-research-with-eyes

Component Overview

Component Details

Commands (5)

Agents (2)

Skills (10)

MCP Servers (1)

README

AutoResearchWithEyes

Quick Start

Output Folder Conventions

Features

Architecture

Similar Plugins

omp

Help us improve

Help us improve

auto-research-with-eyes

Component Overview

Component Details

Commands (5)

Agents (2)

Skills (10)

MCP Servers (1)

README

AutoResearchWithEyes

Quick Start

Output Folder Conventions

Features

Architecture

Similar Plugins

omp

Help us improve

magi-researchers

research-collaborator

research-companion

gyoshu

synapse