Marketplace

claude-lab

npx claudepluginhub butanium/claude-lab

README

View full README on GitHub

1 Plugin

clab

12·

Autonomous research orchestration: agents for hypothesis-driven investigation, experiment running, fresh-eyes review, and batch evaluation.

3mo

v0.1.0-055be21

Butanium

Stats

Plugins1

Stars12

UpdatedJun 16, 2026

Links

View on GitHub View Marketplace JSON

clab

STATUS: Work in progress - very experimental and fast evolving codebase

Claude Code plugin for autonomous research orchestration.

disclaimer

Report tend to still be sloppy (with not enough red teaming of the results etc.) but it's sloly getting better.

What It Does

A scaffolding system for hypothesis-driven research using Claude Code. The orchestrator agent acts as a PI — it maintains hypotheses, designs experiments, and delegates execution to specialized subagents (scientist, colleague, reviewer) that run with constrained permissions enforced by hooks.

Orchestrator agents

orchestrator — Autonomous research mode. Maintains RESEARCH_STATE.md, designs experiments, spawns subagents, synthesizes findings.
interactive-orchestrator — Interactive research mode. Same as orchestrator but collaborates with the user in real time.

Subagents (spawned by orchestrator via Task tool)

scientist — Runs experiments, writes reports. Can only write to its own experiment folder (hooks block RESEARCH_STATE.md, tools/, etc.).
colleague — Fresh-eyes review with intentionally limited context. Read-only, restricted to files specified in ALLOWED_FILES.
reviewer — Red-teams reports for common errors (missing CIs, overclaims, non-interactive plots, etc.).

Supporting skills (preloaded by orchestrator agents via frontmatter)

/research-principles — Core principles for hypothesis-driven investigation (shared across all roles).
/research-judging — How to set up and run the LLM judge pipeline for batch evaluation.
/experiment-structure — Standard experiment folder structure and templates.
/contact-supervisor — How to send notifications to the human supervisor via ntfy.sh.
/writing-guidelines — How to write up findings as an interactive Quarto report.
/supervisor-report — Process for writing and reviewing reports for the supervisor.
/efficient-api-usage — Cost and latency optimization (prompt caching, batch API).

Installation

For development (load directly without installation):

claude --plugin-dir /path/to/this/repo/plugins/clab

Note: --plugin-dir must be passed every time you run Claude. Changes to the plugin are reflected after restarting Claude.

For persistent install (via local marketplace):

Add this repo as a marketplace:

/plugin marketplace add /path/to/this/repo

Install the plugin:
```
/plugin install clab@claude-lab
```

To update after local changes, run /plugin marketplace update claude-lab then reinstall.

Tip for development: Enable auto-update on the marketplace (/plugin → Marketplaces → claude-lab → Enable auto-update) to automatically pick up changes at startup.

Local symlink install (workaround for GH #17688 — plugin frontmatter hooks don't fire):

The plugin system doesn't parse hooks from agent/skill frontmatter. This script symlinks agents, skills, and hooks into .claude/ so they're loaded by the local agent loader which correctly handles hooks.

# Run from your project directory (where .claude/ lives)
path/to/claude-lab/scripts/install-plugin-locally.sh path/to/claude-lab/plugins/clab

# Overwrite existing symlinks
path/to/claude-lab/scripts/install-plugin-locally.sh path/to/claude-lab/plugins/clab --force

# Uninstall
path/to/claude-lab/scripts/install-plugin-locally.sh path/to/claude-lab/plugins/clab --uninstall

Requires hook commands to use "$CLAUDE_PROJECT_DIR"/... paths (not ${CLAUDE_PLUGIN_ROOT}). Restart Claude Code after install.

Configuration

export CLAB_NTFY_TOPIC="your-ntfy-topic"  # Required for notifications

Usage

Start a research session:

claude --dangerously-skip-permissions

Then invoke the orchestrator agent with your research question:

claude --agent orchestrator --dangerously-skip-permissions "Your research question here"

Skills are preloaded automatically via the agent's frontmatter — no manual /skill loading needed.

Project Structure (created by orchestrator)

RESEARCH_STATE.md      # Hypotheses, evidence, confidence levels
TECHNICAL_GUIDE.md     # Project-specific technical knowledge
research_diary.md      # Reflections, @clement mentions
scaffolding_notes.md   # General autonomous research best practices
tools/                 # Reusable utilities (orchestrator maintains)
experiments/           # One folder per experiment (config.yaml, report.md, outputs/)
sidequests/            # Interesting tangents for later
archive/               # Deprecated files (never delete, always archive)

claude-lab

README

1 Plugin

clab

claude-lab

README

clab

disclaimer

What It Does

Orchestrator agents

Subagents (spawned by orchestrator via Task tool)

Supporting skills (preloaded by orchestrator agents via frontmatter)

Installation

Configuration

Usage

Project Structure (created by orchestrator)

Agents & Hooks

1 Plugin

clab

Related Marketplaces

nextjs

thedotmack

ruview

clab

disclaimer

What It Does

Orchestrator agents

Subagents (spawned by orchestrator via Task tool)

Supporting skills (preloaded by orchestrator agents via frontmatter)

Installation

Configuration

Usage

Project Structure (created by orchestrator)

Agents & Hooks

Related Marketplaces

nextjs

thedotmack

ruview