By lagz0ne
Automated experiment loop for continuous improvement — bootstrap, benchmark, iterate, log
Claude Code plugin for automated experiment loops. Each iteration targets a 1% improvement — small, measurable, compounding.
Inspired by karpathy/autoresearch and davebcn87/pi-autoresearch, ported to work as a native Claude Code plugin using only commands, skills, and shell scripts — no custom tools or extensions required.
claude plugin add lagz0ne/1percent
/autoresearch "optimize inference speed"
Claude will:
autoresearch/* branch
The loop runs until you say stop or 3 consecutive experiments are discarded.
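The stop rule can be illustrated with a small shell helper. This is only a sketch under stated assumptions: the function name `loop_status` is hypothetical, and it assumes the decisions are available as one word per line (`keep` or `discard`, oldest first), which is not necessarily how the plugin stores them.

```shell
# Hypothetical helper: reads one decision per line on stdin (oldest first)
# and prints "stop" once the most recent decisions form a run of
# discards at least as long as the limit (default 3).
loop_status() {
  awk -v limit="${1:-3}" '
    $0 == "discard" { streak++; next }   # extend the current run of discards
    { streak = 0 }                       # any other decision resets the run
    END { print ((streak + 0 >= limit + 0) ? "stop" : "continue") }
  '
}
```

For example, `printf 'keep\ndiscard\ndiscard\ndiscard\n' | loop_status` prints `stop`, while a `keep` anywhere in the last three decisions prints `continue`.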
Your benchmark outputs METRIC name=value lines on stdout:
METRIC accuracy=0.95
METRIC duration_ms=1234
The plugin extracts these into JSON, compares against the previous run, and decides: keep (commit) or discard (revert). Every run is logged to ignored runtime state under .autoresearch/sessions/<session-id>/run.jsonl.
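The extract-and-compare step can be sketched in shell and awk. This is a minimal illustration, not the plugin's scripts/parse-metrics.sh: the function names `parse_metrics` and `decide`, the numeric-only value pattern, and the "strictly better in a stated direction" rule are all assumptions.

```shell
# Hypothetical sketch: collect METRIC name=value lines from stdin
# into a single JSON object; every other line is ignored.
parse_metrics() {
  awk '
    /^METRIC [A-Za-z_][A-Za-z0-9_]*=-?[0-9]+(\.[0-9]+)?$/ {
      pair  = substr($0, 8)              # text after the "METRIC " prefix
      eq    = index(pair, "=")
      name  = substr(pair, 1, eq - 1)
      value = substr(pair, eq + 1)
      sep   = (n++ > 0) ? "," : ""
      out   = out sep "\"" name "\":" value
    }
    END { print "{" out "}" }
  '
}

# Hypothetical decision rule: decide BASELINE CANDIDATE DIRECTION
# prints "keep" when the candidate is strictly better, else "discard".
# DIRECTION is "lower" (e.g. duration_ms) or "higher" (e.g. accuracy).
decide() {
  awk -v base="$1" -v cand="$2" -v dir="$3" 'BEGIN {
    if (dir == "lower") better = (cand + 0 < base + 0)
    else                better = (cand + 0 > base + 0)
    print (better ? "keep" : "discard")
  }'
}
```

Piping the two example lines above through `parse_metrics` yields `{"accuracy":0.95,"duration_ms":1234}`, and `decide 1234 1200 lower` prints `keep`.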
In-progress state lives under ignored .autoresearch/ files so the session survives context resets without polluting commits. Default resume reads only the active session and the last 20 run lines; older sessions are cold storage unless asked for or explicitly extended. Reusable learning is committed separately under research/learnings/<session-id>.md.
- .autoresearch/sessions/<session-id>/ (ignored, never committed)
- .autoresearch/current
- research/learnings/<session-id>.md (committed)
- Result: trailer

This keeps active research resumable without filling git history with session trash or forcing every resume to load old logs.
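A default resume, as described above, could look roughly like the following. It is a sketch only: the function name `resume_context` is hypothetical, and it assumes .autoresearch/current holds the active session id as plain text, which the source does not confirm.

```shell
# Hypothetical sketch: print the last 20 run lines of the active session.
# ASSUMPTION: <root>/current contains the active session id as plain text.
resume_context() {
  local root="${1:-.autoresearch}"
  [ -f "$root/current" ] || { echo "no active session" >&2; return 1; }

  local session_id
  session_id="$(cat "$root/current")"

  local log="$root/sessions/$session_id/run.jsonl"
  [ -f "$log" ] || { echo "no run log for $session_id" >&2; return 1; }

  # Older lines and older sessions stay on disk but are not loaded.
  tail -n 20 "$log"
}
```

Reading only the tail keeps resume cost bounded no matter how long a session has been running.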
| Component | File | Role |
|---|---|---|
| Command | commands/autoresearch.md | /autoresearch — bootstrap a session |
| Skill | skills/autoresearch/SKILL.md | Loop discipline, decision rules, anti-patterns |
| Reference | skills/autoresearch/references/experiment-protocol.md | Templates, JSONL schema, examples |
| Script | scripts/parse-metrics.sh | METRIC name=value → JSON (pure awk) |
| Hooks | hooks/stop.md, hooks/pre-compact.md | Keep the loop alive across turns and compaction |
See CHANGELOG.md.
bash tests/test-parse-metrics.sh # 18 assertions
bash tests/test-experiment-flow.sh # 21 assertions
bash scripts/version.sh patch # 0.1.0 → 0.1.1
bash scripts/version.sh minor # 0.1.0 → 0.2.0
bash scripts/version.sh major # 0.1.0 → 1.0.0
MIT