Use when benchmarking and evolving prompt assets through a fixed-suite optimization loop
From second-claude-codenpx claudepluginhub unclejobs-ai/second-claude-code --plugin second-claude-codeThis skill uses the workspace's default tool permissions.
Designs and optimizes AI agent action spaces, tool definitions, observation formats, error recovery, and context for higher task completion rates.
Enables AI agents to execute x402 payments with per-task budgets, spending controls, and non-custodial wallets via MCP tools. Use when agents pay for APIs, services, or other agents.
Compares coding agents like Claude Code and Aider on custom YAML-defined codebase tasks using git worktrees, measuring pass rate, cost, time, and consistency.
A loop without a benchmark is meaningless.
Run a Karpathy-style optimization loop over the plugin's own prompt assets.
skills/**/SKILL.md, agents/*.md, commands/*.md, or templates/*.md| Command | Purpose |
|---|---|
list-suites | List bundled loop suites in benchmarks/loop/ |
show-suite <name> | Inspect the validated suite manifest |
run <name> | Create an isolated run branch, score a baseline, generate candidates, and promote a winner if min_delta is met |
resume <run_id> | Reload the saved loop state from .data/state/loop-active.json |
node scripts/loop-runner.mjs show-suite <name> to load the suite and verify the allowed targets.node scripts/loop-runner.mjs run <name> ... to create the isolated codex/loop-... branch and run worktree.min_delta is not met, or the loop plateaus.min_delta.${CLAUDE_PLUGIN_DATA}/state/loop-active.json.captures/loop-<run_id>/benchmarks/loop/*.jsonprompt-detect.mjs for v1. It is slash-command only..mjs, docs/, README*, or tests/ as loop targets.min_delta is not met.