Help us improve
Share bugs, ideas, or general feedback.
From north-starr-genai
Autonomously improve any skill or agent prompt via measure-change-test hill-climbing. Dispatches the `auto-improver` agent on a separate thread. Use when a skill gives inconsistent results, when asked to "improve/optimize/autoresearch" a skill, or when output quality needs iterative tightening.
npx claudepluginhub selcukyucel/north-starr-genai --plugin north-starr-genaiHow this skill is triggered — by the user, by Claude, or both
Slash command
/north-starr-genai:autoimprove <skill or agent name to improve><skill or agent name to improve>The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Entry point for autonomous prompt optimization. Delegates the hill-climbing loop to the `auto-improver` agent on a separate thread.
Guides technical evaluation of code review feedback: read fully, restate for understanding, verify against codebase, respond with reasoning or pushback before implementing.
Share bugs, ideas, or general feedback.
Entry point for autonomous prompt optimization. Delegates the hill-climbing loop to the auto-improver agent on a separate thread.
Inspired by Karpathy's autoresearch — the same pattern applied to skill and agent prompt refinement.
The user provides the target skill or agent name.
SKILL.md or agent fileGuardrails:
auto-improver itself (infinite recursion)orchestrator directly (too coupled to pipeline state — suggest optimizing an individual transition handler instead)Ask the user for test inputs (1–3 scenarios) and offer to generate the scoring checklist (3–6 yes/no questions). Get approval before starting.
Spawn auto-improver via the Agent tool (subagent_type: "north-starr-genai:auto-improver") on a separate thread. Pass:
The agent will:
.plans/autoimprove-<target>/ORIGINAL.mdIMPROVED.md, results.tsv, CHANGELOG.mdeval-designer where the target is a prompt-producing componentThe agent pauses every 5 rounds for a progress check. The user can continue, stop, or adjust the checklist. If the user says "autopilot", the agent skips future checkpoints.
Read the agent's output files and surface a concise summary:
Autoimprove Complete: <target>
────────────────────────────────
Score: <baseline>% → <final>% (<+delta>%)
Rounds: <total> (<kept> kept, <reverted> reverted)
Files:
.plans/autoimprove-<target>/ORIGINAL.md (backup)
.plans/autoimprove-<target>/IMPROVED.md (proposed new version)
.plans/autoimprove-<target>/results.tsv (per-round log)
.plans/autoimprove-<target>/CHANGELOG.md (insights + recommendations)
To adopt: cp .plans/autoimprove-<target>/IMPROVED.md <target file path>
/learn IntegrationAfter presenting results:
The changelog captures <N> insights about what works for this target. Want to run /learn
to capture these as pattern rules for future skills?
auto-improver agent/analyze-code