From factory
Improves existing projects through systematic experimentation: study, research, hypothesis generation, build/eval loop, and archival. Triggered by 'improve X' or 'make X better'.
How this skill is triggered — by the user, by Claude, or both
Slash command
/factory:workflow-improve <project_path> [--focus <target>]<project_path> [--focus <target>]The summary Claude sees in its skill listing — used to decide when to auto-load this skill
The user wants: **$ARGUMENTS**
The user wants: $ARGUMENTS
Run local study to gather observations:
factory study $PROJECT_PATH
Writes observations to .factory/strategy/observations.md.
factory agent researcher --task "Deep research for the project. Read observations at .factory/strategy/observations.md. Analyze codebase structure, eval scores, and experiment history. Search the web for best practices relevant to weak dimensions. Check .factory/archive/ for prior knowledge. Write findings to .factory/strategy/research-local.md.
Read: .factory/strategy/observations.md
Write output to: .factory/strategy/research-local.md" --project "$PROJECT_PATH" --timeout 600
Apply the CEO Review Gate protocol:
.factory/strategy/research-local.md.factory/reviews/ceo-verdict-research.mdOn RELOOP: return to researcher (max 3 iterations)
factory agent strategist --task "Generate prioritized hypotheses. Read the backlog at .factory/strategy/backlog.md — clear as many items as possible. Read Hypothesis Budget from observations for constraints. Read CEO research review at .factory/reviews/ceo-verdict-researcher.md. Each hypothesis must be specific, scoped to one PR, tied to observations, with expected impact on eval dimensions. Tag backlog items with **Backlog item:** and new items with **New:**. Write to .factory/strategy/current.md.
Read: .factory/strategy/observations.md, .factory/strategy/research-local.md
Write output to: .factory/strategy/current.md" --project "$PROJECT_PATH" --timeout 600
Apply the CEO Review Gate protocol:
.factory/strategy/current.md.factory/reviews/ceo-verdict-strategy.mdOn RELOOP: return to strategist (max 3 iterations)
factory begin $PROJECT_PATH --hypothesis "Implement hypothesis"
factory agent builder --task "Implement the current hypothesis from .factory/strategy/current.md. Read CLAUDE.md and factory.md. Read the CEO strategy approval. Implement exactly what the hypothesis describes. Run tests. Commit and open a draft PR.
Read: .factory/strategy/current.md
Write output to: .factory/reviews/builder-latest.md" --project "$PROJECT_PATH" --timeout 600
Apply the CEO Review Gate protocol:
.factory/reviews/builder-latest.md.factory/reviews/ceo-verdict-build.mdOn RELOOP: return to builder (max 3 iterations)
factory agent evaluator --task "Run eval: factory eval $PROJECT_PATH. Capture composite score. Report delta from baseline. Interpret dimension changes.
Read: .factory/reviews/builder-latest.md
Write output to: .factory/reviews/evaluator-latest.md" --project "$PROJECT_PATH" --timeout 600
factory precheck $PROJECT_PATH --score-before 0 --score-after 0
factory finalize $PROJECT_PATH --id 1 --verdict keep --hypothesis 'hypothesis'
factory agent archivist --task "Archive experiment results and learnings.
Read: .factory/experiments/verdict.json
Write output to: .factory/archive/experiment.md" --project "$PROJECT_PATH" --timeout 300 --model haiku &
(fire-and-forget — CEO continues immediately)
npx claudepluginhub akashgit/remote-factory --plugin factoryRuns a research workflow with baseline measurement, failure analysis, web research, and strategy generation for metric-driven optimization. Use when project has research_target configured.
Runs iterative experiments to optimize measurable metrics (speed, accuracy, config). Manages .lab/ directory for experiment history and autonomous workflow.
Generate and critically evaluate grounded improvement ideas for the current project by scanning the codebase. Use for 'what to improve', 'give ideas', or proactive project direction suggestions.