Skill

harness:deploy

Finalizes evolver optimization: shows scores/improvements/diffs, tags git commits with metrics, pushes changes, cleans temp files, or promotes insights to CLAUDE.md.

Python

npx claudepluginhub raphaelchristi/harness-evolver --plugin harness-evolver

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/harness-evolver:deploy

User invocable

Model invocable

Inline context

Default effort

Tool Access

This skill is limited to the following tools:

Read, Write, Bash, Glob, AskUserQuestion

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Finalize the evolution results. In v3, the best code is already in the main branch (auto-merged during evolve). Deploy is about cleanup, tagging, and pushing.

SKILL.md

89 lines · ~716 tokens

Similar Skills

harness:evolve

Runs propose-evaluate-iterate loop to optimize and evolve AI agent performance using LangSmith evaluations and git worktrees for isolation. Requires .evolver.json setup.

8 tools

harness-evolver

evolve

Starts an autonomous evolutionary code optimization run using Claude Code models as mutation operators, iteratively improving code via selection, crossover, and evaluation.

claude-evolve

evolve

388

Continuously selects and cycles through RPI improvement tasks until halted. Automatically runs post-mortems, analyzes goals gaps, and compounds fixes.

20 files

agentops

Stats

Language: Python

Stars: 21

Forks: 2

Maintenance: Excellent

Last Commit: Apr 18, 2026

/harness:deploy

Finalize the evolution results. In v3, the best code is already in the main branch (auto-merged during evolve). Deploy is about cleanup, tagging, and pushing.

What To Do

TOOLS="${EVOLVER_TOOLS:-$([ -d ".evolver/tools" ] && echo ".evolver/tools" || echo "$HOME/.evolver/tools")}"
EVOLVER_PY="${EVOLVER_PY:-$([ -f "$HOME/.evolver/venv/bin/python" ] && echo "$HOME/.evolver/venv/bin/python" || echo "python3")}"
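
The two lines above resolve the tools directory and the Python interpreter with a fixed precedence: an explicit environment variable first, then a project-local (or virtualenv) path, then a global fallback. A minimal Python sketch of the same precedence follows, for readers who find the nested `${VAR:-$(...)}` form hard to scan. The function names `resolve_tools` and `resolve_python` are hypothetical and are not part of harness-evolver.

```python
import os
from pathlib import Path


def resolve_tools(env=None, cwd=None, home=None):
    """Mirror the TOOLS line: $EVOLVER_TOOLS, else ./.evolver/tools, else ~/.evolver/tools.

    Hypothetical helper. It returns an absolute path for the project-local
    case, whereas the shell version keeps the relative ".evolver/tools".
    """
    env = os.environ if env is None else env
    cwd = Path.cwd() if cwd is None else Path(cwd)
    home = Path.home() if home is None else Path(home)
    override = env.get("EVOLVER_TOOLS")
    if override:  # like ${VAR:-default}, an empty value counts as unset
        return override
    local = cwd / ".evolver" / "tools"
    return str(local) if local.is_dir() else str(home / ".evolver" / "tools")


def resolve_python(env=None, home=None):
    """Mirror the EVOLVER_PY line: $EVOLVER_PY, else the evolver venv, else python3."""
    env = os.environ if env is None else env
    home = Path.home() if home is None else Path(home)
    override = env.get("EVOLVER_PY")
    if override:
        return override
    venv_python = home / ".evolver" / "venv" / "bin" / "python"
    return str(venv_python) if venv_python.is_file() else "python3"
```

Passing `env`, `cwd`, and `home` explicitly is only there to make the precedence easy to check without touching the real environment.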

1. Show Results

python3 -c "
import json
c = json.load(open('.evolver.json'))
baseline = c['history'][0]['score'] if c['history'] else 0
best = c['best_score']
improvement = best - baseline
print(f'Baseline: {baseline:.3f}')
print(f'Best: {best:.3f} ({improvement:+.3f}, {improvement/max(baseline,0.001)*100:.0f}% improvement)')
print(f'Iterations: {c[\"iterations\"]}')
print(f'Experiment: {c[\"best_experiment\"]}')
"
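
The one-liner above falls back to a baseline of 0 when `history` is empty and then divides by `max(baseline, 0.001)`, which produces a very large percentage in that case. The sketch below shows the same report as a plain function that omits the percentage when there is no positive baseline. The name `summarize` is hypothetical; the keys (`history`, `best_score`, `iterations`, `best_experiment`) are the ones the one-liner reads from `.evolver.json`.

```python
def summarize(config):
    """Return the report lines for a parsed .evolver.json dict (hypothetical helper)."""
    history = config.get("history") or []
    baseline = history[0]["score"] if history else 0.0
    best = config["best_score"]
    delta = best - baseline

    lines = [f"Baseline: {baseline:.3f}"]
    if baseline > 0:
        # Signed formatting keeps a regression readable ("-0.050" rather than "+-0.050").
        lines.append(f"Best: {best:.3f} ({delta:+.3f}, {delta / baseline * 100:+.0f}%)")
    else:
        # A relative change is undefined without a positive baseline, so omit it.
        lines.append(f"Best: {best:.3f} ({delta:+.3f})")
    lines.append(f"Iterations: {config['iterations']}")
    lines.append(f"Experiment: {config['best_experiment']}")
    return lines
```

To use it against the real file, load `.evolver.json` with `json.load` and print the returned lines joined by newlines.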

List the commits made since evolution started:

git log --oneline --since="$(python3 -c "import json; print(json.load(open('.evolver.json'))['created_at'][:10])")" | head -20

2. Ask What To Do (interactive)

{
  "questions": [{
    "question": "Evolution complete. What would you like to do?",
    "header": "Deploy",
    "multiSelect": false,
    "options": [
      {"label": "Tag and push", "description": "Create a git tag with the score and push to remote"},
      {"label": "Just review", "description": "Show the full diff of all changes made during evolution"},
      {"label": "Clean up only", "description": "Remove temporary files (trace_insights.json, etc.) but don't push"},
      {"label": "Promote learnings", "description": "Add proven evolution insights to CLAUDE.md (permanent knowledge)"}
    ]
  }]
}

3. Execute

If "Tag and push":

VERSION=$(python3 -c "import json; c=json.load(open('.evolver.json')); print(f'evolver-v{c[\"iterations\"]}')")
SCORE=$(python3 -c "import json; print(f'{json.load(open(\".evolver.json\"))[\"best_score\"]:.3f}')")
git tag -a "$VERSION" -m "Evolver: score $SCORE"
git push origin main --tags
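
The tag name and message are both derived from `.evolver.json`: the tag is `evolver-v` plus the iteration count, and the message carries the best score to three decimals. A minimal sketch of that derivation, returning argument lists rather than running git, is shown below. `tag_command` and `push_command` are hypothetical names used only for illustration.

```python
def tag_command(config):
    """Build the `git tag` arguments from a parsed .evolver.json dict (hypothetical helper)."""
    version = f"evolver-v{config['iterations']}"
    score = f"{config['best_score']:.3f}"
    return ["git", "tag", "-a", version, "-m", f"Evolver: score {score}"]


def push_command(remote="origin", branch="main"):
    """Build the `git push` arguments; --tags pushes every local tag, not only the new one."""
    return ["git", "push", remote, branch, "--tags"]
```

Each list can be passed to `subprocess.run(cmd, check=True)`. Because the tag name depends only on the iteration count, `git tag` fails if a tag with that name already exists, which would happen after an earlier run that finished with the same iteration count.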

If "Just review":

ITERATIONS=$(python3 -c "import json; print(json.load(open('.evolver.json'))['iterations'])")
git diff "HEAD~$ITERATIONS" HEAD

If "Clean up only":

rm -f trace_insights.json best_results.json comparison.json production_seed.md production_seed.json
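
If the final report should say which temporary files were actually removed, `rm -f` is not enough, since it stays silent about missing files. The sketch below removes the same five files and returns the names it deleted. `clean_temp_files` is a hypothetical helper, not part of harness-evolver.

```python
from pathlib import Path

# The same files the rm -f command above targets.
TEMP_FILES = (
    "trace_insights.json",
    "best_results.json",
    "comparison.json",
    "production_seed.md",
    "production_seed.json",
)


def clean_temp_files(root="."):
    """Delete known temporary files under root and return the names that were removed."""
    removed = []
    for name in TEMP_FILES:
        path = Path(root) / name
        if path.is_file():  # skip missing files silently, as rm -f does
            path.unlink()
            removed.append(name)
    return removed
```

Only the listed names are touched, so `.evolver.json` and `evolution_memory.md` are left in place.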

If "Promote learnings":

"$EVOLVER_PY" "$TOOLS/promote_learnings.py" --memory evolution_memory.md --target CLAUDE.md --threshold 5 --dry-run

Show the dry-run output. If the user approves, run without --dry-run.
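
The dry-run-then-confirm flow can be sketched as follows. The flags are the ones shown in the command above; `promote_command`, `promote`, and the injected `approve` and `run` callables are hypothetical and stand in for the user prompt and the process launcher.

```python
def promote_command(python, tools, dry_run=True, threshold=5):
    """Build the promote_learnings.py invocation shown above, with or without --dry-run."""
    cmd = [
        python,
        f"{tools}/promote_learnings.py",
        "--memory", "evolution_memory.md",
        "--target", "CLAUDE.md",
        "--threshold", str(threshold),
    ]
    if dry_run:
        cmd.append("--dry-run")
    return cmd


def promote(python, tools, approve, run):
    """Run the dry run first; repeat without --dry-run only if the user approves.

    approve: callable returning True or False (for example, the answer to a prompt).
    run: callable that executes a command list (for example, a subprocess wrapper).
    """
    run(promote_command(python, tools, dry_run=True))
    if not approve():
        return False
    run(promote_command(python, tools, dry_run=False))
    return True
```

Keeping the approval step between the two runs means CLAUDE.md is never modified before the user has seen the dry-run output.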

4. Report

- Summarize what was done.
- Give the LangSmith experiment URL for the best result.
- Suggest reviewing the changes before deploying to production.
