Plugin

anneal-cast

By krzemienski

Linear single-pour architecture of the Anneal plugin family. One planner, one red team, one oracle, one validate, emit.

What's Inside

Slash Commands1

/anneal-cast:anneal

/anneal

Run Cast architecture — single planner, always-on red team, functional validation, emit XML plus plan directory.

Agents9

momus

/momus

Post-plan reviewer. Reads the finished markdown plan and produces a ruthless audit envelope — identifies what would block implementation if handed to a new team. Invoked at stage 4 close-out after Prometheus completes.

oracle

/oracle

Architecture synthesizer. Reads all reviewer envelopes (Metis, Momus, Red-Team Trinity) and emits a bird's-eye verdict with release coherence, deployment risk, and blast radius assessment. Final review gate before Hephaestus validates. Invoked at stage 5 close-out.

prometheus-cast

/prometheus-cast

Cast-flavored planner. Writes the full plan in a single pass — no retry, no tournament, no deepen loop. Consumes Metis directives and the probe report; emits phase files plus plan.md. Invoked at stage 4 of every Cast run, exactly once per iteration.

redteam-assumptions

/redteam-assumptions

Assumptions adversary. Reads the plan and finds every load-bearing assumption — API stability, file paths, dependencies, environment, ordering, user actions. Invoked at stage 5 in parallel with the other two Red-Team members.

redteam-scope

/redteam-scope

Scope adversary. Reads the plan and finds where scope creeps — files touched that the user did not ask about, 'while-we-here' refactors, gold-plating. Invoked at stage 5 in parallel with the other two Red-Team members.

Skills7

atlas

/atlas

XML and plan emitter for anneal-cast. Reads the approved plan, all reviewer envelopes, and hephaestus evidence, then assembles an Opus 4.7 semantic-XML prompt plus a plan directory at ${ANNEAL_RUNS_ROOT:-./.anneal/runs}/{run_id}/. The only skill permitted to write outside the plugin's scoped staging directory. Triggers: invoke at stage 7 of every /anneal-cast:anneal run only when rollup emission_decision equals EMIT. Do NOT invoke on RE_LOOP or ABORT outcomes, do NOT invoke twice per run (emission is atomic), do NOT rewrite plan markdown during serialization, and do NOT add an XML declaration (schema forbids it).

hephaestus

/hephaestus

Functional validator for anneal-cast. Builds and exercises the real artifact described by the plan, captures evidence (build logs, CLI stdout/stderr, API responses with headers and body, screenshots), and returns PASS or FAIL with every verdict tied to a specific evidence file. Triggers: invoke at stage 6 of every /anneal-cast:anneal run once per successful review pass, always before atlas emits. Do NOT write test files, mocks, stubs, or test doubles; do NOT trust build success as validation; do NOT modify the plan to force a PASS — on FAIL return the verdict and let the orchestrator re-loop through metis.

metis

/metis

Pre-plan consultant for anneal-cast. Reads the user task and probe report, catches ambiguity, unstated requirements, and slop-risk patterns, and returns structured directives the planner must follow. Triggers: invoke at stage 3 of every /anneal-cast:anneal run; invoke when folding a validate FAIL back as new planner input; invoke when a task needs an ambiguity-audit before a plan is written. Do NOT invoke at any other stage, do NOT invoke twice in sequence, and do NOT invoke to write or critique finished plans (use momus for that).

momus

/momus

Post-plan reviewer for anneal-cast. Reads the finished markdown plan and returns a ruthless audit envelope — not kind, not collaborative — that identifies what would block implementation if the plan were handed to a new team. Triggers: invoke at stage 4 close-out immediately after prometheus-cast completes; invoke once per plan attempt; re-invoke on re-loop because the plan changed. Do NOT invoke before a plan exists (use metis for pre-plan ambiguity audit), do NOT invoke to fix the plan, and do NOT emit Temper-style 0-100 scores in Cast runs.

oracle

/oracle

Architecture synthesizer for anneal-cast. Reads every reviewer envelope — metis, momus, and all three red-team-trinity envelopes — and emits a bird's-eye verdict covering release coherence, deployment risk, breaking changes, and blast radius. Final review gate before hephaestus validates. Triggers: invoke at stage 5 close-out once per iteration after red-team-trinity returns all three envelopes. Do NOT invoke in parallel with the reviewers (oracle needs their output), do NOT invent new findings (only aggregate existing ones), and do NOT override a reviewer BLOCK without synthesis reasoning.

Hooks1

Event Hooks

1 hook across 1 event

Stats

Version0.1.0

LanguageHTML

Stars1

MaintenanceGood

LicenseMIT

Last CommitApr 23, 2026

AddedApr 24, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

Available In

anneal-umbrella-dev1

Safety Signals

Caution

Uses power tools

Uses Bash, Write, or Edit tools

Anneal

Controlled heating, slow cooling, iterative tempering — applied to work plans.

Runtime status (2026-04-22): Cast pipeline verified end-to-end in a real Claude Code worker — all 9 Greek-god agents dispatched, artifact written with exact byte match, XML emitted and passes validate-xml.py. See VERIFICATION-SUMMARY.md for the full trace. Alloy and Temper passed load verification; full E2E runs in progress.

Anneal is a Claude Code plugin family that converts a vague task into a rigorously-reviewed execution artifact: an XML prompt, a plan directory, and the skill enrichment needed to run it. It replaces the earlier deepest-plan prototype (shipped with 91 validator defects due to asymmetric vendoring) with a cleaner core built around three named plan-review archetypes — Metis, Momus, Oracle — and an always-on red team.

The name is literal: the plugin implements simulated annealing against plan-quality scores. Heat (generate candidates), cool (score, prune), temper (red-team critique), repeat until convergence.

Rather than picking one architecture as the default, we ship three complete, installable plugin variants — one per architecture — so you can install all three and compare side-by-side on real tasks.

What this directory contains

Path	What it is
`README.md`	You are here.
`ARCHITECTURE-PROPOSALS.md`	The architecture document (499 lines, 17 sections). Shared invariants, seven-stage spine, agent roster, three proposals.
`COMPARISON-PLAYBOOK.md`	How to test Cast / Alloy / Temper head-to-head. Decision rubric.
`INSTALL.md`	Install cheatsheet. Umbrella or per-plugin marketplace.
`diagrams/anneal-architectures.html`	Shared visual. Three Mermaid flowcharts with zoom/pan, editorial aesthetic.
`.claude-plugin/marketplace.json`	Umbrella dev marketplace listing all three plugins.
`cast/`	Plugin · anneal-cast · Linear single-pour architecture.
`alloy/`	Plugin · anneal-alloy · Tournament consensus architecture.
`temper/`	Plugin · anneal-temper · Fixed-point deepen architecture.
`_shared/`	Reference docs consumed by all three plugins (Opus 4.7 XML schema, agent prompts, plan-reviewer schema, plugin-format cheatsheet).
`scripts/smoke-test.sh`	Cross-plugin validation gate. Runs each plugin's `validate-plugin.py` and reports pass/fail.
`scripts/phase-4-review-prompts.md`	Staged reviewer prompts (architect + code-reviewer) for multi-perspective audit.

Each of the three plugin directories (cast/, alloy/, temper/) is a complete, installable Claude Code plugin:

{architecture}/
├── .claude-plugin/plugin.json        # Manifest
├── .claude-plugin/marketplace.json   # Per-plugin dev marketplace
├── README.md                         # Install + usage
├── PRD.md                            # Architecture-specific product requirements
├── ARCHITECTURE.md                   # Implementation detail
├── LICENSE                           # MIT
├── commands/anneal.md                # /anneal-{name}:anneal slash command
├── skills/{7-8 skills}/SKILL.md      # Metis, Prometheus variant, Momus, Red-Team Trinity, Oracle, Hephaestus, Atlas
├── agents/{9 agents}.md              # Agent definitions with model assignments
├── hooks/hooks.json                  # SessionStart: plugin-loaded marker
├── scripts/validate-plugin.py        # Self-validation
├── scripts/orchestrate.sh            # Pipeline implementation
├── diagrams/{name}-architecture.html # Architecture-specific visual
└── docs/                             # Invariants, worked example, emission format

Quick start

Install all three, restart Claude Code, and all three commands register:

# Add the umbrella marketplace
/plugin marketplace add /Users/nick/Desktop/anneal

# Install all three
/plugin install anneal-cast@anneal-umbrella-dev
/plugin install anneal-alloy@anneal-umbrella-dev
/plugin install anneal-temper@anneal-umbrella-dev

After restart, three slash commands are available:

/anneal-cast:anneal <task>         # Linear · ~8 spawns · ~4 min
/anneal-alloy:anneal <task>        # Tournament · ~18 spawns · ~6 min · default --versions 5
/anneal-temper:anneal <task>       # Fixed-point deepen · ~8×depth spawns · ~7 min · default --depth 3

Full install options, debugging, uninstall: see INSTALL.md.

The three architectures at a glance

All three satisfy the same eight invariants (red team always, validate always, XML + plan output, skill enrichment, unbounded re-loop, parallelization, category routing, dual-family prompts). They differ only in how stage 4 — Plan — works.

anneal-cast

What's Inside

anneal-cast

Popularity

What's Inside

Confidence

README

Anneal

What this directory contains

Quick start

The three architectures at a glance

Similar Plugins

caveman

llm-council-plugin

open-agent-hub

claude-mem

ui-design

claude-code-token-saver

More by krzemienski

shannon

lynx

deepest-plan

crucible

anneal-temper

Anneal

What this directory contains

Quick start

The three architectures at a glance

Popularity

Health & Quality

More by krzemienski

shannon

lynx

deepest-plan

crucible

anneal-temper

Similar Plugins

caveman

llm-council-plugin

open-agent-hub

claude-mem

ui-design

claude-code-token-saver