Help us improve
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
Share bugs, ideas, or general feedback.
By krzemienski
Linear single-pour architecture of the Anneal plugin family. One planner, one red team, one oracle, one validate, emit.
npx claudepluginhub krzemienski/anneal --plugin anneal-castPost-plan reviewer. Reads the finished markdown plan and produces a ruthless audit envelope — identifies what would block implementation if handed to a new team. Invoked at stage 4 close-out after Prometheus completes.
Architecture synthesizer. Reads all reviewer envelopes (Metis, Momus, Red-Team Trinity) and emits a bird's-eye verdict with release coherence, deployment risk, and blast radius assessment. Final review gate before Hephaestus validates. Invoked at stage 5 close-out.
Cast-flavored planner. Writes the full plan in a single pass — no retry, no tournament, no deepen loop. Consumes Metis directives and the probe report; emits phase files plus plan.md. Invoked at stage 4 of every Cast run, exactly once per iteration.
Assumptions adversary. Reads the plan and finds every load-bearing assumption — API stability, file paths, dependencies, environment, ordering, user actions. Invoked at stage 5 in parallel with the other two Red-Team members.
Scope adversary. Reads the plan and finds where scope creeps — files touched that the user did not ask about, 'while-we-here' refactors, gold-plating. Invoked at stage 5 in parallel with the other two Red-Team members.
XML and plan emitter for anneal-cast. Reads the approved plan, all reviewer envelopes, and hephaestus evidence, then assembles an Opus 4.7 semantic-XML prompt plus a plan directory at ${ANNEAL_RUNS_ROOT:-./.anneal/runs}/{run_id}/. The only skill permitted to write outside the plugin's scoped staging directory. Triggers: invoke at stage 7 of every /anneal-cast:anneal run only when rollup emission_decision equals EMIT. Do NOT invoke on RE_LOOP or ABORT outcomes, do NOT invoke twice per run (emission is atomic), do NOT rewrite plan markdown during serialization, and do NOT add an XML declaration (schema forbids it).
Functional validator for anneal-cast. Builds and exercises the real artifact described by the plan, captures evidence (build logs, CLI stdout/stderr, API responses with headers and body, screenshots), and returns PASS or FAIL with every verdict tied to a specific evidence file. Triggers: invoke at stage 6 of every /anneal-cast:anneal run once per successful review pass, always before atlas emits. Do NOT write test files, mocks, stubs, or test doubles; do NOT trust build success as validation; do NOT modify the plan to force a PASS — on FAIL return the verdict and let the orchestrator re-loop through metis.
Pre-plan consultant for anneal-cast. Reads the user task and probe report, catches ambiguity, unstated requirements, and slop-risk patterns, and returns structured directives the planner must follow. Triggers: invoke at stage 3 of every /anneal-cast:anneal run; invoke when folding a validate FAIL back as new planner input; invoke when a task needs an ambiguity-audit before a plan is written. Do NOT invoke at any other stage, do NOT invoke twice in sequence, and do NOT invoke to write or critique finished plans (use momus for that).
Post-plan reviewer for anneal-cast. Reads the finished markdown plan and returns a ruthless audit envelope — not kind, not collaborative — that identifies what would block implementation if the plan were handed to a new team. Triggers: invoke at stage 4 close-out immediately after prometheus-cast completes; invoke once per plan attempt; re-invoke on re-loop because the plan changed. Do NOT invoke before a plan exists (use metis for pre-plan ambiguity audit), do NOT invoke to fix the plan, and do NOT emit Temper-style 0-100 scores in Cast runs.
Architecture synthesizer for anneal-cast. Reads every reviewer envelope — metis, momus, and all three red-team-trinity envelopes — and emits a bird's-eye verdict covering release coherence, deployment risk, breaking changes, and blast radius. Final review gate before hephaestus validates. Triggers: invoke at stage 5 close-out once per iteration after red-team-trinity returns all three envelopes. Do NOT invoke in parallel with the reviewers (oracle needs their output), do NOT invent new findings (only aggregate existing ones), and do NOT override a reviewer BLOCK without synthesis reasoning.
Uses power tools
Uses Bash, Write, or Edit tools
Share bugs, ideas, or general feedback.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge.
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Multi-model consensus engine integrating OpenAI Codex CLI, Gemini CLI, and Claude CLI for collaborative code review and problem-solving.
Ultra-compressed communication mode. Cuts ~75% of tokens while keeping full technical accuracy by speaking like a caveman.
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Memory compression system for Claude Code - persist context across sessions
Curate auto-memory, promote learnings to CLAUDE.md and rules, extract proven patterns into reusable skills.
Standalone image generation plugin using Nano Banana MCP server. Generates and edits images, icons, diagrams, patterns, and visual assets via Gemini image models. No Gemini CLI dependency required.
Shannon Framework v6 — single-plugin consolidation replacing OMC + VF + Crucible + 13 others. 26 commands, 22 skills, 10 agents, 14 hooks across 7 domain modules + 4 enforcement layers.
Sharp-eyed visual-audit suite for Claude Code. Two coupled skills catch real UI defects — contrast failures, false affordances, modal opacity, contract mismatches — via real-system probes, zero mocks, evidence-cited verdicts.
Deepest-mode planning — consensus + gates + phase hierarchy + multi-plan tournament synthesis. Self-contained. Works with Claude Code and OpenCode.
Fixed-Point Deepen architecture of the Anneal plugin family. One plan, heated and cooled repeatedly — inline red team at every depth, Momus 0-100 scoring, convergence by variance/delta/cap.
3-agent unanimous consensus validation with hard gates for Claude Code. Maps Lead/Alpha/Bravo roles to CC subagents, enforces unanimity at phase transitions, persists evidence per phase+role.
Controlled heating, slow cooling, iterative tempering — applied to work plans.
Runtime status (2026-04-22): Cast pipeline verified end-to-end in a real Claude Code worker — all 9 Greek-god agents dispatched, artifact written with exact byte match, XML emitted and passes validate-xml.py. See VERIFICATION-SUMMARY.md for the full trace. Alloy and Temper passed load verification; full E2E runs in progress.
Anneal is a Claude Code plugin family that converts a vague task into a rigorously-reviewed execution artifact: an XML prompt, a plan directory, and the skill enrichment needed to run it. It replaces the earlier deepest-plan prototype (shipped with 91 validator defects due to asymmetric vendoring) with a cleaner core built around three named plan-review archetypes — Metis, Momus, Oracle — and an always-on red team.
The name is literal: the plugin implements simulated annealing against plan-quality scores. Heat (generate candidates), cool (score, prune), temper (red-team critique), repeat until convergence.
Rather than picking one architecture as the default, we ship three complete, installable plugin variants — one per architecture — so you can install all three and compare side-by-side on real tasks.
| Path | What it is |
|---|---|
README.md | You are here. |
ARCHITECTURE-PROPOSALS.md | The architecture document (499 lines, 17 sections). Shared invariants, seven-stage spine, agent roster, three proposals. |
COMPARISON-PLAYBOOK.md | How to test Cast / Alloy / Temper head-to-head. Decision rubric. |
INSTALL.md | Install cheatsheet. Umbrella or per-plugin marketplace. |
diagrams/anneal-architectures.html | Shared visual. Three Mermaid flowcharts with zoom/pan, editorial aesthetic. |
.claude-plugin/marketplace.json | Umbrella dev marketplace listing all three plugins. |
cast/ | Plugin · anneal-cast · Linear single-pour architecture. |
alloy/ | Plugin · anneal-alloy · Tournament consensus architecture. |
temper/ | Plugin · anneal-temper · Fixed-point deepen architecture. |
_shared/ | Reference docs consumed by all three plugins (Opus 4.7 XML schema, agent prompts, plan-reviewer schema, plugin-format cheatsheet). |
scripts/smoke-test.sh | Cross-plugin validation gate. Runs each plugin's validate-plugin.py and reports pass/fail. |
scripts/phase-4-review-prompts.md | Staged reviewer prompts (architect + code-reviewer) for multi-perspective audit. |
Each of the three plugin directories (cast/, alloy/, temper/) is a complete, installable Claude Code plugin:
{architecture}/
├── .claude-plugin/plugin.json # Manifest
├── .claude-plugin/marketplace.json # Per-plugin dev marketplace
├── README.md # Install + usage
├── PRD.md # Architecture-specific product requirements
├── ARCHITECTURE.md # Implementation detail
├── LICENSE # MIT
├── commands/anneal.md # /anneal-{name}:anneal slash command
├── skills/{7-8 skills}/SKILL.md # Metis, Prometheus variant, Momus, Red-Team Trinity, Oracle, Hephaestus, Atlas
├── agents/{9 agents}.md # Agent definitions with model assignments
├── hooks/hooks.json # SessionStart: plugin-loaded marker
├── scripts/validate-plugin.py # Self-validation
├── scripts/orchestrate.sh # Pipeline implementation
├── diagrams/{name}-architecture.html # Architecture-specific visual
└── docs/ # Invariants, worked example, emission format
Install all three, restart Claude Code, and all three commands register:
# Add the umbrella marketplace
/plugin marketplace add /Users/nick/Desktop/anneal
# Install all three
/plugin install anneal-cast@anneal-umbrella-dev
/plugin install anneal-alloy@anneal-umbrella-dev
/plugin install anneal-temper@anneal-umbrella-dev
After restart, three slash commands are available:
/anneal-cast:anneal <task> # Linear · ~8 spawns · ~4 min
/anneal-alloy:anneal <task> # Tournament · ~18 spawns · ~6 min · default --versions 5
/anneal-temper:anneal <task> # Fixed-point deepen · ~8×depth spawns · ~7 min · default --depth 3
Full install options, debugging, uninstall: see INSTALL.md.
All three satisfy the same eight invariants (red team always, validate always, XML + plan output, skill enrichment, unbounded re-loop, parallelization, category routing, dual-family prompts). They differ only in how stage 4 — Plan — works.