From skill-steward
Plans and documents handoffs between specialized AI agents for multi-agent workflows. Includes templates for handoff documents and parallel batch contracts.
How this skill is triggered — by the user, by Claude, or both
Slash command
/skill-steward:multi-agent-handoffThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Structure work so multiple agents can execute sequentially without losing context.
Structure work so multiple agents can execute sequentially without losing context.
Create or update HANDOFF.md (or a section in the task issue) with:
## Goal
{one sentence outcome}
## Done
- {completed items}
## Next
1. {ordered steps for the receiving agent}
## Constraints
- {tech stack, style, files not to touch}
## Verification
- {commands or checks that must pass}
## Validation status
- {commands run}
- {commands skipped or blocked, with reason}
- {blocked JSON explained with `steward blocked explain --input <path> --json`, when available}
- {schema/output drift checked with `steward schema check-outputs --json`, when machine-readable output is part of the handoff}
- {claims not proven because validation was skipped or blocked}
## Partial results
- {missing, partial, superseded, or timed-out agents/lenses}
## Context links
- {paths, PRs, prior decisions}
## Artifact capture
- {ADR, FAQ, skill, evidence note, test, validator, generator, or check that should absorb durable learning}
For broad decomposable work, the parent may use a disposable batch section instead of a new plan format. Keep only enough contract to move safely:
Saved, Cost/duplication, and Future hot path;Use this compact shape before dispatching parallel lanes:
## Parallel Batch
Original goal: {user-visible outcome}
Acceptance check: {how the parent will know the original goal is satisfied}
Product impact check: {source-owned product behavior/API/UI/perf/doc-user workflow that must change or be proven unchanged}
Default native gate: {repo-native command or reason none exists}
Aggregate gate: {final validation before claiming completion}
Detour budget: {when to stop repairing tools and return to the goal}
Claim ceiling: {strongest claim allowed if all lanes pass}
Non-claims: {adjacent claims this batch cannot prove}
Acceleration note:
- Product impact line: {recognized prefix plus proof; use support_only: Steward scaffolding only when no product surface moved}
- Saved: {time, uncertainty, or risk reduced by running lanes in parallel}
- Cost/duplication: {duplicated work, integration cost, or coordination drag caused or avoided}
- Future hot path: {command, check, skill, script, deletion, or native route created for the next run}
| Lane | Agent/role | Scope | Write set | Forbidden paths | Native gate | Direct fix? | Terminal state |
|------|------------|-------|-----------|-----------------|-------------|-------------|----------------|
| L1 | {owner} | {bounded work} | `{paths}` | `{paths}` | `{command}` | yes/no | pending |
Delete or collapse this batch section after synthesis unless it becomes a review artifact. Do not preserve lane maps as project management state.
Parent-assigned lane contracts are the only write-authority surface. Advisory ecology route dispatch_lane_candidates, MoE findings, A2A notes, and reviewer comments are inputs only.
Each assigned lane should state:
lane_id, assigned agent/role, scope, exact write_set, and forbidden_paths;direct_fix_allowed: true|false, claim ceiling, non-claims, and escalation triggers;integrated_to_owner, rejected, blocked_to_current_ledger, promoted_to_durable_owner, deleted, reported_to_parent, accepted_as_input, partial, timed_out, or superseded.Only a parent lane contract may set direct_fix_allowed: true. Direct fixes must be bounded low-risk work with exact write sets, declared forbidden paths, inherited safety rules, required impact/permission checks, and available validation. If validation is skipped or blocked, the result downgrades to blocked or recommendation; it is not integrated_to_owner.
A2A artifacts never authorize writes, widen scope, accept/reject lanes, or launder steward judgment. The parent or explicit A2Human checkpoint owns authorization, synthesis, final claims, and scope changes.
When a lane proves a fix in a temp clone, fork, worktree, or external checkout,
that proof is accepted_as_input until the owner checkout carries the smallest
source-owned diff and reruns the native gate. Do not treat temp proof as
repo-owned evidence by default.
Before claiming a lane is integrated:
source_owner_status as temp_only,
owner_landed_pending_gate, owner_gate_passed, or
blocked_owner_dirty.integrated_to_owner, blocked_to_current_ledger,
accepted_as_input, or rejected.If the source checkout is dirty or cannot accept the diff safely, record the temp proof as a candidate and keep the stronger source-owned claim unproven. After one bounded landing attempt, close the lane as blocked or rejected rather than creating another proof packet for the same unlanded result.
runtime_behavior:, public_api:,
product_native_gate:, visual_capture:, performance_metric:,
release_path:, developer_workflow:, command_output:,
plugin_install:, or support_only:.
If Cost/duplication is not lower than the saved uncertainty, risk, or
repeat work, default to leave_native, rejected, or a low-confidence
support claim.HANDOFF.md to the next agent or subagent.references/ or issues)partial, timed_out, blocked, or superseded terminal statedisable-model-invocation: true on skills that must run only when invokedRepo Truth Verifier, Boundary Leak Reviewer, Evidence Ladder Reviewer, Doc Collapse Reviewer, Harness QA Reviewer, and Stale External Assumption Reviewer..codex/agents/*.toml or ~/.codex/agents/*.toml; define name, description, and developer_instructions, and spawn subagents only when the user explicitly asks for subagent delegation..cursor/agents/*.md or ~/.cursor/agents/*.md; each run has isolated context, and background/parallel execution is useful for independent slices.npx skills add arenukvern/skill_steward --skill multi-agent-handoff
See references/sources.md. When researching, follow skill-source-citations.
Reviews code diffs for over-engineering: unnecessary complexity, reinvented standard library, speculative abstractions. One-line findings per location. Use for 'review for over-engineering' or /ponytail-review.
npx claudepluginhub arenukvern/skill_steward --plugin skill-steward