From ai-engineering
Design or review an AI agent or multi-agent system. Trigger with "design an agent for", "review this agent architecture", "should this be one agent or many?", or when defining an agent's tools, control loop, memory, guardrails, and where humans stay in the loop.
How this skill is triggered — by the user, by Claude, or both
Slash command
/ai-engineering:agent-design <agent goal, or an existing agent design to review><agent goal, or an existing agent design to review>The summary Claude sees in its skill listing — used to decide when to auto-load this skill
> If you see unfamiliar placeholders or need to check which tools are connected, see [CONNECTORS.md](../../CONNECTORS.md). This skill follows the [SKILL-CONTRACT.md](../../SKILL-CONTRACT.md) — it appends a `machine_output` block.
If you see unfamiliar placeholders or need to check which tools are connected, see CONNECTORS.md. This skill follows the SKILL-CONTRACT.md — it appends a
machine_outputblock.
Design an agent that does the job without going off the rails, or review one that does. The
failure modes of agents are specific: unbounded loops, tool misuse, lost context, and silent
drift. This skill designs against them. produces an agent-design-doc that prompt-review and
security-review consume.
/agent-design <agent goal, or an existing design to review>
Design or review: @$1
If designing from scratch, ask what the agent must accomplish, what tools it needs, and what it must never do without a human.
┌─────────────────────────────────────────────────────────────────┐
│ AGENT DESIGN │
├─────────────────────────────────────────────────────────────────┤
│ STANDALONE (always works) │
│ ✓ Single-agent vs multi-agent decision │
│ ✓ Tool set + capability scoping (least privilege) │
│ ✓ Control loop: plan, act, observe, stop conditions │
│ ✓ Memory + context strategy │
│ ✓ Guardrails, validation, human-in-the-loop points │
├─────────────────────────────────────────────────────────────────┤
│ SUPERCHARGED (when you connect your tools) │
│ + MCP: design tool access as real MCP servers │
│ + Blackboard: shared state + response cache for multi-agent runs│
│ + Knowledge base: ground the design in your agent patterns │
└─────────────────────────────────────────────────────────────────┘
## Agent Design: [goal]
### Topology
[Single or multi-agent, and why. Diagram of agents + responsibilities.]
### Tools (least privilege)
| Tool | Why | Risk | Gate |
|------|-----|------|------|
### Control Loop
[Plan/act/observe, stop conditions, step budget.]
### Guardrails & HITL
- [Validation + where a human approves]
### Scorecard
| Dimension | Score | Note |
|-----------|-------|------|
| Topology fit | /100 | |
| Control loop safety | /100 | |
| Capability scoping | /100 | |
| Memory/context | /100 | |
This is an assessment skill (it evaluates a design against agent failure modes). Append a
machine_output block per SKILL-CONTRACT.md.
Scorecard rubric (each 0-100): 90-100 robust against the known agent failure modes, 75-89 minor gaps, 50-74 will loop or misuse tools under stress, below 50 redesign.
machine_output:
skill: agent-design
version: "1.0"
timestamp: <ISO-8601>
status: complete
scorecard:
topology_fit: 82
control_loop_safety: 58
capability_scoping: 70
memory_context: 84
findings:
- id: F1
severity: critical
category: control_loop_safety
location: executor agent
description: No max-iteration budget; a failing tool call can loop indefinitely
- id: F2
severity: high
category: capability_scoping
location: tool set
description: Agent has write access to prod DB with no human gate on deletes
recommendations:
- id: R1
action: Add a hard step budget and a no-progress stop condition
effort: low
addresses: [F1]
- id: R2
action: Scope DB access to read; route destructive actions through HITL approval
effort: medium
addresses: [F2]
artifacts:
- agent-design-doc
next_actions:
- skill: prompt-review
reason: The agent's system prompt must encode the stop conditions and guardrails
- skill: security-review
reason: Tool access and capability scoping need a security pass
If ~~MCP is connected:
If ~~blackboard is connected (e.g. the mcp-agent-toolkit spoke):
blackboard_write / blackboard_read as the inter-agent shared state so agents stay decoupled instead of passing everything through promptscache_get / cache_set to skip re-paying for identical LLM calls across a runIf ~~knowledge base is connected:
npx claudepluginhub shubham0086/the-machine-os --plugin ai-engineeringCreates bite-sized, testable implementation plans from specs or requirements, with file structure and task decomposition. Activates before coding multi-step tasks.