From ai-engineering
Evaluate a complete system architecture above the component level. Trigger with "review this architecture", "is this design going to scale?", "staff-level design review", or when assessing strengths, tradeoffs, and scalability/security/cost risks of a whole system rather than a single service.
How this skill is triggered — by the user, by Claude, or both
Slash command
/ai-engineering:architecture-review <design doc, diagram, or system description><design doc, diagram, or system description>The summary Claude sees in its skill listing — used to decide when to auto-load this skill
> If you see unfamiliar placeholders or need to check which tools are connected, see [CONNECTORS.md](../../CONNECTORS.md). This skill follows the [SKILL-CONTRACT.md](../../SKILL-CONTRACT.md) — it appends a `machine_output` block.
If you see unfamiliar placeholders or need to check which tools are connected, see CONNECTORS.md. This skill follows the SKILL-CONTRACT.md — it appends a
machine_outputblock.
Evaluate a system architecture the way a staff engineer would in a design review: not "is this component correct" but "will this whole system hold up." It sits above system-design, api-design, and database-design and judges the composition of all of them.
/architecture-review <design doc, diagram, or system description>
Review the provided architecture: @$1
If no design is provided, ask for the system description, the scale targets (traffic, data, users), and the hard constraints (budget, latency, compliance).
┌─────────────────────────────────────────────────────────────────┐
│ ARCHITECTURE REVIEW │
├─────────────────────────────────────────────────────────────────┤
│ STANDALONE (always works) │
│ ✓ Describe the system or paste a design doc / diagram │
│ ✓ Strengths, weaknesses, and explicit tradeoffs │
│ ✓ Scalability, reliability, security, and cost risks │
│ ✓ Alternatives considered and why this one (or not) │
│ ✓ Scorecard across 5 dimensions │
├─────────────────────────────────────────────────────────────────┤
│ SUPERCHARGED (when you connect your tools) │
│ + Code graph: map the design against the real dependency graph │
│ + Knowledge base: check against your architecture principles │
│ + Observability: ground "scales?" in real traffic and latency │
└─────────────────────────────────────────────────────────────────┘
security-review and threat-model## Architecture Review: [system name]
### Summary
[2-3 sentences: what the system does, the headline verdict, the single biggest risk.]
### Strengths
- [What this design gets right and why it matters]
### Weaknesses & Tradeoffs
| # | Area | Concern | Tradeoff being made | Severity |
|---|------|---------|---------------------|----------|
### Risk Register
| # | Risk type | Risk | Trigger condition | Mitigation |
|---|-----------|------|-------------------|------------|
| 1 | Scalability | [risk] | [when it bites] | [fix] |
### Alternatives
- [Alternative design, what it would change, when it would be the better call]
### Scorecard
| Dimension | Score | Note |
|-----------|-------|------|
| Scalability | /100 | |
| Reliability | /100 | |
| Security | /100 | |
| Cost | /100 | |
| Complexity | /100 | |
### Verdict
[SOUND] / [SOUND WITH CHANGES] / [RECONSIDER]
This is an assessment skill. After the human-readable review, append a machine_output
block per SKILL-CONTRACT.md.
Scorecard rubric (each 0-100): 90-100 production-grade at the stated scale, 75-89 sound with named changes, 50-74 material risks to resolve first, below 50 reconsider the approach.
machine_output:
skill: architecture-review
version: "1.0"
timestamp: <ISO-8601>
status: complete
scorecard:
scalability: 72
reliability: 80
security: 68
cost: 85
complexity: 60
findings:
- id: F1
severity: high
category: scalability
location: ingest-service
description: Single synchronous writer fans out to 6 downstreams; caps throughput
recommendations:
- id: R1
action: Put a queue between ingest and downstreams; make consumers idempotent
effort: medium
addresses: [F1]
artifacts:
- architecture-review-report
next_actions:
- skill: security-review
reason: Trust boundary at the ingest edge needs a dedicated pass
- skill: task-decomposition
reason: Turn the risk register into a sequenced remediation plan
If ~~code-graph is connected (e.g. the agent-context spoke):
If ~~knowledge base is connected:
If ~~observability is connected:
npx claudepluginhub shubham0086/the-machine-os --plugin ai-engineeringCreates bite-sized, testable implementation plans from specs or requirements, with file structure and task decomposition. Activates before coding multi-step tasks.