Use when designing end-to-end verification for a running application — teaches the executable contract pattern where smoke tests serve as the agreement between diagnostic server and CLI client.
From interhelmnpx claudepluginhub mistakeknot/interagency-marketplace --plugin interhelmThis skill uses the workspace's default tool permissions.
Implements structured self-debugging workflow for AI agent failures: capture errors, diagnose patterns like loops or context overflow, apply contained recoveries, and generate introspection reports.
Compares coding agents like Claude Code and Aider on custom YAML-defined codebase tasks using git worktrees, measuring pass rate, cost, time, and consistency.
Designs and optimizes AI agent action spaces, tool definitions, observation formats, error recovery, and context for higher task completion rates.
Use when:
A smoke test is NOT just an end-to-end test. It is the contract between the diagnostic server and the CLI client — the authoritative definition of "this application works correctly."
POST /diag/smoke-test
{
"checks": [
{ "name": "server_reachable", "type": "health", "expect": "status == 'healthy'" },
{ "name": "subsystems_up", "type": "health", "expect": "all_subsystems('healthy')" },
{ "name": "state_initialized", "type": "assert", "expect": "simulation.tick >= 0" },
{ "name": "ui_renders", "type": "ui_state", "expect": "active_view != null" },
{ "name": "can_step", "type": "diff", "params": { "steps": 1 }, "expect": "simulation.tick > before.simulation.tick" },
{ "name": "no_errors", "type": "health", "expect": "error_count == 0" }
]
}
Response:
{
"passed": 5,
"failed": 1,
"total": 6,
"results": [
{ "name": "server_reachable", "status": "pass", "duration_ms": 12 },
{ "name": "subsystems_up", "status": "pass", "duration_ms": 45 },
{ "name": "state_initialized", "status": "pass", "duration_ms": 8 },
{ "name": "ui_renders", "status": "pass", "duration_ms": 15 },
{ "name": "can_step", "status": "pass", "duration_ms": 230 },
{ "name": "no_errors", "status": "fail", "detail": "error_count = 2", "duration_ms": 10 }
]
}
When adding a new subsystem to the application:
/diag/healthThe smoke test is the source of truth. When it fails after a code change:
Never delete a smoke test check to make tests pass. Either fix the code or update the expectation with a comment explaining why.