By gbasin
Adversarially stress-test technical plans by verifying claims against real documentation, running proof-of-concept code in .poc-stress-test/, and iteratively updating the plan to catch issues before building.
npx claudepluginhub gbasin/stress-test-skill --plugin stress-testAn agent skill that stress-tests technical plans before you build them.
Models are lazy about verification. They'll write a plan that says "use SQLite for concurrent writes" or "Y.js supports persistence out of the box" and move on without checking. These unchecked assumptions become mid-build surprises that force architectural pivots, messy workarounds, and wasted context.
This skill forces the model to actually verify its claims — searching real docs, ranking evidence quality, running proof-of-concept code when search is not enough, and fixing the plan before implementation starts. Each verification runs in a fresh sub-agent context, so there's less confirmation bias from the planning conversation — fewer hidden assumptions, less mid-build churn, and a clearer line between what's confirmed and what's still risk.
A plan claimed bash + sqlite3 would be fast enough for git hooks. The skill spun up parallel agents to research alternatives and run an actual latency POC:

The POC disproved the assumption — bash was 4-5x slower than estimated — and surfaced the real tradeoffs across runtimes:

npx skills add gbasin/stress-test-skill --all -g
Works with Claude Code, Codex, Cursor, Gemini CLI, GitHub Copilot, Windsurf, and other supported agents.
Six phases, each building on the last:
.poc-stress-test/ directory using the smallest representative setup in the most production-like environment available.Codex, Gemini, Claude の3つの AI で Plan ファイルを並列レビュー。実装計画の妥当性、抜け漏れ、リスクを分析する
Share bugs, ideas, or general feedback.
AI-assisted deep planning with research, interview, external LLM review, and TDD approach
TDD-validated implementation planning with plan review quality gate (2 skills, 5 agents, 1 command) - write plans, validate against codebase reality before execution
Language-agnostic test plan generation and execution.
Plan iron, verify real. Ironclad planning with independent verification chain. Turns any input into a bulletproof plan, executes with TDD, verifies with independent agents.
Software engineering skills from Code Complete and A Philosophy of Software Design. 20 skills across 3 agents (build, post-gate, debug). Building workflow with adaptive gates (BUILD, REVIEW, commit). Scientific debugging via debug-agent.