From Sipcode
Runs the published 20-task Sipcode benchmark on the user's machine to measure median savings (target 62.6%). Reproducible, locked corpus for verifying headline claims.
How this skill is triggered — by the user, by Claude, or both
Slash command
/sipcode:benchmarkThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Tell the user to run `npx sipcode benchmark` in their terminal (this is a CLI command, not an MCP tool — the benchmark requires file I/O outside the MCP scope). It takes ~90 seconds.
Tell the user to run npx sipcode benchmark in their terminal (this is a CLI command, not an MCP tool — the benchmark requires file I/O outside the MCP scope). It takes ~90 seconds.
If they want a quick smoke (3 tasks, ~15s), tell them to run npx sipcode benchmark --quick.
When the user reports back the output:
benchmark/METHODOLOGY.md for reproducibility.If the user's measured savings differ significantly from the 62.6% median, that's interesting — surface it as a real workload signal, not a bug.
npx claudepluginhub anuj7411/sipcode --plugin sipcodeCreates and runs reliable benchmarks to measure code change impacts on performance, including latency, throughput. Supports Node.js (vitest, tinybench), Python (pytest-benchmark), frontend (Lighthouse CI), with warmup, stats.
Runs a side-by-side benchmark comparing WOZCODE vs vanilla Claude Code on a user's codebase, measuring cost, turn count, and time savings.
Autonomously optimizes code performance using CodSpeed benchmarks, flamegraph analysis, and iterative improvement. Activates on optimization requests, slow functions, or regression mentions.