Help us improve
Share bugs, ideas, or general feedback.
From jm-adk
Designs and reviews A/B tests with explicit hypothesis, primary metric, guardrail metrics, variants, sample-size assumptions, duration, stopping rules, instrumentation checks, and decision criteria. [EXPLICIT] Trigger: "ab testing, a/b test, experiment design, split test, hypothesis formulation, statistical significance, sample size calculation, test duration"
npx claudepluginhub javimontano/jm-adk-alfaHow this skill is triggered — by the user, by Claude, or both
Slash command
/jm-adk:ab-testingThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
> "Method over hacks. Evidence over assumption."
README.mdagents/guardian.mdagents/lead.mdagents/specialist.mdagents/support.mdassets/README.mdassets/deliverable-checklist.mdassets/manifest.jsonevals/evals.jsonexamples/example-input.mdexamples/example-output.mdknowledge/body-of-knowledge.mdknowledge/knowledge-graph.jsonknowledge/knowledge-graph.mdprompts/meta.mdprompts/primary.mdprompts/variations/deep.mdprompts/variations/quick.mdtemplates/output.docx.mdtemplates/output.htmlApplies C++ Core Guidelines to write, review, or refactor C++ code. Enforces modern, safe, and idiomatic practices for C++17/20/23.
Share bugs, ideas, or general feedback.
"Method over hacks. Evidence over assumption."
Designs or audits an A/B test so a team can decide whether to run, fix, stop, or interpret an experiment without confusing speed with evidence. [EXPLICIT] The skill must make the hypothesis, metric contract, assumptions, sample-size needs, duration, instrumentation, risks, and decision rule explicit. [EXPLICIT]
| Anti-Pattern | Why It's Bad | Do This Instead |
|---|---|---|
| Testing without a decision rule | Produces data but no decision | Define win, loss, inconclusive, and guardrail-failure actions before launch |
| Optimizing many primary metrics | Inflates false positives and weakens accountability | Choose one primary metric and separate guardrails |
| Peeking and stopping early | Makes confidence claims unreliable | Define monitoring and stopping policy before launch |
| Missing instrumentation checks | Invalidates results after traffic is spent | Verify events, exposure logging, and sample ratio before analysis |
| Treating significance as business value | A statistically detectable lift may be too small to matter | Include MDE and practical impact threshold |
analytics-eventsfunnel-analyticsconversion-optimizationdata-validationexperimentation-strategyExample invocations:
| Scenario | Handling |
|---|---|
| Empty or minimal input | Request clarification before proceeding |
| Conflicting requirements | Flag conflicts explicitly, propose resolution |
| Out-of-scope request | Redirect to appropriate skill or escalate |
assets/deliverable-checklist.md provides the reusable checklist for final deliverable and guardian review.