From iclr-skills
Strengthens reproducibility for ICLR papers: maps claims to seeds, splits, commands, and compute; writes reproducibility statements and addresses reviewer concerns about verifiability.
How this skill is triggered — by the user, by Claude, or both
Slash command
/iclr-skills:iclr-reproducibilityThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Use this when the paper's main claims depend on experiments, simulations, data processing, human
Use this when the paper's main claims depend on experiments, simulations, data processing, human subjects, or benchmark comparisons. ICLR reviewers are asked to evaluate rigor and reproducibility, not just headline scores.
ICLR has long pushed reproducibility statements and code release as community norms. Treat the statement as a public contract: it sits beside the paper permanently, so reviewers and later readers will hold you to it. Map every claim to something checkable.
| Claim element | What the statement should pin | Reviewer doubt it removes |
|---|---|---|
| Headline number | Seed set, split, exact command | "Did they tune on test?" |
| Architecture detail | Config file in the supplement | "Hidden trick not in the text" |
| Compute cost | Hardware and FLOPs, train vs inference | "Only works at huge scale" |
| Data pipeline | Preprocessing script and license | "Leakage between splits" |
A representation-learning paper reports an embedding that improves retrieval. The reproducibility
statement maps the headline metric to eval_retrieval.py --seed {0..4} --split test, names the
frozen-encoder protocol, links the anonymized config, and reports per-seed variance. When a reviewer
asks whether the gain survives a different split, the authors point to the appendix table already
covering it. The verifiable mapping turns a potential "fragile" grade into "adequate" without new runs.
[Reproducibility grade] strong / adequate / fragile / not reviewable
[Claim-to-evidence map] <claim -> table/figure/appendix/artifact>
[Missing controls] <seeds, baselines, ablations, leakage checks>
[Compute disclosure] complete / incomplete
[Priority fixes] <smallest changes that improve review confidence>
npx claudepluginhub brycewang-stanford/awesome-journal-skills --plugin iclr-skillsStrengthens ICML reproducibility evidence: code/data availability, random seeds, compute disclosure, appendix evidence, and reviewer-facing claims.
Audits ICLR experiments for scientific rigor: baselines, ablations, scaling laws, robustness, statistics, benchmarks, human evaluation, and compute reporting. Helps pre-answer reviewer questions and isolate mechanisms with compute-matched controls.
Assists with NeurIPS reproducibility: aligns Paper Checklist with the paper, writes code/data instructions, sets seed/compute disclosure, and decides MLRC vs. main track.