From ijcai-skills
Audits IJCAI/IJCAI-ECAI experiments for baselines, ablations, statistical evidence, hyperparameters, compute, dataset handling, ethics, and reproducibility.
How this skill is triggered — by the user, by Claude, or both
Slash command
/ijcai-skills:ijcai-experimentsThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Use this before submission when the experimental story is not yet locked. IJCAI reviewers
Use this before submission when the experimental story is not yet locked. IJCAI reviewers can score novelty, correctness, clarity, significance, impact, presentation, ethics, and reproducibility.
IJCAI draws reviewers from symbolic AI, search, planning, constraint satisfaction, KR, multi-agent systems, game theory, ML, NLP, and vision, so the experimental section must read across subcommunities. Calibrate evidence to the claim type rather than copying an ML-only template.
| Contribution type | Decisive evidence | Common reject trigger |
|---|---|---|
| Search / planning | Coverage, anytime quality, expansion counts, time/memory cutoffs, per-domain breakdown | Single suite, no domain table, missing strong planner baseline |
| Constraint / SAT | Cactus plots, instances within timeout, solver versions | No virtual-best comparison |
| Multi-agent / game theory | Welfare/equilibrium metrics, agent-count scaling, seeds | Claims hold at one population size only |
| Learning method | Strong current baselines, core-mechanism ablations, variance | Cherry-picked seeds, weak baselines |
| Theory-plus-experiment | Experiments confirming the proven bound | Empirics outside the theorem's regime |
A submission proposes a learned heuristic for cost-optimal classical planning and reports a single aggregate "12% fewer expansions" number. Apply the decision rules:
[Experiment readiness] strong / adequate / weak
[Claim -> evidence map] <claim: section/table/figure>
[Missing baseline or ablation] <item>
[Reproducibility gaps] <hyperparameters/seeds/compute/data/code>
[Decision-critical next run] <one experiment>
npx claudepluginhub brycewang-stanford/awesome-journal-skills --plugin ijcai-skillsAudits AAAI experimental evidence including baselines, ablations, statistical significance, robustness, human evaluation, and reproducibility-checklist alignment for Phase-1 survival.
Strengthens IJCAI/ECAI reproducibility evidence using the official reviewer rubric. Maps contributions (algorithms, theory, datasets, experiments) to credible/convincing ratings and guides evidence drafting.
Audits ICLR experiments for scientific rigor: baselines, ablations, scaling laws, robustness, statistics, benchmarks, human evaluation, and compute reporting. Helps pre-answer reviewer questions and isolate mechanisms with compute-matched controls.