From experiment-planning-suite
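Used in P4 to design the minimum-delivery baseline, the publication-grade strong baseline, ablation experiments, metric thresholds, and stop conditions. Not for actually running experiments.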
Install:
npx claudepluginhub kkunkunya/experiment-planning-suite --plugin experiment-planning-suite
Slash command:
/experiment-planning-suite:baseline-ablation-designer
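This skill refines the claim-evidence map into implementable baselines, ablations, metrics, and stop conditions. It serves the paper strategy: the evidence must be sufficient to support the current paper level, and the plan must also make clear how to upgrade or downgrade that level.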
/baseline-ablation-designer(
claim_evidence_map,
target_level?,
project_resources?,
output_root = knowledge/experiment-plan/
) -> baseline_ablation_plan
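Each main experiment gets two layers of baseline:
| Layer | Use |
|---|---|
| minimum-delivery baseline | The minimum control that undergraduate, master's, coursework, or ordinary deliverables must be able to compare against. |
| publication-grade strong baseline | The strong control required for submission to Chinese core journals, SCI journals, or conferences; prefer SOTA, recent strong baselines, and classic methods in the field. |
If resources are insufficient to run the strong baseline, state this explicitly: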
strong_baseline_status: missing|resource-blocked|verifier_todo|planned
paper_level_impact: <which level this prevents>
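The status template above can be checked mechanically. The following is a minimal sketch, not part of the skill itself: the `StrongBaselineRecord` class, its `experiment` field, and the `validate` helper are hypothetical names introduced for illustration, and the rule that `paper_level_impact` is always required is an assumption.

```python
from dataclasses import dataclass
from typing import List, Optional

# Status values copied from the template above.
ALLOWED_STATUS = {"missing", "resource-blocked", "verifier_todo", "planned"}


@dataclass
class StrongBaselineRecord:
    experiment: str                      # hypothetical: which main experiment this covers
    strong_baseline_status: str          # one of ALLOWED_STATUS
    paper_level_impact: Optional[str] = None  # which paper level this prevents


def validate(record: StrongBaselineRecord) -> List[str]:
    """Return a list of problems; an empty list means the record is well-formed."""
    problems = []
    if record.strong_baseline_status not in ALLOWED_STATUS:
        problems.append(f"unknown status: {record.strong_baseline_status!r}")
    # Assumption: the impact field must always be filled in.
    if not record.paper_level_impact:
        problems.append("paper_level_impact must name the level this prevents")
    return problems
```

A record such as `StrongBaselineRecord("exp1", "resource-blocked", "SCI")` passes, while an unknown status or an empty impact field is reported as a problem.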
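Each innovation point maps to at least one "switch off / replace / downgrade" experiment.
An ablation must not exist just to fill out a table; it must state which mechanism or contribution it demonstrates.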
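One way to enforce the ablation coverage rule is a small check over the plan. This is a sketch under assumptions: the dictionary keys `innovation`, `kind`, and `proves`, and the function name `uncovered_innovations`, are hypothetical and do not come from the skill.

```python
from typing import Dict, List

# The three ablation styles named in the rule: switch off, replace, downgrade.
ABLATION_KINDS = {"off", "replace", "downgrade"}


def uncovered_innovations(innovations: List[str],
                          ablations: List[Dict[str, str]]) -> List[str]:
    """Return innovation points that lack a valid ablation.

    An ablation counts only if its kind is one of ABLATION_KINDS and it
    names the mechanism or contribution it proves (non-empty 'proves').
    """
    covered = set()
    for ablation in ablations:
        kind_ok = ablation.get("kind") in ABLATION_KINDS
        proves_ok = bool((ablation.get("proves") or "").strip())
        if kind_ok and proves_ok:
            covered.add(ablation.get("innovation"))
    # Preserve the input order so the report is easy to read.
    return [name for name in innovations if name not in covered]
```

An ablation entry with an empty `proves` field is treated as table padding and does not count toward coverage.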
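Choose metrics according to the paper type.
Each metric must be written out explicitly, including its threshold.
To prevent endless hyperparameter tuning, every high-cost experiment has at least one stop condition: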
stop_when:
- repeated_attempts_reach_limit
- metric_plateau
- baseline_not_reproduced
- resource_not_available
- claim_no_longer_supported
on_stop: salvage|ask_user|downgrade_level|resource_acquisition
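Two of the stop conditions above, `repeated_attempts_reach_limit` and `metric_plateau`, lend themselves to an automatic check. The sketch below is illustrative only: the function `check_stop` and its parameters `max_attempts`, `patience`, and `min_delta` are hypothetical, and it assumes a metric where higher is better.

```python
from typing import List, Optional


def check_stop(attempts: int,
               max_attempts: int,
               metric_history: List[float],
               patience: int = 3,
               min_delta: float = 1e-3) -> Optional[str]:
    """Return the name of the first triggered stop condition, or None.

    Assumes a higher metric is better. A plateau means the best value in
    the last `patience` runs improved on the earlier best by less than
    `min_delta`.
    """
    if attempts >= max_attempts:
        return "repeated_attempts_reach_limit"
    if len(metric_history) > patience:
        best_before = max(metric_history[:-patience])
        best_recent = max(metric_history[-patience:])
        if best_recent - best_before < min_delta:
            return "metric_plateau"
    return None
```

The remaining conditions (`baseline_not_reproduced`, `resource_not_available`, `claim_no_longer_supported`) depend on human or verifier judgment and are left out of this sketch; whatever the check returns would then be handled by the `on_stop` action.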
Write machine rules to:
knowledge/experiment-plan/machine/stop-conditions.yaml
Write or update:
knowledge/experiment-plan/baseline-ablation-plan.md
knowledge/experiment-plan/machine/stop-conditions.yaml
plugins/experiment-planning-suite/skills/baseline-ablation-designer/, bump the plugin version, and refresh the marketplace/cache.