From review-of-educational-research-skills
Tests exhaustiveness, risk-of-bias, heterogeneity, and sensitivity for RER reviews or meta-analyses. Hardens synthesis against omission and fragility.
How this skill is triggered — by the user, by Claude, or both
Slash command
/review-of-educational-research-skills:revedres-comprehensiveness-and-balanceThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
- The corpus and framework exist, but you have not stress-tested coverage or robustness
RER asks for comprehensive reviews; the burden is on you to make exhaustiveness provable, not asserted.
revedres-literature-synthesis).The amateur move is vote-counting — tallying significant vs. null studies. The RER standard is to weigh evidence by what each study measures and how credibly.
A pooled effect is a claim, not a fact, until you show it is not an artifact.
| Probe | What it guards against |
|---|---|
| Heterogeneity (Q, I², τ²) | reporting one number for a mix of different effects |
| Moderator analysis | masking real variation the framework should explain |
| Publication-bias diagnostics (funnel, Egger, trim-and-fill, p-curve/selection models) | an inflated effect from missing null results |
| Sensitivity analysis (leave-one-out, influence, alternative models) | a result driven by one study or one modeling choice |
| Dependent-effects handling (multilevel / robust variance) | false precision from multiple effects per sample |
For a narrative synthesis, the analogues are: confidence in the body of evidence (e.g. a GRADE-style judgment), explicit handling of conflicting findings, and a sensitivity check on which conclusions survive dropping the weakest studies.
【Saturation】documented? Y/N — named-omission test passed? Y/N
【Grey lit / language】exclusions + bias implication stated? Y/N
【Risk of bias】appraised for all studies; emphasis credibility-weighted? Y/N
【Conflict handling】reconciled by estimand/design (not vote-count)? Y/N
【Heterogeneity】I²/τ² + moderators reported? Y/N (meta) | strength-of-evidence judged (narrative)
【Publication bias】funnel/Egger/trim-fill/p-curve run? Y/N
【Sensitivity】leave-one-out / alt models / dependent-effects model? Y/N
【Next step】→ revedres-tables-figures (PRISMA flow, forest/funnel, coding tables)
npx claudepluginhub brycewang-stanford/awesome-journal-skills --plugin review-of-educational-research-skillsMakes systematic reviews for RER auditable and re-runnable by enforcing PRISMA/MARS reporting, coding reliability quantification, and open materials sharing.
Appraises credibility of studies in a PoPS piece to ensure balanced treatment of competing camps, using criteria like power, replication status, and publication bias.
Audits an Annual Review of Psychology draft for even-handed coverage across labs, paradigms, and rival theories, weights evidence by credibility, and flags self-promotion in citations.