From aer-skills
Use when selecting, implementing, or stress-testing the causal identification strategy for an empirical economics manuscript — difference-in-differences (including staggered designs), instrumental variables (including weak-IV-robust inference), regression discontinuity, synthetic control, or shift-share / Bartik. Apply before writing the introduction or results.
How this skill is triggered — by the user, by Claude, or both
Slash command
/aer-skills:aer-identificationThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
In AER-track empirical economics, **identification is the paper**. This skill routes among canonical designs, modern defaults, and referee-facing diagnostics.
In AER-track empirical economics, identification is the paper. This skill routes among canonical designs, modern defaults, and referee-facing diagnostics.
If the design is fragile, return to aer-topic-selection; writing cannot save it.
Is treatment assignment plausibly random conditional on observables?
├── Yes, by design (RCT, lottery) → run the RCT analysis; register PAP via AEA RCT Registry
└── No → identification must come from variation
├── Sharp threshold in a running variable → RDD (sharp or fuzzy)
├── Discrete policy change in some units, not others, over time → DiD
│ ├── Single treatment date → canonical 2×2 DiD
│ └── Staggered adoption → Callaway-Sant'Anna or Borusyak-Jaravel-Spiess
├── Endogenous regressor + plausibly exogenous shifter → IV
│ ├── Shifter × pre-existing exposure shares → shift-share / Bartik
│ └── Single instrument → weak-IV-robust inference if F < 50
├── One treated unit / aggregate intervention → synthetic control
└── None of the above → reconsider the question
Use TWFE if and only if:
Otherwise, TWFE produces biased and often sign-flipped estimates.
Do not use TWFE. Use one of:
csdid (Stata), did (R). Identifies group-time average treatment effects (ATT(g,t)); estimands are doubly robust; supports event-study aggregation.did_multiplegt.Required diagnostics:
A flat pre-trend is necessary but not sufficient. Report:
The first-stage F > 10 rule is obsolete. Modern conventions:
Use weakivtest (Stata), ivDiag (R), or the Olea-Pflueger effective F statistic.
The IV's credibility depends on a story, not a test. State the exclusion restriction in one sentence in the introduction and defend it with:
Two valid sources of identification, with very different implications:
Pick one explicitly. Do not hand-wave between the two.
rdrobust.rdplot with the binning method explicitly statedIf the paper uses a field experiment:
A common confusion: identification answers whether X causes Y; mechanism answers why. Mechanism evidence should not weaken the identification of the main effect. Run:
When working from the repo or plugin bundle, load only the relevant resource:
docs/methods-reference.mdtemplates/stata/03_main_did.do, templates/r/03_main_did.R, or templates/python/main_did.pyexamples/aer-exemplars.md and examples/modern-aer-exemplars.mdUse the methods reference before prose: it fixes the estimand, diagnostic, inference method, and citation that the manuscript must report.
Do not advance to robustness or writing until, for the chosen design, all are true:
Write the gate decision before routing onward:
STRATEGY: IV
FIRST STAGE: effective F = 7.8; 2SLS CI is not primary
ROBUST INFERENCE: AR 95% CI = [-0.14, 0.52]
PLACEBO: beta = 0.003 (p = 0.71)
DECISION: advance with directional headline only
STRATEGY: <DiD | IV | RDD | SCM | shift-share | RCT>
MODERN ESTIMATOR USED: <yes / no / which>
REQUIRED DIAGNOSTICS REPORTED: <list>
INFERENCE METHOD: <robust / cluster-robust / AR / wild bootstrap / permutation>
WEAK-IV / TWFE / POLY-ORDER RED FLAGS: <list or "none">
NEXT SKILL: aer-robustness
npx claudepluginhub brycewang-stanford/aer-skills --plugin aer-skillsStress-tests causal identification arguments (RCT, DiD, RD, IV, shift-share) for AEJ: Applied manuscripts, ensuring designs meet the journal's credibility bar before exhibits are finalized.
Stress-tests quasi-experimental identification strategies (DID, IV, RDD, DML) for economic research manuscripts against modern estimators and reporting standards.
Stress-tests causal identification designs (DiD, IV, RDD, experiment) for EER manuscripts, ensuring credibility before finalizing exhibits.