From awesome-cognitive-and-neuroscience-skills
Selects Theory of Mind tasks matched to population, age, and construct, providing developmental hierarchy, psychometrics, confounds, and research protocols.
npx claudepluginhub neuroaihub/awesome_cognitive_and_neuroscience_skills --plugin awesome-cognitive-and-neuroscience-skillsThis skill uses the workspace's default tool permissions.
This skill encodes expert knowledge for selecting, administering, and interpreting Theory of Mind (ToM) assessments. It provides a construct taxonomy, task selection decision trees, age-appropriate recommendations, psychometric properties, and guidance on confounds. A general-purpose programmer would not know which ToM tasks are appropriate for which populations, the developmental sequence of T...
Designs habituation, preferential-looking, and violation-of-expectation paradigms for infant studies with age-appropriate trial timings, habituation criteria, exclusion standards, and coding benchmarks.
Diagnoses thinking failures and maintains process-sovereignty by auditing cognitive orientations and operations balance. Useful when reasoning feels stuck, circular, conclusions defended, or evidence explained away.
Designs critical thinking tasks targeting specific skills like evaluating evidence, identifying bias, or analyzing arguments for embedding into subject lessons.
Share bugs, ideas, or general feedback.
This skill encodes expert knowledge for selecting, administering, and interpreting Theory of Mind (ToM) assessments. It provides a construct taxonomy, task selection decision trees, age-appropriate recommendations, psychometric properties, and guidance on confounds. A general-purpose programmer would not know which ToM tasks are appropriate for which populations, the developmental sequence of ToM abilities, or the psychometric limitations of common measures.
Before executing the domain-specific steps below, you MUST:
For detailed methodology guidance, see the research-literacy skill.
This skill was generated by AI from academic literature. All parameters, thresholds, and citations require independent verification before use in research. If you find errors, please open an issue.
ToM develops in a predictable sequence (Wellman & Liu, 2004). Tasks should be matched to the expected level:
| Level | Construct | Age of Emergence | Key Task | Source |
|---|---|---|---|---|
| 1 | Diverse desires | ~3 years | Diverse desires task | Wellman & Liu, 2004 |
| 2 | Diverse beliefs | ~3-4 years | Diverse beliefs task | Wellman & Liu, 2004 |
| 3 | Knowledge access | ~4 years | Knowledge access task | Wellman & Liu, 2004 |
| 4 | First-order false belief | ~4-5 years | Sally-Anne (Wimmer & Perner, 1983) | Wellman et al., 2001 |
| 5 | Hidden emotion | ~5-6 years | Appearance-reality emotion task | Wellman & Liu, 2004 |
| 6 | Second-order false belief | ~6-7 years | Ice-cream van task | Perner & Wimmer, 1985 |
| 7 | Faux pas recognition | ~9-11 years | Faux pas stories | Baron-Cohen et al., 1999 |
| 8 | Advanced/adult ToM | Adolescence-adult | Strange Stories, RMET | Happe, 1994; Baron-Cohen et al., 2001 |
| Dimension | Description | Example Tasks |
|---|---|---|
| Belief attribution | Understanding others' beliefs, especially false beliefs | Sally-Anne, unexpected contents |
| Desire attribution | Understanding others' desires differ from one's own | Diverse desires task |
| Intention attribution | Understanding goal-directed action and intentionality | Intentional vs. accidental actions |
| Emotion attribution | Understanding others' emotions from context/cues | Hidden emotion, RMET |
| Visual perspective-taking | Level 1: what others see; Level 2: how others see it | Director task, Flavell tasks |
| Implicit/spontaneous ToM | Automatic, non-verbal ToM processing | Anticipatory looking, VoE paradigms |
What is the participant's age?
|
+-- Infants (6-24 months)
| --> Implicit ToM tasks only
| --> Anticipatory looking (Southgate et al., 2007)
| --> Violation-of-expectation (Onishi & Baillargeon, 2005)
|
+-- Preschoolers (3-5 years)
| --> Wellman & Liu (2004) scale (5 tasks)
| --> Sally-Anne / Change of location (Wimmer & Perner, 1983)
| --> Unexpected contents / Smarties task (Gopnik & Astington, 1988)
|
+-- School-age (6-12 years)
| --> Second-order false belief (Perner & Wimmer, 1985)
| --> Faux pas stories (Baron-Cohen et al., 1999)
| --> Strange Stories (Happe, 1994) -- simplified versions
|
+-- Adolescents and Adults
--> Strange Stories (Happe, 1994)
--> RMET (Baron-Cohen et al., 2001)
--> Director task (Keysar et al., 2003)
--> Faux pas test (Baron-Cohen et al., 1999)
--> Movie for the Assessment of Social Cognition (MASC; Dziobek et al., 2006)
What is the target population?
|
+-- Typically developing children
| --> Wellman & Liu (2004) scale (most validated)
| --> Standard false belief tasks
|
+-- Autism spectrum (children)
| --> Sally-Anne (Baron-Cohen et al., 1985)
| --> Unexpected contents (Perner et al., 1989)
| --> Happe Strange Stories (if verbal)
| |
| NOTE: Many autistic individuals pass standard false
| belief tasks by age 6-8. Use advanced tasks to
| avoid ceiling effects (Happe, 1994).
|
+-- Autism spectrum (adults)
| --> RMET (Baron-Cohen et al., 2001)
| --> Faux pas test (Baron-Cohen et al., 1999)
| --> MASC (Dziobek et al., 2006)
| --> Director task (Keysar et al., 2003)
|
+-- Brain injury / neurological
| --> Faux pas test (Stone et al., 1998)
| --> Strange Stories (Happe, 1994)
| --> RMET (Baron-Cohen et al., 2001)
| --> Yoni task (Shamay-Tsoory & Aharon-Peretz, 2007)
|
+-- Aging / dementia
--> Faux pas test (Gregory et al., 2002)
--> RMET (Baron-Cohen et al., 2001)
--> Strange Stories (Happe, 1994)
--> Note: control for processing speed and working memory
What ToM construct are you targeting?
|
+-- Belief attribution
| --> False belief tasks (Sally-Anne, unexpected contents)
| --> Second-order false belief
|
+-- Emotion recognition
| --> RMET (Baron-Cohen et al., 2001)
| --> Cambridge Mindreading Face-Voice Battery
|
+-- Social reasoning / pragmatics
| --> Faux pas test
| --> Strange Stories
|
+-- Visual perspective-taking
| --> Director task (Keysar et al., 2003)
| --> Flavell Level 1/2 tasks
|
+-- Implicit / spontaneous ToM
--> Anticipatory looking paradigms
--> Dot-perspective task (Samson et al., 2010)
| Property | Value | Source |
|---|---|---|
| Original citation | Wimmer & Perner, 1983; Baron-Cohen et al., 1985 | |
| Age range | 3-6 years (standard); used in ASD at any age | Wellman et al., 2001 |
| Administration | Acted out with dolls/puppets or illustrated story | Baron-Cohen et al., 1985 |
| Test question | "Where will Sally look for her marble?" | |
| Control questions | Reality question + memory question (must pass both) | Baron-Cohen et al., 1985 |
| Scoring | Pass/fail (binary) | |
| Passing criterion | Correct test question + both control questions | Baron-Cohen et al., 1985 |
| Typical passing rates | ~20% at 3 years, ~50% at 4 years, ~90% by 5-6 years | Wellman et al., 2001 |
| Limitations | Ceiling by age 6; binary scoring limits sensitivity | Wellman et al., 2001 |
| Property | Value | Source |
|---|---|---|
| Original citation | Gopnik & Astington, 1988; Perner et al., 1987 | |
| Age range | 3-6 years | Gopnik & Astington, 1988 |
| Administration | Show container (e.g., Smarties box) with unexpected contents (e.g., pencils) | |
| Test question | "What will [name] think is in the box?" (other's belief) | |
| Self question | "What did you think was in the box before I opened it?" (own prior belief) | |
| Scoring | Pass: predicts other will say "Smarties" (or typical contents) |
| Property | Value | Source |
|---|---|---|
| Original citation | Perner & Wimmer, 1985 | |
| Age range | 6-9 years | Perner & Wimmer, 1985 |
| Construct | "She thinks that he thinks that..." | |
| Administration | Story scenario (ice-cream van paradigm) | Perner & Wimmer, 1985 |
| Test question | "Where does Mary think John will go to buy ice cream?" | |
| Passing rates | ~10% at 5 years, ~50% at 7 years, ~90% by 9 years | Perner & Wimmer, 1985 |
| Comprehension questions | 2-3 memory/comprehension checks required | Standard practice |
| Property | Value | Source |
|---|---|---|
| Original citation | Baron-Cohen et al., 2001 | |
| Version | Revised version (2001) -- 36 items | Baron-Cohen et al., 2001 |
| Age range | Adults (16+ years); child version available (28 items) | Baron-Cohen et al., 2001 |
| Administration | Forced-choice: pick 1 of 4 mental state words matching eye region photo | |
| Scoring | Total correct out of 36 (adults) or 28 (children) | |
| Adult norms | Mean ~ 26.2 (SD ~ 3.6) in typical adults | Baron-Cohen et al., 2001 |
| ASD norms | Mean ~ 21.9 (SD ~ 6.6) in autistic adults | Baron-Cohen et al., 2001 |
| Reliability | Internal consistency: Cronbach's alpha ~ 0.60-0.70 (modest) | Olderbak et al., 2015 |
| Limitations | Low reliability, possible confound with emotion recognition vs. ToM per se | Olderbak et al., 2015 |
| Property | Value | Source |
|---|---|---|
| Original citation | Baron-Cohen et al., 1999 | |
| Age range | 9 years to adult | Baron-Cohen et al., 1999 |
| Administration | Read 10 faux pas stories + 10 control stories | |
| Questions per story | Detection ("Did someone say something awkward?"), identification, belief, empathy | Baron-Cohen et al., 1999 |
| Scoring | 0-2 points per question; max 60 for faux pas stories | Baron-Cohen et al., 1999 |
| Control stories | Must also score comprehension questions for controls | |
| Sensitivity | Good for detecting subtle ToM deficits in ASD, right hemisphere lesions, frontotemporal dementia | Stone et al., 1998; Gregory et al., 2002 |
| Property | Value | Source |
|---|---|---|
| Original citation | Happe, 1994 | |
| Construct | Advanced ToM: irony, white lie, double bluff, misunderstanding, persuasion, appearance/reality, figure of speech, sarcasm, forgetting, contrary emotions | |
| Administration | Read vignettes, open-ended question: "Why did X say that?" | |
| Scoring | 0 (incorrect), 1 (partial), 2 (full mental state reference) | Happe, 1994 |
| Number of stories | 8-16 ToM stories + physical control stories | Happe, 1994 |
| Age range | Children (8+) and adults | Happe, 1994 |
| Reliability | Inter-rater reliability for scoring: kappa > 0.85 recommended | Happe, 1994 |
| Property | Value | Source |
|---|---|---|
| Original citation | Keysar et al., 2003 | |
| Construct | Level 2 perspective-taking under communicative demand | |
| Administration | Grid of objects; director (behind grid) instructs participant to move objects; some slots occluded from director's view | |
| Measure | Eye movements (egocentric intrusions), accuracy, RT | Keysar et al., 2003 |
| Key finding | Even adults show egocentric errors on ~30-50% of critical trials | Keysar et al., 2003 |
| Age range | 7 years to adult | Dumontheil et al., 2010 |
See references/task-database.md for the full task list with administration protocols.
| Task | Internal Consistency | Test-Retest | Source |
|---|---|---|---|
| Sally-Anne (single item) | N/A (binary) | Variable | Wellman et al., 2001 |
| Wellman & Liu Scale | Guttman scalability > 0.90 | Moderate | Wellman & Liu, 2004 |
| RMET | alpha ~ 0.60-0.70 | r ~ 0.63-0.83 | Olderbak et al., 2015; Fernandez-Abascal et al., 2013 |
| Faux pas test | alpha ~ 0.70-0.80 | Not well-established | Baron-Cohen et al., 1999 |
| Strange Stories | Inter-rater: kappa > 0.85 | Moderate | Happe, 1994 |
| MASC | alpha ~ 0.70 | Adequate | Dziobek et al., 2006 |
| Confound | Impact | Mitigation | Source |
|---|---|---|---|
| Verbal demands | False belief tasks require comprehension of complex sentences | Include vocabulary/language control measure | Milligan et al., 2007 |
| Narrative complexity | Second-order tasks have heavy memory load | Add comprehension check questions | Perner & Wimmer, 1985 |
| Word knowledge (RMET) | Vocabulary confound in forced-choice emotion labels | Control for verbal IQ | Olderbak et al., 2015 |
| Confound | Impact | Mitigation | Source |
|---|---|---|---|
| Inhibitory control | Must inhibit own knowledge to attribute false belief | Include inhibition measure (e.g., Stroop, day-night) | Carlson & Moses, 2001 |
| Working memory | Must hold multiple perspectives simultaneously | Control for WM span | Carlson & Moses, 2001 |
| Cognitive flexibility | Must switch between self and other perspective | Include set-shifting measure | Carlson & Moses, 2001 |
For any ToM study, include at minimum:
| Population | Recommended Battery | Rationale |
|---|---|---|
| Preschool (3-5y) | Wellman & Liu Scale (5 tasks) + diverse desires + diverse beliefs | Guttman-scalable, captures developmental progression (Wellman & Liu, 2004) |
| School-age (6-12y) | First-order FB + second-order FB + faux pas + Strange Stories subset | Spans implicit to advanced ToM |
| ASD (children) | Sally-Anne + unexpected contents + Strange Stories (simplified) | Avoids ceiling; includes advanced items |
| ASD (adults) | RMET + faux pas + MASC + Director task | Multiple constructs; includes real-time and reflective tasks |
| Neurological (adults) | Faux pas + Strange Stories + RMET | Sensitive to frontal and right hemisphere lesions (Stone et al., 1998) |
| Aging research | Faux pas + RMET + Strange Stories | Control for processing speed; established aging norms |
If time is limited, prioritize:
See references/ for the full task database with administration protocols and scoring rubrics.