Field connection mapping and systematic ideation for method transfer
Maps statistical problems to analogous methods from physics, CS, economics, and biology for novel solution discovery.
npx claudepluginhub data-wise/scholarThis skill inherits all available tools. When active, it can use any tool Claude has access to.
Systematic framework for discovering statistical innovations through cross-field connections
Use this skill when: brainstorming new methods, seeking novel approaches to statistical problems, looking for inspiration from other fields (physics, CS, biology, economics), or wanting to apply techniques from one domain to another.
Many statistical breakthroughs originated elsewhere:
| Statistical Method | Origin Field | Transfer |
|---|---|---|
| MCMC | Physics (Metropolis) | Statistical computation |
| Boosting | Machine learning | Ensemble methods |
| Lasso | Signal processing | Sparse regression |
| Optimal transport | Mathematics | Distribution comparison |
| Neural networks | Neuroscience/CS | Flexible function estimation |
| Causal graphs | Philosophy/AI | Causal inference |
Problem in Statistics → Abstract Structure → Search Other Fields
↑ ↓
Validate/Adapt ←── Identify Analogues ←── Find Connections
| ML Method | Statistical Application | Transfer Opportunity |
|---|---|---|
| Double ML | Debiased mediation effects | Nuisance parameter estimation |
| Causal Forests | Heterogeneous mediation | Effect modification detection |
| Neural Networks | Flexible g-computation | Nonparametric mediation |
| VAEs | Latent mediator modeling | Measurement error correction |
| Transformers | Sequential mediation | Temporal pattern learning |
| GNNs | Network mediation | Spillover effect estimation |
# Double ML for mediation effect estimation
library(DoubleML)
# Estimate nuisance parameters with ML
estimate_dml_mediation <- function(Y, A, M, X) {
# First stage: E[M|A,X]
mediator_model <- cv.glmnet(cbind(A, X), M)
M_hat <- predict(mediator_model, cbind(A, X))
# Second stage: E[Y|A,M,X]
outcome_model <- cv.glmnet(cbind(A, M, X), Y)
# Debiased estimation
residuals_M <- M - M_hat
list(
direct = coef(outcome_model)["A"],
indirect_component = residuals_M
)
}
| Statistical Concept | Physics Analogue | Insight |
|---|---|---|
| Log-likelihood | Energy | MLE = minimum energy state |
| Posterior | Boltzmann distribution | Temperature = uncertainty |
| Regularization | Physical constraints | Penalties as forces |
| Entropy | Thermodynamic entropy | Information = disorder |
| Diffusion models | Brownian motion | Noise as generative process |
| MCMC | Molecular dynamics | Sampling as physical simulation |
Productive Questions:
| Algorithm Class | Statistical Application | Key Insight |
|---|---|---|
| Dynamic Programming | Sequential mediation | Bellman equation for path effects |
| Graph Algorithms | DAG analysis | d-separation via path finding |
| Approximation Algs | High-dim inference | Trade exactness for scalability |
| Online Learning | Sequential testing | Adaptive experiment design |
| Randomized Algs | Monte Carlo methods | Probabilistic computation |
# Dynamic programming for sequential mediation paths
compute_path_effects <- function(effect_matrix, n_mediators) {
# effect_matrix[i,j] = effect from node i to node j
n <- nrow(effect_matrix)
# Initialize path effects (like shortest path, but products)
path_effects <- matrix(0, n, n)
diag(path_effects) <- 1
# DP recurrence: path[i,j] = sum over k of path[i,k] * edge[k,j]
for (len in 1:n_mediators) {
for (i in 1:n) {
for (j in 1:n) {
for (k in 1:n) {
if (effect_matrix[k, j] != 0) {
path_effects[i, j] <- path_effects[i, j] +
path_effects[i, k] * effect_matrix[k, j]
}
}
}
}
}
path_effects
}
| Statistical Concept | CS Analogue | Insight |
|---|---|---|
| Estimation | Optimization | Different objectives, shared algorithms |
| Hypothesis testing | Decision theory | Error rates as costs |
| Model selection | Algorithm selection | Bias-variance as time-space |
| Bayesian updating | Online learning | Sequential information |
| Sufficient statistics | Data compression | Minimal representation |
| Concentration inequalities | PAC bounds | Finite-sample guarantees |
Productive Questions:
| Statistical Concept | Economics Analogue | Insight |
|---|---|---|
| Utility | Loss function | Preferences over outcomes |
| Equilibrium | MLE/Bayes | Optimal response |
| Game theory | Robust statistics | Adversarial settings |
| Mechanism design | Experimental design | Incentive-compatible elicitation |
| Instrumental variables | Market instruments | Exogenous variation |
| Regression discontinuity | Policy thresholds | Quasi-experiments |
Productive Questions:
| Biological System | Statistical Method | Research Opportunity |
|---|---|---|
| Gene regulatory networks | Causal DAGs | Network mediation methods |
| Mendelian randomization | Instrumental variables | Genetic instruments for mediators |
| Population genetics | Drift models | Selection effects on mediators |
| Systems biology | Structural equations | Multi-level mediation |
| Phylogenetics | Hierarchical models | Evolutionary mediation |
# Mendelian randomization for mediation
# Using genetic variants as instruments
mr_mediation <- function(snp, exposure, mediator, outcome) {
# Stage 1: SNP -> Exposure
gamma_A <- coef(lm(exposure ~ snp))["snp"]
# Stage 2: SNP -> Mediator (genetic effect on M)
gamma_M <- coef(lm(mediator ~ snp + exposure))["snp"]
# Stage 3: Instrument-based mediation
# Indirect via genetic pathway
iv_model <- ivreg(outcome ~ mediator + exposure | snp + exposure)
list(
genetic_effect_exposure = gamma_A,
genetic_effect_mediator = gamma_M,
iv_mediation_estimate = coef(iv_model)["mediator"] * gamma_M
)
}
| Statistical Concept | Biology Analogue | Insight |
|---|---|---|
| Genetic algorithms | Evolution | Optimization by selection |
| Phylogenetics | Hierarchical models | Tree-structured dependence |
| Gene networks | Graphical models | Conditional independence |
| Population dynamics | Time series | Growth and interaction |
| Mendelian randomization | Instrumental variables | Genetic instruments |
| Selection bias | Survivorship | Conditioning on survival |
Productive Questions:
| Statistical Concept | Math Analogue | Insight |
|---|---|---|
| Distributions | Measures | Abstract probability |
| Convergence | Topology | Modes of convergence |
| Sufficiency | Invariance | Group actions |
| Efficiency | Geometry | Information geometry |
| Optimal transport | Measure theory | Wasserstein distance |
| Kernel methods | Functional analysis | RKHS theory |
Productive Questions:
Break the statistical problem into abstract components:
Problem: "Estimate mediation effects with measurement error"
Components:
1. Causal structure (DAG with mediator)
2. Latent variable (true M vs observed M*)
3. Identification (what assumptions needed?)
4. Estimation (how to account for error?)
5. Inference (variance under misspecification?)
Identify the mathematical essence:
Abstract patterns in measurement error mediation:
- Signal + noise model
- Latent variable with proxy
- Product of uncertain quantities
- Attenuation toward null
For each abstract pattern, search analogues:
| Pattern | Field to Search | Possible Analogues |
|---|---|---|
| Signal + noise | Signal processing | Kalman filter, denoising |
| Latent variable | Factor analysis | EM algorithm, identifiability |
| Product of uncertainties | Physics | Error propagation, Heisenberg |
| Attenuation | Econometrics | Errors-in-variables, IV |
For each promising analogue:
Understand the source method deeply
Map to target domain
Identify the gap
Evaluation Criteria:
□ Does it solve a real problem?
□ Is it novel (not already done)?
□ Are assumptions reasonable?
□ Is it computationally feasible?
□ Can it be proven to work (theory)?
□ Does it work in practice (simulation)?
Source: Survey sampling (Horvitz-Thompson estimator) Target: Causal inference (propensity score weighting)
Transfer insight:
Innovation: Rosenbaum & Rubin (1983) - propensity score methods
Source: Basis pursuit in signal processing Target: Variable selection in regression
Transfer insight:
Innovation: Tibshirani (1996) - Lasso regression
Source: Missing data augmented IPW Target: Causal inference estimators
Transfer insight:
Innovation: Robins et al. - AIPW estimators
Source: Robust statistics (Hampel) Target: Semiparametric efficiency
Transfer insight:
Innovation: Bickel et al. - semiparametric theory
When you discover a promising connection:
## Connection: [Source Method] → [Target Application]
### Source Domain
- **Method**: [Name and citation]
- **Problem it solves**: [Description]
- **Key insight**: [Core idea]
- **Assumptions**: [What it requires]
### Target Domain
- **Problem**: [Statistical problem to solve]
- **Current approaches**: [Existing methods and limitations]
- **Gap**: [What's missing]
### Transfer Analysis
- **Structural correspondence**:
- [Source concept] ↔ [Target concept]
- [Source assumption] ↔ [Target assumption]
- **What transfers directly**: [List]
- **What needs modification**: [List]
- **What doesn't transfer**: [List]
### Proposed Innovation
- **Core idea**: [How to adapt]
- **Novel contribution**: [What's new]
- **Theoretical questions**: [What to prove]
- **Empirical questions**: [What to simulate]
### Feasibility Assessment
- [ ] Theoretically sound
- [ ] Computationally tractable
- [ ] Practically relevant
- [ ] Sufficiently novel
- [ ] Publishable venue: [Journal]
### Next Steps
1. [Immediate action]
2. [Follow-up]
3. [Validation approach]
| Source Field | Method/Concept | Target Application | Innovation Potential |
|---|---|---|---|
| ML | Double/debiased ML | Semiparametric mediation | High - removes regularization bias |
| ML | Causal forests | Heterogeneous effects | High - effect modification detection |
| Physics | Diffusion models | Distribution products | Medium - novel density estimation |
| Economics | Control functions | Endogenous mediators | High - relaxes assumptions |
| CS | Sketching algorithms | Large-scale mediation | Medium - computational gains |
| Biology | Network motifs | Mediation topology | Medium - pattern recognition |
# Transfer: Control functions from economics to mediation
# Relaxes sequential ignorability assumption
control_function_mediation <- function(Y, A, M, X, Z) {
# Z is instrument for A
# First stage: A on Z and X
stage1 <- lm(A ~ Z + X)
A_residual <- residuals(stage1)
# Second stage with control function
# Includes residual to correct for endogeneity
stage2 <- lm(M ~ A + X + A_residual)
# Third stage: outcome with control
stage3 <- lm(Y ~ A + M + X + A_residual)
list(
a_to_m = coef(stage2)["A"],
m_to_y = coef(stage3)["M"],
indirect = coef(stage2)["A"] * coef(stage3)["M"],
control_function_coef = coef(stage2)["A_residual"]
)
}
For any cross-disciplinary transfer, evaluate:
This skill works with:
Version: 1.0 Created: 2025-12-08 Domain: Research Innovation, Method Development
Creating algorithmic art using p5.js with seeded randomness and interactive parameter exploration. Use this when users request creating art using code, generative art, algorithmic art, flow fields, or particle systems. Create original algorithmic art rather than copying existing artists' work to avoid copyright violations.
Applies Anthropic's official brand colors and typography to any sort of artifact that may benefit from having Anthropic's look-and-feel. Use it when brand colors or style guidelines, visual formatting, or company design standards apply.
Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the user asks to create a poster, piece of art, design, or other static piece. Create original visual designs, never copying existing artists' work to avoid copyright violations.