factor-researcher

Name: factor-researcher
Author: minihellboy


Mine, evaluate, backtest, benchmark, and report on quantitative alpha factors using the FactorMiner engine, with integrations to major financial data providers for market data ingestion.

npx claudepluginhub minihellboy/factorminer --plugin factor-researcher

Popularity

Stars: Top 10% (Med: 0 · Avg: 450)

Installs: Top 5% (Med: 0 · Avg: 2)

What's Inside

Slash Commands (7)

Backtest

/backtest

Combine factors into a composite signal and quintile-backtest it

Benchmark

/benchmark

Run a FactorMiner benchmark — Table 1, ablation, cost pressure, or suite

Evaluate

/evaluate

Evaluate a factor library's out-of-sample IC, ICIR, and decay

Mine

/mine

Mine a new alpha-factor library from a market dataset

Report

/report

Generate a research note, plots, and exports from a finished run

Agents (1)

factor-researcher

/factor-researcher

Runs systematic quantitative alpha-factor research end to end — validates a market dataset, mines a factor library with the FactorMiner engine, evaluates it (IC/ICIR/correlation), backtests a composite signal under transaction costs, benchmarks it against baselines, and packages a research note. Use when an analyst or PM asks to discover, evaluate, or stress-test predictive signals on a price/volume universe. Not for fundamental single-name work — use earnings-reviewer or model-builder for that.

Skills (6)

factor-backtest

/factor-backtest

Combine a factor library into a composite signal and quintile-backtest it under transaction costs — long-short return, monotonicity, turnover, and tearsheets. Use for the portfolio-level view that single-factor IC does not give. Triggers on "backtest", "composite signal", "combine factors", "long-short return", "portfolio", "quintile", "tearsheet", "transaction costs".
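A minimal sketch of a quintile backtest on a single cross-section, to make the terms above concrete. It is not the plugin's implementation: the function name, the equal-weight composite, and the toy data are all assumptions, and transaction costs are left out.

```python
from statistics import mean


def quintile_long_short(signal, fwd_returns, n_buckets=5):
    """Return (mean forward return per bucket, top-minus-bottom spread)."""
    # Sort asset indices by signal, lowest first.
    order = sorted(range(len(signal)), key=lambda i: signal[i])
    size = len(order) // n_buckets
    buckets = []
    for b in range(n_buckets):
        start = b * size
        # The last bucket absorbs any remainder.
        end = (b + 1) * size if b < n_buckets - 1 else len(order)
        buckets.append(mean([fwd_returns[i] for i in order[start:end]]))
    return buckets, buckets[-1] - buckets[0]


# Hypothetical composite: equal-weight average of two toy factor columns.
factor_a = [0.1, 0.4, 0.2, 0.9, 0.5, 0.7, 0.3, 0.8, 0.6, 1.0]
factor_b = [0.2, 0.3, 0.1, 1.0, 0.6, 0.8, 0.4, 0.7, 0.5, 0.9]
composite = [(a + b) / 2 for a, b in zip(factor_a, factor_b)]
fwd = [-0.02, 0.00, -0.01, 0.03, 0.01, 0.02, -0.005, 0.025, 0.015, 0.04]

buckets, long_short = quintile_long_short(composite, fwd)
# Monotonicity: bucket returns should rise from the lowest to the highest quintile.
monotonic = all(x <= y for x, y in zip(buckets, buckets[1:]))
print(buckets, long_short, monotonic)
```

A real backtest would repeat this per rebalance date and track turnover between dates.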

factor-benchmark

/factor-benchmark

Run FactorMiner benchmark workflows — the Table 1 Top-K freeze benchmark, memory and strategy ablations, transaction-cost pressure tests, and the full suite. Use to compare FactorMiner against baselines or to reproduce paper results. Triggers on "benchmark", "ablation", "compare to baseline", "reproduce table 1", "cost pressure", "benchmark suite".

factor-data

/factor-data

Validate, resample, and ingest market data for factor mining. Schema-checks OHLCV files (CSV/Parquet/HDF5), resamples bar frequencies, and pulls live data from external MCP connectors (FactSet, Daloopa, Morningstar). Use before any mining run. Triggers on "validate data", "check my dataset", "resample", "load market data", "fetch data", "ingest prices", "is this dataset usable".

factor-evaluation

/factor-evaluation

Evaluate a factor library — recompute Information Coefficient (IC), ICIR, win rate, and turnover on held-out data, and surface train→test decay. Use to judge how good a mined library actually is out of sample. Triggers on "evaluate factors", "compute IC", "how good is this library", "factor metrics", "ICIR", "is this factor overfit", "out-of-sample".
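For readers new to these metrics, here is a small sketch of how IC, ICIR, and win rate are commonly computed. It assumes a Spearman (rank) IC per date, ICIR as mean IC divided by its sample standard deviation, and win rate as the share of dates with positive IC. Whether FactorMiner uses exactly these definitions is not stated here, so treat the helper names and conventions as illustrative.

```python
from statistics import mean, stdev


def ranks(values):
    """1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    out = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            out[order[k]] = avg
        i = j + 1
    return out


def pearson(x, y):
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def rank_ic(factor, fwd_returns):
    """Spearman correlation between factor values and forward returns."""
    return pearson(ranks(factor), ranks(fwd_returns))


# Toy held-out data: one (factor values, forward returns) pair per date.
dates = [
    ([1, 2, 3, 4], [0.01, 0.02, 0.03, 0.04]),
    ([1, 2, 3, 4], [0.02, 0.01, 0.03, 0.04]),
    ([1, 2, 3, 4], [0.04, 0.03, 0.02, 0.01]),
]
ics = [rank_ic(f, r) for f, r in dates]
icir = mean(ics) / stdev(ics)
win_rate = sum(ic > 0 for ic in ics) / len(ics)
print(ics, icir, win_rate)
```

Train-to-test decay can then be read as the gap between the mean IC on the training window and the mean IC on the held-out window.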

factor-mining

/factor-mining

Discover alpha factors by running the FactorMiner research engine — the paper-faithful Ralph loop or the enhanced Helix loop (causal validation, regime conditioning, multi-specialist debate, canonicalization). Use to generate a new factor library from a validated dataset. Triggers on "mine factors", "discover factors", "run mining", "find alpha", "helix loop", "ralph loop", "build a factor library".

MCP Servers (12)

Stats

Version: 0.1.0

Language: Python

Stars: 70

Forks: 25

Copy clicks: 2

Maintenance: Good

License: MIT

Last Commit: May 21, 2026

Added: May 22, 2026


Available In

factorminer70

Safety Signals

Caution

External network access

Connects to servers outside your machine

Uses power tools

Uses Bash, Write, or Edit tools

README

FactorMiner

LLM-driven formulaic alpha mining with typed operators, structured memory, strict runtime recomputation, and a Phase 2 Helix research lane

FactorMiner is a research framework for discovering interpretable alpha factors from market data. It combines:

a typed DSL over OHLCV-style market features
an LLM-guided mining loop
structured experience memory
library admission and replacement based on predictive power and orthogonality
strict runtime recomputation for analysis and benchmark reporting
an extended Helix lane for Phase 2 retrieval, canonicalization, and post-admission validation
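To illustrate what admission and replacement "based on predictive power and orthogonality" could look like, here is a hedged sketch. The thresholds, the `Decision` type, and the single-clash replacement rule are assumptions made for this example, not FactorMiner's actual policy.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Decision:
    action: str                     # "admit", "replace", or "reject"
    replaces: Optional[str] = None  # name of the library factor being replaced


def decide(cand_ic, library_ic, corr_with_library,
           min_abs_ic=0.03, max_abs_corr=0.5):
    """Gate a candidate factor on strength first, then on redundancy.

    library_ic: {factor name: IC}; corr_with_library: {factor name: correlation
    between that factor's signal and the candidate's}. Thresholds are hypothetical.
    """
    if abs(cand_ic) < min_abs_ic:
        return Decision("reject")
    clashes = [n for n, c in corr_with_library.items() if abs(c) > max_abs_corr]
    if not clashes:
        return Decision("admit")
    # Assumed rule: a candidate may displace exactly one weaker, correlated factor.
    if len(clashes) == 1 and abs(cand_ic) > abs(library_ic[clashes[0]]):
        return Decision("replace", clashes[0])
    return Decision("reject")


library = {"f1": 0.04, "f2": 0.06}
print(decide(0.05, library, {"f1": 0.1, "f2": 0.2}))   # orthogonal and strong
print(decide(0.05, library, {"f1": 0.8, "f2": 0.2}))   # beats the factor it overlaps
print(decide(0.05, library, {"f1": 0.1, "f2": 0.9}))   # overlaps a stronger factor
```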

The implementation is based on the paper "FactorMiner: A Self-Evolving Agent with Skills and Experience Memory for Financial Alpha Discovery" (Wang et al., 2026) and extends it with a cleaner architecture layer and a broader research surface.

Repository Status

Current implementation focus:

canonical paper-style and research mining lanes
typed DSL operators for OHLCV-style factor formulas
110 paper factors shipped in the built-in catalog
runtime recomputation for analysis and benchmark reporting
CI-backed lint, test, package, CLI smoke, and benchmark-smoke checks

For live local counts, run:

uv run pytest --collect-only -q factorminer/tests
uv run python - <<'PY'
from pathlib import Path

# Count Python source files and their total lines under factorminer/.
files = sorted(Path("factorminer").rglob("*.py"))
lines = sum(len(p.read_text(errors="ignore").splitlines()) for p in files)
print(f"Python files: {len(files)}")
print(f"Python lines: {lines}")
PY

Primary execution surfaces:

RalphLoop: canonical paper-style mining loop
HelixLoop: Phase 2 research loop with optional retrieval and validation extensions
factorminer.benchmark.runtime: canonical benchmark runner
factorminer.architecture: canonical contracts, policies, stages, and services

Documentation Map

Architecture At A Glance

flowchart TD
    A["Market Data"] --> B["DatasetContract"]
    B --> C["Typed DSL + Operator Registry"]
    C --> D["Ralph / Helix Stage Pipeline"]
    D --> E["EvaluationKernel"]
    E --> F["FactorAdmissionService"]
    F --> G["FactorLibrary"]
    D --> H["MemoryPolicy"]
    H --> I["PromptContextBuilder"]
    I --> D
    G --> J["Runtime Analysis"]
    G --> K["Runtime Benchmarks"]
    H --> K
    B --> K

Two execution lanes share the same core contracts:

Lane	Purpose	Canonical loop	Typical use
Paper lane	strict, benchmark-facing mining	`RalphLoop`	reproducible paper-style runs, library freeze, runtime evaluation
Helix lane	extended research mode	`HelixLoop`	debate, KG retrieval, family-aware prompts, canonicalization, Phase 2 validation

Core Concepts

1. Typed factor DSL

Factors are formulas over the canonical feature set:

$open, $high, $low, $close, $volume, $amt, $vwap, $returns

The DSL is parsed into expression trees, executed through the operator registry, and recomputed on demand during analysis and benchmarks. Paper appendix operator names such as SignedPower, Med, Rsquare, Slope, Resi, Eq, Min2, Max2, TsDecay, and Scale are accepted by the parser.
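The parse-then-execute flow described above can be sketched in a few dozen lines. This is an illustration only: the call syntax `Op(arg, ...)`, the `Sub` and `Div` operator names, the element-wise meaning given to `Max2`, and the arity check standing in for typing are all assumptions, not the project's real grammar or registry.

```python
import re

# Hypothetical registry: operator name -> (arity, function over per-bar lists).
REGISTRY = {
    "Sub": (2, lambda a, b: [x - y for x, y in zip(a, b)]),
    "Div": (2, lambda a, b: [x / y for x, y in zip(a, b)]),
    "Max2": (2, lambda a, b: [max(x, y) for x, y in zip(a, b)]),
}
TOKEN = re.compile(r"\s*(\$\w+|[A-Za-z_]\w*|[(),])")


def tokenize(text):
    text, pos, tokens = text.strip(), 0, []
    while pos < len(text):
        match = TOKEN.match(text, pos)
        if match is None:
            raise ValueError(f"unexpected input: {text[pos:]!r}")
        tokens.append(match.group(1))
        pos = match.end()
    return tokens


def expect(tokens, symbol):
    if not tokens or tokens.pop(0) != symbol:
        raise ValueError(f"expected {symbol!r}")


def parse(tokens):
    """Build a tree: ('feature', name) or ('call', op, [child, ...])."""
    if not tokens:
        raise ValueError("unexpected end of formula")
    tok = tokens.pop(0)
    if tok.startswith("$"):
        return ("feature", tok)
    if tok not in REGISTRY:
        raise ValueError(f"unknown operator: {tok!r}")
    expect(tokens, "(")
    args = [parse(tokens)]
    while tokens and tokens[0] == ",":
        tokens.pop(0)
        args.append(parse(tokens))
    expect(tokens, ")")
    if len(args) != REGISTRY[tok][0]:
        raise ValueError(f"{tok} takes {REGISTRY[tok][0]} arguments")
    return ("call", tok, args)


def evaluate(node, data):
    """Walk the tree, reading features from data and applying registry functions."""
    if node[0] == "feature":
        return data[node[1]]
    _, op, args = node
    return REGISTRY[op][1](*(evaluate(arg, data) for arg in args))


def run(formula, data):
    tokens = tokenize(formula)
    tree = parse(tokens)
    if tokens:
        raise ValueError(f"trailing tokens: {tokens}")
    return evaluate(tree, data)


bars = {"$open": [10.0, 20.0], "$close": [11.0, 19.0],
        "$high": [12.0, 21.0], "$low": [10.0, 17.0]}
signal = run("Div(Sub($close, $open), Sub($high, $low))", bars)
print(signal)
```

Keeping the formula as text and the operators in a registry is what makes recomputation on a new dataset cheap: only `bars` changes.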

2. Memory-guided mining

Mining is not plain prompt-and-filter generation. The loop builds a structured retrieval signal from experience memory and library state, then uses it to steer candidate generation.

Supported memory policies:

paper
none
kg
family_aware
regime_aware

3. Strict runtime recomputation

Saved library metadata is not treated as the final source of truth for analysis. The evaluate, combine, visualize, and benchmark paths recompute factor signals from formulas on the supplied dataset.
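A sketch of the recompute-instead-of-trust pattern, under stated assumptions: the `FORMULAS` table is a hypothetical stand-in for the DSL evaluator, the saved entry layout is invented for the example, and a plain Pearson correlation stands in for whatever metric the real paths report.

```python
from statistics import mean


def pearson(x, y):
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical stand-in for the DSL evaluator: formula text -> signal function.
FORMULAS = {
    "Sub($close, $open)": lambda d: [c - o for c, o in zip(d["$close"], d["$open"])],
}


def recompute_entry(entry, dataset, fwd_returns):
    """Rebuild the signal from the formula; keep the stored score only for comparison."""
    signal = FORMULAS[entry["formula"]](dataset)
    return {
        "formula": entry["formula"],
        "stored_score": entry["score"],
        "recomputed_score": pearson(signal, fwd_returns),
    }


saved = {"formula": "Sub($close, $open)", "score": 0.5}  # metadata from an earlier run
dataset = {"$open": [10.0, 20.0, 30.0, 40.0], "$close": [11.0, 19.0, 32.0, 39.0]}
fwd = [0.01, -0.02, 0.03, 0.00]

result = recompute_entry(saved, dataset, fwd)
print(result)
```

The stored score survives only as a reference point; any report is built from `recomputed_score`, so stale or mismatched metadata cannot leak into results.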

4. Canonical benchmark surface

factorminer.benchmark.runtime is the canonical benchmark entry point. It supports:

Top-K freeze evaluation across universes
memory ablations
strategy-grid ablations over memory policy × dependence metric × backend
cost-pressure evaluation
operator and factor efficiency benchmarking
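As a rough illustration of cost-pressure evaluation, the sketch below sweeps a per-trade cost over a grid and reports the net return at each level. The linear cost model (turnover times cost), the grid, and the toy numbers are assumptions for the example; the benchmark runner's actual cost model is not described here.

```python
def net_return(gross, turnover, cost_bps):
    """Gross period return minus a linear trading cost (turnover x cost per unit)."""
    return gross - turnover * cost_bps / 10_000


gross, turnover = 0.012, 0.8      # toy per-period long-short return and turnover
grid_bps = [0, 5, 10, 20, 50]     # hypothetical cost levels in basis points

curve = {bps: net_return(gross, turnover, bps) for bps in grid_bps}
# Cost level at which the strategy's net return reaches zero.
breakeven_bps = gross / turnover * 10_000

for bps, net in curve.items():
    print(f"{bps:>3} bps -> net {net:.4f}")
print(f"breakeven at {breakeven_bps:.0f} bps")
```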

Canonical Runtime Flow


Similar Plugins

quant-trading

trading-strategy-backtester

quantitative-trading

llmquant-skills

alva

finlab-plugin
