Help us improve
Share bugs, ideas, or general feedback.
From research
Multi-round ChatGPT Deep Research pipeline with drill, evaluate, and progression tracking for broad literature exploration.
npx claudepluginhub ohdearquant/lionagi --plugin researchHow this skill is triggered — by the user, by Claude, or both
Slash command
/research:progress-researchThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Multi-round ChatGPT Deep Research pipeline with quality gates. Each round: drill -> refine -> fire -> evaluate -> record.
Runs structured multi-step web research with source synthesis, citations, skeptical evaluation, and confidence/gap analysis. Supports native and dense/frontier modes.
Executes multi-agent research pipeline on any topic with Scout, Investigators, Deep Diver, Verifier, Synthesizer, and Critic reviews to produce verified, sourced reports.
Conducts AI-powered deep research on any topic via triggers like '/deep-research [topic]' or 'deep research on [topic]'. Uses interactive AskUserQuestion for focus, output, and audience selection.
Share bugs, ideas, or general feedback.
Multi-round ChatGPT Deep Research pipeline with quality gates. Each round: drill -> refine -> fire -> evaluate -> record.
Use ChatGPT Deep Research as a parallel breadth-exploration engine, with Claude (Opus) handling synthesis, validation, and decision-making. ChatGPT drills deeper on literature and theory (50+ min per query); Claude cross-references, catches hallucinations, and decides what's tradable/buildable.
Use these phases when beginning a fresh topic with no prior rounds. Skip to Phase 0 if rounds already exist.
Goal: Map the landscape. Get 8-10 concrete research directions.
Context auto-population before generating the mega-prompt:
# From memory
memory.recall("current capital positions portfolio", limit=3)
memory.recall("{topic} prior research findings", limit=5)
# From task queue
work.tasks(assignee="lambda:backtesting", limit=5)
# From project state — read if they exist
# Read RESEARCH_LOG.md, RESEARCH_CATALOG.md
Generate the R1 mega-prompt:
## Context (auto-populated)
- Available capital: [from task queue / memory]
- Platforms: [Kalshi, KuCoin, Coinbase — from project state]
- Data assets: [what data we already have]
- Constraints: [fees, regulatory, technical]
- Academic framework: [relevant theory we already know]
## Request
Given the above context, generate 8-10 specific, CONCRETE research directions for: {TOPIC}
For each direction:
1. One-sentence thesis (falsifiable)
2. Required data (do we have it or need to acquire?)
3. Expected edge mechanism (WHY would this work?)
4. Fatal flaw check (what kills this idea?)
5. Tractability × Impact × Novelty score (1-10 each)
Rank by total score. Flag any direction where fatal flaw is confirmed.
DO NOT give me theoretical curiosities. Every direction must terminate in either:
(a) a tradeable strategy, (b) a buildable product, or (c) an explicit "this cannot work because X"
Fire: Open ChatGPT Pro in browser, send mega-prompt.
Save: .khive/workspaces/{date}/chatgpt-research/{topic}/round1_mega_prompt.md and round1_output.md
Goal: Deep-drill into each surviving direction from R1.
Claude curates R1 output:
Prompt template per direction:
## Context
[Same context block as R1, plus R1 findings for this direction]
## Deep Dive: {Direction Name}
Research this specific direction in depth:
1. Literature: What academic papers cover this? (names, years, key results)
2. Prior art: Has anyone implemented this? (companies, funds, open source)
3. Mathematics: What is the pricing/valuation framework? (formulas, not just concepts)
4. Data requirements: Exactly what data, what frequency, what history depth?
5. Implementation: Pseudocode for the core algorithm
6. Risk model: What are the loss scenarios? Max drawdown estimate?
7. Fee impact: At [specific fee structure], what minimum edge is needed to be profitable?
8. Scale limits: At what capital level does this strategy break down?
Be SPECIFIC. I need numbers, not narratives.
Fire: Open ChatGPT Pro in browser, send all prompts in parallel tabs.
Save: .khive/workspaces/{date}/chatgpt-research/{topic}/round2_prompts/ and round2_outputs/
Goal: Cross-reference all R2 outputs. Decide which 2-3 directions to pursue.
Claude's checklist:
Output: round3_synthesis.md — 2-3 surviving directions with go/no-go decision and rationale. Then proceed to the Phase workflow below for R4+.
# 1. Read the handoff from last round
Read(".khive/workspaces/YYYYMMDD/HANDOFF_R{N+1}_PLANNING.md")
# 2. Check research progression state
Read(".khive/workspaces/research_progression/README.md")
wc -l .khive/workspaces/research_progression/*.md
# 3. Check what responses/evaluations exist
ls .khive/workspaces/YYYYMMDD/chatgpt-responses/
ls .khive/workspaces/YYYYMMDD/evaluations/
Present status table to Ocean. Ask what to do or proceed if standing orders say "keep the steam going."
For each prompt topic, launch an analyst (Opus) agent that:
Agent output: .khive/workspaces/YYYYMMDD/r{N}_drill/R{N}_XX_topic.md
Each drill output MUST contain:
# R{N}-{XX}: {Topic}
## Gaps Identified in Prior Round
[numbered list of corrections/gaps]
## Our Empirical Data
[real numbers from our data files, not agent guesses]
## Refined Prompt
[the actual prompt to copy-paste to ChatGPT Pro]
Mandatory checks per drill agent:
Group prompts into batches of 2-3 for ChatGPT Pro parallel processing:
| Batch | Priority | Criteria |
|---|---|---|
| Batch 1 | P0 (operational) | Directly improves current trading or resolves blockers |
| Batch 2 | P1 (research) | New topics with live data embedded |
| Batch 3 | P2 (academic) | Capstone/paper material, lower urgency |
For each prompt, specify:
Present to Ocean: "Ready for Batch 1. [N] prompts. Fire when ready."
When ChatGPT responses arrive, launch analyst (Opus) evaluators. One per response.
Each evaluator applies three checks:
Evaluator output: .khive/workspaces/YYYYMMDD/evaluations/eval_R{N}_XX_topic.md
Each evaluation MUST contain:
# Evaluation: R{N}-{XX} {Topic}
## Summary Verdict: [APPROVE / APPROVE-WITH-FIXES / REJECT]
## Rigor Check
## Daydream Check
## Numbers Check
## Citations Check (if academic)
## What's Usable
## What Needs Fixing
After ALL evaluators complete, compile a cross-response analysis:
Present to Ocean as a concise summary.
Launch ONE agent to update the progression folder:
{NN}_round_R{N}.md — structured analysis of all responses07_sources.md — add new citations with verification status08_corrections.md — add new corrections with severity and sourceREADME.md — update file table if neededWrite .khive/workspaces/YYYYMMDD/HANDOFF_R{N+1}_PLANNING.md:
□ Did ChatGPT cite a specific paper? → Verify it exists (WebSearch)
□ Did ChatGPT give a formula? → Re-derive from first principles
□ Did ChatGPT claim "this strategy returns X%"? → Backtest yourself
□ Did ChatGPT say "no competitors"? → WebSearch for prior art
□ Did ChatGPT give parameter values? → Sanity check against market data
□ Did all directions agree? → Suspicious. At least one should conflict.
.khive/workspaces/{date}/chatgpt-research/{topic}/
├── round1_mega_prompt.md
├── round1_output.md
├── round2_prompts/
│ ├── direction_01.md
│ └── ...
├── round2_outputs/
│ └── direction_01_output.md
├── round3_synthesis.md # Claude's synthesis (the critical step)
├── r{N}_drill/
│ └── R{N}_XX_topic.md
├── chatgpt-responses/
├── evaluations/
│ └── eval_R{N}_XX_topic.md
└── HANDOFF_R{N+1}_PLANNING.md
.khive/workspaces/research_progression/
├── README.md ← Index + validation guide
├── 01_round_R1.md ← Per-round analysis
├── ...
├── {NN}_round_R{N}.md
├── {NN+1}_own_computation.md ← Lambda's direct math (highest confidence)
├── {NN+2}_frameworks.md ← Theoretical + empirical
├── 07_sources.md ← Master citation list with verification status
└── 08_corrections.md ← All corrections, severity, source, impact
/progress-research "topic" → generates mega-prompt, saves to workspaceround1_output.md/progress-research --round 2 → generates 8 deep-dive prompts from outputround2_outputs//progress-research --synthesize → Claude does R3 synthesisFrom highest to lowest:
python or pip — always uv run