Skill

deep-research

Leverages ChatGPT Pro Deep Research via gpt2agent for multi-source web synthesis. Two modes: light (30-120s) and heavy (5-30 min). Saves Markdown reports. Requires Pro quota.

OpenAI

ai-ml

automation

Popularity

Stars

Forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/gpt2agent:deep-research

User invocable

Model invocable

Inline context

Default effort

Tool Access

This skill is limited to the following tools:

BashReadWrite

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Calls `gpt2agent`'s `deep_research` / `deep_research_heavy` directly via

Supporting Files

bin/deep_research.pybin/quota.shbin/run.sh

SKILL.md

167 lines · ~1.9k tokens

Stats

LanguagePython

Stars12

Forks2

MaintenanceExcellent

Last CommitJun 19, 2026

Actions

View Source View Plugin View on GitHub View README

/deep-research — ChatGPT Pro Deep Research

Calls gpt2agent's deep_research / deep_research_heavy directly via pipx Python (bypasses MCP — works even before Claude Code session restart).

Preconditions (check once)

command -v gpt2agent >/dev/null || \
  echo "gpt2agent not installed; run: pipx install git+https://github.com/robotlearning123/gpt2agent.git"
test -f ~/.codex/auth.json || echo "Codex token missing; run: codex login"
~/.claude/skills/deep-research/bin/quota.sh   # prints remaining DR quota

Usage

~/.claude/skills/deep-research/bin/run.sh [--heavy] [-o OUT_DIR] "<query>"

Default mode: light (deep_research, ~1 min, citations included).
--heavy: deep_research_heavy (5-30 min, gpt-5-5-pro + connector). The connector renders an embedded-UI widget; the report is recovered from the hidden widget state (widget_state.report_message) via ?include_visually_hidden_messages=true&include_widget_state=true — see "Heavy DR retrieval" below.
-o OUT_DIR: output directory (default: ./research/dr-YYYYMMDD-HHMM/).
Query can be inline string, - for stdin, or @file.md to read from file.

The script writes:

report.md — final report (reconstructed for heavy mode)
events.jsonl — all raw SSE events (for debugging / re-extraction)
status.txt — START / DONE / ERROR with elapsed seconds + event counts
meta.json — server metadata (model slug, request id, etc.)

When to invoke

Situation	Mode
Quick factual question with citations	light
Literature review, market scan, technical decision matrix	light
Big strategic question (>5 questions, want 10+ KB report)	heavy
Question that might yield 50+ sources	heavy

Skip this skill for: pure code questions, debugging, tasks the user explicitly wants you to handle locally, anything covered by context7 (library docs) or local files.

How to use in conversation

If the user provides a clear topic, draft a 3-8 question structured query (research questions + context + deliverable shape) and show it to the user before firing so they can edit / approve. Heavy mode especially deserves a query review — it costs ~10 min and 1/248 quota.
Save the approved query to a file (avoids shell escaping issues with long multi-line queries):
```
cat > /tmp/dr_query.txt <<'EOF'
<your structured query>
EOF
```

Run:

~/.claude/skills/deep-research/bin/run.sh [--heavy] -o /path/to/out @/tmp/dr_query.txt

Use run_in_background: true for heavy mode — it takes minutes. Tail status.txt or events.jsonl wc -l to monitor progress; the bash completion notification will arrive when finished.
After completion: Read report.md, summarize key findings in 5-8 bullets for the user, point to the full file path.

Heavy DR retrieval — connector widget state (fixed 2026-06-11)

deep_research_heavy dispatches the connector_openai_deep_research connector (the "Deep Research App", pineapple URI connectors://connector_openai_deep_research), which renders an embedded-UI widget. The connector never writes its report as an assistant text node in the conversation mapping (the assistant text node stays 0-char), so the legacy poll — which only scanned for assistant text — timed out at 1800s even though the research completed server-side.

The report lives in the hidden widget state. Fetching the conversation with ?include_visually_hidden_messages=true&include_widget_state=true exposes a node carrying widget_state.report_message.content.parts[0] (the full Markdown report) plus report_message.metadata.content_references (the source URLs). The widget state is delivered via two carriers, both handled by _dr_report_from_widget_state:

a tool text node whose part starts with "The latest state of the widget is: {…}", and
message.metadata.chatgpt_sdk.widget_state (a JSON string).

_poll_dr_completion now checks the widget state each poll and emits the report as a done event the moment it appears. Verified by recovering three real completed reports headlessly (45.6K / 52.4K / 51.5K chars). Set GPT2AGENT_RAW_DUMP=<path> to capture the raw heavy stream + polls.

Light mode TODO: the light deep_research (model=research) path uses the SearchGPT web-search backend, a different mechanism. It can still save the search-dispatch JSON ({"queries": […]}) as the body when the real report is a later/longer done; the bin script mitigates by picking the longest done. A dedicated light-mode fix is tracked separately.

Upstream tracking: see gpt2agent/sse.py (https://github.com/robotlearning123/gpt2agent/blob/main/gpt2agent/sse.py) around _dr_report_from_widget_state / _poll_dr_completion.

Quota management

Pro plan: 248 DR calls / monthly cycle (resets ~the 21st). Check before heavy calls:

~/.claude/skills/deep-research/bin/quota.sh

Refuse to fire --heavy if remaining < 10 unless the user explicitly acknowledges the consumption.

Account limits & concurrency (IMPORTANT — read before launching)

The DR quota (≈248/cycle) is NOT the only limit. The ChatGPT backend also rate-limits the conversation endpoints per account.

Run heavy DR SERIALLY — never concurrently. One heavy DR at a time. While a heavy DR is running (it polls /backend-api/conversation/{id} every ~120s during its 5–30 min Phase-2 wait), do NOT launch anything else that hits the same account's chatgpt.com backend — a second heavy/light DR, codex/cx exec jobs, agent mode, or get_conversation/list_conversations polling. Two pollers on one account collide.

Observed failure (2026-06-03): running a heavy DR concurrently with several codex jobs caused sustained HTTP 429 on the poll for ~30 min, then the run died with ERROR RuntimeError: DR polling timed out after 1800.0s waiting for conv <id> and events.jsonl had only the initial meta event. The exact per-account request-rate limit is not officially documented — do not assume a number; just keep account access serial.

Recovery from a 429 / poll-timeout (no extra quota): the run almost always completed server-side — only the local poll failed. The conv_id is in the error line. Recover the finished report instead of re-running --heavy (which would burn another quota): GET the conversation with the hidden-widget flags /backend-api/conversation/<conv_id>?include_visually_hidden_messages=true&include_widget_state=true via BackendClient, then read the connector's hidden widget state — the report lives in widget_state.report_message.content.parts[0] (text) with report_message.metadata.content_references (citation URLs). This is exactly what sse._dr_report_from_widget_state does; heavy DR via the connector does not write the report as an assistant text node, so walking mapping for assistant text alone will miss it. Wait for the rate limit to ease first — repeated GETs while rate-limited keep it hot.

deep-research

Popularity

Invocation

Tool Access

Context Preview

Supporting Files

SKILL.md

deep-research

Popularity

Invocation

Tool Access

Context Preview

Supporting Files

SKILL.md

/deep-research — ChatGPT Pro Deep Research

Preconditions (check once)

Usage

When to invoke

How to use in conversation

Heavy DR retrieval — connector widget state (fixed 2026-06-11)

Quota management

Account limits & concurrency (IMPORTANT — read before launching)

Similar Skills

/deep-research — ChatGPT Pro Deep Research

Preconditions (check once)

Usage

When to invoke

How to use in conversation

Heavy DR retrieval — connector widget state (fixed 2026-06-11)

Quota management

Account limits & concurrency (IMPORTANT — read before launching)

Similar Skills