Skill

video-intel

Ingests and curates a YouTube video corpus: scans channels for new videos, generates Gemini-powered mind maps, transcribes videos (URL or local MP4), extracts concepts, rebuilds a LanceDB hybrid-search index, deduplicates, and prunes YouTube Shorts.

ai-ml

data-engineering

Popularity

Stars

Forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/video-intel:video-intel

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Multimodal video scanning and transcription powered by Gemini.

SKILL.md

751 lines · ~11.5k tokens(exceeds 5k compaction limit)

Stats

LanguagePython

Stars5

Forks4

MaintenanceExcellent

Last CommitJun 22, 2026

Actions

View Source View Plugin View on GitHub View README

Video Intel

Multimodal video scanning and transcription powered by Gemini.

What This Skill Does

Three layers, designed as a narrowing funnel.

scan - Fetch new videos from configured YouTube channels. For each video, in this order: (a) Gemini multimodal transcript reading frames + audio + on-screen text; (b) mindmap built from that transcript via a text-only Gemini call (~10× cheaper than reading video, no 10800-frame cap); (c) concept extraction from the mindmap. Only step (a) touches the video; (b) and (c) are text-only and read what (a) wrote to disk. The mindmap_source: auto per-channel knob (default) routes step (b) to the text-only path when a transcript exists, with a fallback to mindmap-from- video when no transcript is on disk. Per-channel transcript_source (default gemini) can route step (a) to yt-captions (caption track only, no Gemini) or auto (Gemini then captions on failure/timeout, issue #60). Before any Gemini call, scan drops upcoming/live premieres and non-public videos via a pre-flight metadata check (issue #70) - the corpus indexes what has aired, not what is scheduled.
transcript - Generate a fused document for a single video: diarized speech interleaved with timestamped SCREEN sections describing what was shown (slides, diagrams, code, demos). Uses a three-task decoupled prompt for best quality. Gemini multimodal is the default and the only path that captures on-screen content and diarization. A cheaper speech-only fallback exists (issue #60): transcript_source: yt-captions builds the transcript from the YouTube caption track alone (no SCREEN, no diarization, no Gemini), and transcript_source: auto tries Gemini first and falls back to the caption track when the Gemini transcript fails (token-cap, 403, the prompt=0 confab guard; and a hung call on the single-shot path). See the captions rows in the intent table below. (A separate SRT path lives in translate_video.py for BCS subtitle translation only - that is a different thing; do not confuse the two.)
concepts - Extract and normalize key concepts from mind maps into a canonical vocabulary (taxonomy.json). Different videos use different words for the same idea — the concept layer resolves synonyms so cross-video queries work without reading every file.

Triage workflow — pick the right mode first:

Query type	Examples	Command
Evidence (who/what/when/how)	"which companies adopted X?", "what did they say about Y?"	`search "X" --vector`
Discovery (which videos / themes)	"which videos cover X?", "what themes recur?"	`search "X"` (no flag)
Synthesis (what do creators say, together)	"nugget brief on X", "what do creators agree/disagree about Y?", "consultant brief"	`nugget "X"`

--vector uses hybrid search (BM25 keyword + vector semantic + RRF fusion). Results include full transcript passages — follow-up reads usually unnecessary.
Concept search (no flag) matches taxonomy labels/aliases. Fast, no API calls.
Read only the files returned by search — don't scan the entire corpus.

Prerequisites

Two API keys required as environment variables:

GEMINI_API_KEY - Get free at https://aistudio.google.com/apikey
YOUTUBE_API_KEY - Get free at https://console.cloud.google.com/apis/credentials (enable "YouTube Data API v3")

Python dependencies:

pip install google-genai google-api-python-client pyyaml

# Optional: for vector search
pip install lancedb voyageai

If prerequisites are missing, tell the user what's needed and where to get it.

Important: These Commands Are Slow

Gemini API calls read video frames and audio — they take 1-5 minutes per video. A scan of 10 videos can take 10-30 minutes. This is normal.

Default log level is info - progress is visible without extra flags.
--log-level goes BEFORE the subcommand. python video_intel.py --log-level info scan works; python video_intel.py scan --log-level info errors with argparse. Applies to every subcommand.
--dry-run is preview only - shows what would be processed but creates no files and makes no Gemini calls. Use it to verify config before committing to a real scan.
Use a long bash timeout (at least 600000ms / 10 minutes) for scan and transcript commands. The default 2-minute timeout WILL kill multi-video scans prematurely.
Silence between log lines is normal. Gemini is processing video - don't diagnose or interrupt. A genuinely hung call is bounded: each transcript Gemini call has a hard transcript_timeout_seconds wall-clock cap (default 600s, issue #74). On expiry it raises, so one hung video can no longer freeze a whole scan; on the single-shot path under transcript_source: auto it then falls back to the caption track.
For large scans (10+ videos): run in the background so the user isn't blocked. Check the output directory afterward for results.
For single transcripts: 1-3 minutes is typical. Wait for the "Saved:" line before proceeding.
Transcripts are resilient to malformed JSON. If Gemini returns broken JSON, the script tries to salvage partial content (speech entries, screen content) and writes a partial transcript with a visible warning. A partial transcript is useful for curiosity/search. For strategically important videos, rerun with --model gemini-2.5-pro or retry later.
Raw Gemini responses are saved on failure as .transcript.raw.txt sidecars for debugging.
Exit 0 ≠ success on --url paths. Always verify. mindmap --url and transcript --url exit 0 even when Gemini returns 403 PERMISSION_DENIED (members-only, age-gated, region-locked, or otherwise restricted videos). The error is logged inline but not re-raised, so the script can keep going inside a scan batch. After any URL run, grep the captured output for PERMISSION_DENIED (or check that the expected <prefix>.mindmap.md / <prefix>.transcript.md actually landed on disk) before reporting success to the user. If 403 is found, jump to "When a YouTube URL returns 403" below.

Interpreting User Intent

The verb a user reaches for doesn't always match a CLI command name. This table is the canonical mapping — read it before picking a command.

User says (intent)	Run	Notes
"transcribe this video" + URL	`transcript --url URL --channel <NAME>`	Single video. Grep log for `PERMISSION_DENIED` before reporting success.
"scan this single video" + URL, "process this URL", "full pipeline on this URL", "do everything on [URL]"	`process --url URL --channel <NAME>`	Single-shot URL pipeline. Issue #54 ordering: transcript first (auto-chunked at 50-min default for 2h+ videos) → mindmap built FROM the on-disk transcript text (text-only Gemini, ~10× cheaper than mindmap-from-video, no 10800-frame cap) → concepts. Resolver picks mindmap source per channel: `mindmap_source: auto` (default) uses transcript, falls back to video when transcript fails. Same exit-code contract as `process --file`: exit 0 if mindmap succeeded. After the run, grep log for `PERMISSION_DENIED` — exit 0 lies on gated content.
"fully index this 3-hour talk", "transcribe this conference recording", "process this Lex Fridman episode"	`process --url URL --channel <NAME> --chunk-minutes 50`	Long-form ergonomic. Same issue #54 ordering — transcript first (auto-chunks via `--chunk-minutes`, default 50; each chunk → one Gemini call → merged into single `.transcript.md` with offset-applied timestamps and a coverage-table header) then mindmap-from-transcript (one fast text call no matter how long the video) then concepts. Failure mode: per-chunk failures land as `.transcript.raw.chunkN-START-END.txt` sidecars; meta.json carries `transcript_status: partial` if any chunk failed; the resulting mindmap inherits `mindmap_source_status: partial` and a `<!-- source: partial transcript -->` HTML comment header.
"run full pipeline on [local file]", "mindmap + transcript + concepts on one upload", "process this MP4"	`process --file PATH`	Local video only; single upload, lazy-skipped when artifacts already exist. Stays on legacy mindmap-from-video because chunking would multiply uploads (one-upload guarantee). For local files exceeding the chunk threshold, manual `--start`/`--end` segments are still the workaround.
"regenerate mindmap on [URL]", "redo the mindmap for [video]", "mindmap from transcript instead of video"	`mindmap --url URL --channel <NAME>` (with on-disk transcript)	Issue #54: when a transcript is already on disk for the URL, `mindmap --url` automatically routes to the cheap text-only path. Pass `--force` to overwrite an existing mindmap. Per-channel `mindmap_source: video` forces the legacy video path even when transcript exists.
"scan", "what's new", "check for new videos"	`scan`	All channels, configured `since`
"what's new from [creator]"	`scan --channel X`	Single channel, configured `since`
"transcribe [creator]'s backlog", "videos I'm missing from [creator]", "catch up on [creator]"	`scan --channel X --since 2y` (or wider)	Always `--dry-run` first to surface scope
"fully scan [creator]", "everything from [creator]"	`scan --channel X --since 2005-01-01`	Always `--dry-run` first — implies entire channel history
Backlog of N videos to transcribe	`scan` with `auto_transcript: all` configured	NOT N separate `transcript --url` calls
"rebuild the index", "reindex after dedupe"	`index --force`	Write-path rebuild of LanceDB; query uses `video-intel-search`
"prune shorts", "remove shorts", "too many shorts in my corpus"	`prune-shorts [--apply]`	Always `--dry-run` first — destructive on `--apply`; deletes mindmap/transcript/concepts/meta per Short
"rebuild taxonomy", "update master vocabulary"	`taxonomy-build`	Derived artifact; rebuildable anytime
"catch me up on what I missed", "what haven't I been briefed on", "videos I haven't seen yet", "generate a catch-up briefing", "fill the gaps in my viewing guides"	`briefings --unseen [--dry-run] [--since DATE] [--until DATE] [--limit N]`	Surfaces corpus videos absent from every existing `_briefings/*.md` `video_ids` list (strict set difference, never re-surfaced once briefed), within a default 30-day UTC window, ranked by overlap with the inferred profile in `_briefings/profile.yaml` and capped to the top `N` (default 30; `--limit 0` = no cap). `--dry-run` previews; otherwise writes `_briefings/<date>-catch-up-unseen.md`. Widen with `--since 120d` for a one-time backlog sweep. No Gemini, no `channels:` required.
"this video keeps getting re-transcribed", "fix identity-less metas", "backfill missing video_id", "meta.json has no video_id"	`repair-metas [--apply]`	Dry-run first (issue #66). Reconstructs `video_id`/`url`/`title`/`published` from the `.transcript.md` header for metas missing identity; only fills missing fields; refuses local/non-YouTube sources. Re-run `index --force` after `--apply`.
"skip transcript on this video", "stop trying to transcribe [URL]", "this video keeps failing transcript", "block transcript only"	`mark-skip --url URL --mode transcript [--reason TEXT]`	Per-mode skip (issue #42). Mindmap and concepts continue to run. Repeatable: `--mode transcript --mode concepts`.
"permanently ignore this video", "never re-process [video_id]", "skip these IDs on backfill", "stop touching this URL on every scan"	edit `skip_video_ids` under the channel in `config.yaml`	Declarative pre-fetch blocklist. Listed IDs never reach Gemini, never get a meta.json, no cost. Override = remove the ID from config. Cheaper than `mark-skip` since no meta.json roundtrip needed.
"I want notifications about new videos but no auto-processing", "discover only", "tell me what's new but don't run Gemini", "follow [creator] but don't pay for it"	set `auto_mindmap: none` and `auto_transcript: none` on the channel in `config.yaml`	Notify-only mode: scan logs new videos but skips both Gemini calls. Cherry-pick episodes manually with `mindmap --url URL --channel <name>` when one looks worth indexing. Useful for long-form podcasters (Lex Fridman).
"drop short videos for [creator]", "only count [creator]'s long-form content", "for Lex anything under 30 min isn't worth it"	set `min_duration_seconds: 1800` on the channel in `config.yaml`	Per-channel duration floor (30 min = 1800s in this example). Drops anything shorter before Gemini sees it. Independent from the standard 60s `skip_shorts` filter.
"index this cheaply", "captions only", "skip the expensive transcript for [creator]", "discovery-only indexing", "just use the YouTube captions"	`transcript --url URL --transcript-source yt-captions` (one-off) or set `transcript_source: yt-captions` on the channel (issue #60)	Skips Gemini entirely; builds a speech-only transcript from the YouTube caption track. Cheap, but no on-screen content / no diarization (flagged `transcript_source: youtube_captions`). Fails when the video has no captions.
"fall back to captions when Gemini fails", "auto-recover failed transcripts", "this channel keeps hitting the token cap", "this channel keeps hanging"	set `transcript_source: auto` on the channel, or `--transcript-source auto` (issue #60)	Tries Gemini first; on failure (token-cap `INVALID_ARGUMENT`, 403, the `prompt=0` confabulation guard, or a wall-clock timeout / hang per issue #74) falls back to the YouTube caption track. Never worse than `gemini`.
"find videos about X", "search for Y", "nugget brief on Z", "corpus status", "verify quote", "fact-check claim against [creator]"	—	Wrong skill. These are read-only queries; use the video-intel-search skill.

Channel name resolution

When the user names a creator (e.g. "Grace Leung", "Nate Jones"):

Read ${CLAUDE_SKILL_DIR}/../../config.yaml and match the name case-insensitively against both the name field and the handle in url.
If exactly one match — use that channel's name.
If multiple matches — list them and ask which one.
If zero matches — list available channels and ask whether to add the creator. Do not invent a YouTube handle and proceed.

When to pause and confirm

Before running scan (which costs Gemini quota), run --dry-run first if any of these are true:

The user said "all", "fully", "missing", "backlog", or "catch up" without an explicit date window
The implied scope is more than ~10 new videos
The channel name was fuzzy and required config-file resolution
auto_transcript: all is set on the target channel (each new video = 3 Gemini calls — 1 expensive transcript reading video frames + audio, plus 2 cheap text-only calls: mindmap-from-transcript and concepts)

Report the count of new videos and the estimated Gemini call count. Per video with auto_transcript: all: 1 expensive transcript call + 2 cheap text-only calls (mindmap-from-transcript + concepts). With auto_transcript: none: 1 expensive mindmap-from-video call (legacy path, used when no transcript is on disk). Wait for the user's go-ahead before running the real scan.

When processing fails

A video that does not process cleanly almost always matches one of these scenarios. Route by symptom; the full causes + step-by-step recovery SOPs live in docs/troubleshooting.md, and the meta.json fields these reference are in docs/meta-json-schema.md.

Symptom	Cause	Recovery
Scan never finds a video that exists	Unlisted (not in the uploads feed - a hard YouTube Data API limit)	manual `process --url`/`--file`, or `--transcript-source yt-captions` for a cheap captions index
`403 PERMISSION_DENIED` (grep every URL run for it)	Members-only / gated	download via membership, then `process --file`
`400 INVALID_ARGUMENT`, fails fast	Token cap on a long video	`process --url --chunk-minutes 50`, or set `transcript_source: auto` (captions failover)
Transcript hangs for many minutes	Gemini stall	Auto-capped per transcript (`transcript_timeout_seconds`, default 600s, issue #74) -> failover under `transcript_source: auto`; `mark-skip --mode transcript` or `skip_video_ids` as backup
Tiny `prompt=0` transcript that looked "complete"	Future/scheduled premiere confabulated	the issue #60 confab guard now discards it; delete any old stub
Two mindmaps/metas for one video	Title rotation	`dedupe`

Governing principle: the corpus indexes things that have happened, not things that will happen - skip future/scheduled premieres (liveBroadcastContent: upcoming).

Model Selection

The default model (gemini-3-flash-preview from config.yaml) works for most operations. Override with --model / -m when needed:

Scenario	Model	Why
Default (transcripts, mindmaps, concepts, scan)	`gemini-3-flash-preview`	Best deep video understanding for the transcript step; cheap text-only model for mindmap+concepts
Transcripts failing with JSON errors	`gemini-2.5-pro`	More reliable structured JSON, higher output token limit
Gemini 3.x backend unreliable / 503s	`gemini-2.5-pro`	Stable fallback
Long videos (>60 min transcripts)	`gemini-2.5-pro`	Less likely to truncate mid-output

# Override model for a single command
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" --model gemini-2.5-pro transcript --url "URL"

# Model for scan (all videos in batch use this model)
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" --model gemini-2.5-pro scan --channel natebjones

Precedence: --model flag > config.yaml model field > gemini-3-flash-preview.

Query Workflow (Different Skill)

For searching the corpus, nugget briefs, corpus status, or summarizing a video that is already indexed, use the video-intel-search skill. It is read-only, globally installable, and reads the same output_dir this skill writes to. This skill covers the write path only: scan, transcribe, process, index, concepts, dedupe, taxonomy-build, prune-shorts, mark-skip, repair-metas, briefings.

How to Use

Scan channels for new videos

python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" --log-level info scan

Scans all channels in config.yaml, processes new videos since each channel's since window. Per video the order is: transcript (Gemini multimodal) → mindmap (text-only Gemini, reads the just-written transcript) → concepts (text-only Gemini, reads the mindmap). All three artifacts land in the output directory. This command is slow — multiple Gemini API calls, 1-5 min each (the transcript step dominates wall-clock). Use a 600000ms bash timeout. --log-level info is mandatory so progress is visible; without it the command appears to produce no output.

Options:

--since 14d - Override the time window for this run
--channel natebjones - Scan only this channel
--dry-run - Show what would be processed without calling Gemini
--force - Regenerate even if output files exist

Transcribe a specific video

YouTube URL:

python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" --log-level info transcript \
  --url "https://www.youtube.com/watch?v=XXXXX"

Local MP4 file (works for screen recordings, meetings, Dropbox/GDrive sync folders):

# Full file (<1GB)
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" --log-level info transcript \
  --file ~/Videos/meeting.mp4

# Specific segment (required for files >1GB)
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" --log-level info transcript \
  --file ~/Videos/meeting.mp4 --start 05:30 --end 18:45

Local files produce {name}.transcript.md and {name}.meta.json in the same directory as the source by default. Uploaded files auto-expire from Gemini after 48 hours.

LOW media resolution by default. Both the single-shot transcript path (local files and short YouTube videos) and the chunked-transcript path (long YouTube videos) use Gemini's LOW media resolution by default (~70 tokens/frame instead of HIGH ~258 tokens/frame). LOW gives equivalent quality on talking-head + slide content at 3× lower input-token cost, and keeps hour-long videos under Gemini's 1M-token cap. Pass --media-resolution high only when the prompt depends on reading fine on-screen text (slides, burned-in captions). HIGH on a video over ~67 minutes will fail with 400 INVALID_ARGUMENT (token-cap exceeded).

When a YouTube URL returns 403 (members-only / gated content)

Detection. Gemini cannot fetch members-only, paid, age-gated, or region-locked videos and returns 403 PERMISSION_DENIED. The script logs the error and exits 0 — there will be a stub <prefix>.meta.json on disk with modes_completed: [] and last_error: "...PERMISSION_DENIED...", but no .mindmap.md or .transcript.md. This applies to scan, mindmap --url, and transcript --url alike. The stub meta is not garbage — it carries the canonical identity (video_id, title, published, channel) and the recovery flow below will reuse it to write artifacts under the canonical {YYYY-MM-DD}-{slug} prefix.

Hint. If output_dir/<channel>/ already contains an MKV/MP4 with a companion .transcript.md, the user has done this recovery before for the same creator — follow the same pattern.

Recovery (preferred: process --file — one upload, both modes):

Download the video locally via the user's member access. The script does not download for you.
Save under output_dir/<channel>/ named <videoId>.mp4 (the 11-char YouTube ID). This makes the tool G2-dedup against any existing stub meta and route artifacts to the canonical {YYYY-MM-DD}-{slug} prefix automatically. .mkv, .mp4, .mov, .webm, .avi are all accepted.
Run process --file:

python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" --log-level info process \
  --file "${OUTPUT_DIR}/everyinc/jPrwIL2B56Q.mp4" \
  --video-id jPrwIL2B56Q \
  --title "Camp: Codex for Knowledge Work" \
  --date 2026-04-24

--video-id / --title / --date are redundant when a stub .meta.json already exists with those fields, but passing them is harmless and explicit. With a <videoId>.mp4 filename and no stub, they let the tool stamp identity into a fresh canonical meta.

Why process --file over separate mindmap --file + transcript --file: single Gemini upload (the legacy form uploaded twice), lazy-skip when artifacts already exist, automatic file-expiry recovery if Gemini's 48h TTL expires mid-run, and inline concepts when the channel is configured.

Legacy two-call form (still works, kept for reference; prefer process --file for new recoveries):

# Drop the MP4 under output_dir/everyinc/ first, then:
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" mindmap \
  --file "${OUTPUT_DIR}/everyinc/Compound Engineering Camp.mkv"
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" transcript \
  --file "${OUTPUT_DIR}/everyinc/Compound Engineering Camp.mkv"

# Or keep the MP4 elsewhere and pass --channel explicitly:
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" transcript \
  --file "~/Downloads/lfML5OJc-CM.mp4" --channel everyinc

When the local filename is <videoId>.mp4 (11-char YouTube ID), the tool matches it against an existing canonical scan-generated .meta.json in the channel folder and writes artifacts under the canonical {YYYY-MM-DD}-{slug} prefix, keeping a single meta.json per video. Otherwise the filename stem is used as both the title and the artifact prefix.

Options:

--url - YouTube URL to transcribe (mutually exclusive with --file)
--file - Path to local MP4 / MKV / MOV / WebM / AVI (mutually exclusive with --url)
--start/--end - Segment time offsets (accepts MM:SS, HH:MM:SS, or raw seconds)
--channel <NAME> - Save output under this channel's folder; with --file, enables in-place recovery routing
--video-id <ID> - 11-char YouTube video ID for explicit canonical-meta matching
--title <T> / --date YYYY-MM-DD - Override filename-inferred defaults
--force - Regenerate even if transcript exists
--transcript-source {gemini,yt-captions,auto} - Where the transcript text comes from (issue #60). Default gemini (multimodal). yt-captions = caption track only, speech-only, no SCREEN/diarization. auto = Gemini then captions fallback on failure. Overrides the per-channel config knob. Captions need a YouTube URL, so keep a local --file on gemini - yt-captions/auto have no caption track to fetch on a local upload.

Process a local video (full pipeline on one upload)

Use process --file when the user has a local MP4 and wants the complete artifact set — mindmap + transcript + concepts — from a single Gemini upload. The existing mindmap --file and transcript --file commands still work for single-mode runs; process is the opt-in efficient path when both are wanted.

# Channel inferred from parent folder (drop the MP4 under output_dir/<channel>/)
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" --log-level info process \
  --file "./video-intel/earlyaidopters/some-talk.mp4"

# Or keep the MP4 anywhere and pass --channel explicitly
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" --log-level info process \
  --file "~/Downloads/some-talk.mp4" --channel earlyaidopters

# Regenerate everything from scratch (bypasses lazy-upload skip)
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" process \
  --file "./video-intel/earlyaidopters/some-talk.mp4" --force

How process --file works:

Pipeline ordering: transcript first, then mindmap, then concepts. Step 1 transcribes the video (chunked into ~50-min windows on long files, single-shot otherwise). Step 2 generates the mindmap by reading the on-disk transcript via a cheap text-only Gemini call (~10× cheaper than reading video frames again, no 10800-frame cap). Step 3 extracts concepts from the mindmap. Mirrors process --url. Local files where the on-disk transcript can't be produced (e.g., transcript step fails entirely) fall back to mindmap-from-video automatically via the mindmap_source resolver.
Long-video chunking. Transcripts on videos longer than --chunk-minutes (default 50) auto-chunk into uniform windows. Each chunk is a separate Gemini call with VideoMetadata.start_offset/end_offset against the SAME upload — implicit caching makes follow-up chunks cheap. The "one upload" guarantee is preserved. Without chunking, hour-long single-shot transcript requests return malformed JSON intermittently from Gemini 2.5 Pro (different break point each retry, irrecoverable).
One upload per invocation. The MP4 is uploaded to Gemini Files API exactly once and referenced by every Step-1 chunk + the optional Step-2 mindmap-from-video fallback.
Lazy upload skip. When meta.json already shows all modes completed and artifacts exist, process uploads nothing and exits quickly. A partial prior run (e.g., transcript succeeded but mindmap did not) re-uploads once and regenerates only the missing steps.
File-expiry recovery. If Gemini's 48h Files API TTL expires mid-run (rare), process detects the expiry error, re-uploads once, retries once, then fails cleanly.
Observability. Each Gemini call emits a usage <label> prompt=N cached=N candidates=N total=N log line at info level. cached>0 on follow-up calls (chunks 2..N, or the mindmap step when it falls back to source=video) means implicit caching fired and you got a token discount.
Exit-code contract. Exit 0 if mindmap succeeded, regardless of transcript / concepts outcome. Automation callers that need partial-success detail inspect modes_completed in the resulting meta.json.
LOW media resolution by default. The mindmap-from-video fallback step (and the chunked-transcript step) use Gemini's LOW media resolution by default (~70 tokens/frame) instead of the API default HIGH (~258 tokens/frame). LOW yields equivalent quality for theme/concept extraction at 3× lower input-token cost, and removes the previous ~67-minute ceiling under Gemini's 1M-token input cap. Pass --media-resolution high only when the prompt depends on reading fine on-screen text (slides, burned-in captions).

Concepts extraction runs inline when the channel is configured in config.yaml. For loose files (no channel match), concepts is skipped with a warning log and the run still exits 0.

Options:

--file PATH - Path to local video file (required).
--channel NAME - Channel name (must exist in config.yaml). Overrides parent-folder inference.
--video-id ID - 11-char YouTube video ID for G2 dedup against a canonical scan meta.json.
--title T / --date YYYY-MM-DD - Override filename-inferred defaults.
--start/--end - Segment time offsets (shared across both video calls).
--force - Regenerate all artifacts from scratch.
--prompt NAME - Mindmap prompt override (default from config.yaml).
--chunk-minutes N - Chunk size for the transcript step on long videos (default: 50). Auto-triggered when video duration exceeds this; disabled when manual --start/--end is set.
--media-resolution {low,high} - Gemini media resolution for the mindmap step (default: low). Use high only when the prompt depends on reading fine on-screen text. LOW handles hour-long videos that HIGH cannot fit under Gemini's 1M-token cap.
--transcript-source {gemini,yt-captions,auto} - Transcript source for the --url path (issue #60): gemini (default), yt-captions (caption track only), or auto (Gemini then captions fallback). The --file path is always Gemini multimodal (a local upload has no caption track to fall back to).

Build the search index

# Build or rebuild the LanceDB index from all transcripts (requires VOYAGE_API_KEY)
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" index

# Force a rebuild from scratch after dedupe or large corpus changes
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" index --force

The index command is a write-path operation (rebuilds the LanceDB hybrid index). Querying the index belongs to the video-intel-search skill.

Generate a catch-up briefing (unseen videos)

# Preview what hasn't been surfaced in any _briefings/ guide yet (writes nothing)
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" briefings --unseen --dry-run

# Write a catch-up briefing: top 30 unseen videos from the last 30 days
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" briefings --unseen

# One-time backlog sweep: widen the window and raise the cap
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" briefings --unseen --since 120d --limit 60

briefings --unseen surfaces corpus videos that appear in no existing _briefings/*.md front-matter video_ids list (strict set difference, so a video is never re-surfaced once it lands in any briefing), bounded to a UTC date window (default 30-day recency floor; --since / --until override) and capped to the top N by relevance (--limit, default 30; --limit 0 = no cap). Uncapped videos stay unseen for the next run, so the cap creates a rolling catch-up rather than dropping anything. Ranking is concept/taxonomy overlap with an inferred interest profile persisted at _briefings/profile.yaml (hand-edit to retune; never overwritten once it has content). No Gemini calls and no channels: config required. Writes _briefings/<date>-catch-up-unseen.md; --dry-run only prints the ranked unseen set.

Extract and normalize concepts

# Extract concepts from all mindmaps that don't have concepts yet
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" --log-level info concepts

# Re-extract for a specific channel
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" --log-level info concepts --channel natebjones --force

# Rebuild master taxonomy from all concept files (fast, no Gemini call)
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" taxonomy-build

Clean up title-rotation duplicates

YouTube creators A/B-test video titles for SEO, which rotates the slug and can fool is_processed() into re-scanning the same video_id under a second prefix. Prevention is automatic (the video_id index inside is_processed() catches repeats across any slug change), but historical duplicates from earlier scans need a one-shot cleanup.

# Dry-run: report all video_id groups with >1 meta.json (no mutation)
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" dedupe

# Restrict to one channel
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" dedupe --channel natebjones

# Apply: merges discarded titles into canonical meta's alt_titles list,
# moves any artifact only a loser has, deletes all loser siblings.
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" dedupe --apply

Canonical selection: latest processed timestamp wins. Tie-breaks: larger modes_completed set, then alphabetical prefix. Discarded titles are preserved as alt_titles: [...] on the surviving meta.

After dedupe --apply, derived artifacts may be stale. Re-run:

python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" taxonomy-build
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" index --force

Dedupe is dry-run by default because it mutates shared state (disk). Only pass --apply after reviewing the dry-run report.

Prune YouTube Shorts

Shorts polluted the corpus before the scan-time skip_shorts filter existed (default-on as of plugin v1.11.0). The prune-shorts subcommand removes them retroactively. Detection: duration < 60s OR /shorts/<id> HEAD redirect returns 200. Sidecars from translate_video.py (.en.srt, .translate-bcs.txt) are preserved.

# Dry-run: report all Shorts with title, duration, URL, artifact count
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" prune-shorts

# Restrict to one channel
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" prune-shorts --channel chase_h_ai

# Apply: deletes mindmap.md, transcript.md (+ raw forensics), concepts.json,
# meta.json, and any mindmap.<variant>.md files for each detected Short.
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" prune-shorts --channel chase_h_ai --apply

After prune-shorts --apply, derived artifacts may be stale. Re-run:

python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" taxonomy-build
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" index --force

Like dedupe, prune-shorts is dry-run by default. Always review the dry-run output before passing --apply — eyeball the 60-90s edge-case rows to make sure nothing substantive is in the deletion list.

Skip a single mode on one video (per-mode skip)

Issue #42: a 2h 24m video kept truncating its transcript and hung scan for hours. Marking the whole video skip: true worked but also blocked the concepts pass that the existing mindmap could have fed. Use mark-skip when you want to silence one mode (commonly transcript) and let the others keep running.

# Stop trying to transcribe a single video. Mindmap and concepts keep going.
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" mark-skip \
  --url "https://www.youtube.com/watch?v=X5UN2LrRK48" \
  --mode transcript \
  --reason "JSON truncation on 2h24m video, structured output exceeds MAX_OUTPUT_TOKENS"

# Block both transcript AND concepts but keep mindmap
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" mark-skip \
  --url "https://www.youtube.com/watch?v=X5UN2LrRK48" \
  --mode transcript \
  --mode concepts

Writes skip_modes: ["transcript"] (or whatever modes you passed) into the video's .meta.json. On subsequent scans, is_skipped(..., mode="transcript") sees the array and skips just that loop. Repeating the same --mode is idempotent. The optional --reason lands as a skip_reason field for your own bookkeeping.

The default 2-hour filter (transcript_max_duration_seconds in config.yaml) handles the long-video case automatically — mark-skip is for cases where the duration is under threshold but transcript fails for other reasons (poor audio, region-locked transcript fetch, etc.).

Backward compat: existing meta.json files with the old skip: true keep behaving as full-skip on every mode. To migrate to per-mode, just re-run mark-skip with the new flags — skip_modes wins outright when both keys exist.

Backfill identity-less metas (repair-metas)

Issue #66: a transcript meta.json written without video_id is skipped by the video_id index, so the video is re-transcribed every scan (and a re-queued one could hang). Going forward the transcript writers stamp full identity; for metas already on disk, repair-metas reconstructs identity from the .transcript.md header.

# Dry-run (default): report which metas would be backfilled, write nothing.
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" repair-metas

# Apply: write the reconstructed video_id/url/title/published/channel.
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" repair-metas --apply

# Restrict to one channel
python "${CLAUDE_SKILL_DIR}/../../scripts/video_intel.py" repair-metas --channel twist --apply

It only fills MISSING fields (never overwrites an existing value) and refuses to guess for non-YouTube sources (local/Skool recordings with no Source URL in the header - those are reported as "unrepairable"). A meta that already has video_id is skipped entirely, even if other identity fields are missing - the index only needs video_id. Like dedupe/prune-shorts, it is dry-run by default. After --apply, re-run index --force so the backfilled videos' chunks carry their identity.

Manage channels

Edit config.yaml directly or ask Claude Code to add/remove channels. Claude Code has write access to the config file.

Configuration

Configuration lives at the plugin root, ${CLAUDE_SKILL_DIR}/../../config.yaml. Key settings:

output_dir: ~/video-intel          # Where output files are saved
default_since: 10d                 # Default lookback window
default_prompt: mindmap-knowledge  # Which prompt to use by default
auto_concepts: true                # Extract concepts after mindmap generation
model: gemini-3-flash-preview     # Gemini model (overridable via --model)
transcript_max_duration_seconds: 7200   # Skip transcripts on videos longer than this
                                        # (issue #42). Default 2 hours - leaves headroom
                                        # for technical talks. Mindmap phase is
                                        # unaffected. Override per workload.
transcript_timeout_seconds: 600         # Hard wall-clock cap per transcript Gemini call
                                        # (issue #74). Default 10 min. On expiry the call
                                        # raises -> failover under transcript_source: auto.
                                        # Per-channel override supported.

channels:
  - name: natebjones               # Folder name for output
    url: https://youtube.com/@natebjones
    auto_transcript: all            # all | none
    since: 10d                      # Override default lookback

  - name: seankochel               # Selective mode: playlists + keywords
    url: https://youtube.com/@iamseankochel
    playlists:                      # Playlist names (resolved via YouTube API)
      - Agent Skills
    keywords:                       # Channel-scoped search terms
      - ux design
    auto_transcript: none             # mindmaps for discovery, transcript manually
    since: 30d                        # also catch recent uploads (additive)

  - name: lennyspodcast            # Manual one-offs only: skipped by `scan`.
    url: https://youtube.com/@lennyspodcast
    auto_transcript: all
    enabled: false                   # see "One-off creators" below

  - name: seankochel
    url: https://youtube.com/@iamseankochel
    auto_transcript: all
    skip_video_ids:                  # Issue #42: declarative pre-fetch blocklist.
      - X5UN2LrRK48                  # 2h24m SaaS workshop - transcript truncates,
                                     # mindmap + concepts already done by hand.
      - SOMEOTHERID                  # add IDs here as you see them fail. The scan
                                     # never touches these on subsequent runs.

Selective scanning: Channels with playlists or keywords target specific content instead of scanning all uploads. Playlist names are resolved via YouTube API (case-insensitive contains matching). Keywords search the entire channel history (capped at 200 results per keyword). If since is also set, recent uploads are fetched as an additional source alongside playlists/keywords.

One-off creators (enabled: false): When a creator posts content you occasionally want a transcript or mindmap of, but you do NOT want them in the regular scan rotation, add them with enabled: false. The channel stays in config (so mindmap --url --channel <name>, transcript --url --channel <name>, and concepts --channel <name> all work) but scan skips them entirely — including when targeted explicitly via --channel. To temporarily bulk-scan such a creator, remove the flag rather than overriding it on the command line. The flag's purpose is durable manual-only routing, not advisory exclusion.

Prompt files

Prompt templates live at the plugin root, ${CLAUDE_SKILL_DIR}/../../prompts/:

mindmap-knowledge.md - Thematic mind map with domain terminology + timestamps (default)
mindmap-light.md - Fast thematic scan (4-6 branches)
mindmap-heavy.md - Comprehensive conceptual extraction
transcript.md - Full diarized transcript with screen content
concepts.md - Concept extraction + normalization against taxonomy
nugget-brief.md - Consultant-grade cross-creator synthesis with attributed nuggets

Each prompt is self-contained. Users can modify or add their own.

Output Structure

~/video-intel/
├── taxonomy.json                                    # Master vocabulary (derived)
├── .lancedb/                                        # Vector search index (derived)
│   └── transcript_chunks.lance
├── natebjones/
│   ├── 2026-03-20-building-mcp-agents.mindmap.md
│   ├── 2026-03-20-building-mcp-agents.transcript.md
│   ├── 2026-03-20-building-mcp-agents.concepts.json
│   ├── 2026-03-20-building-mcp-agents.meta.json
│   └── ...
└── ramjad/
    └── ...

taxonomy.json - Master vocabulary at the output root. Read this first for any topic query.
concepts.json - Per-video normalized concepts with canonical IDs and aliases.
mindmap.md - Thematic mind map with timestamps. Read for detail after finding via concepts.
transcript.md - Full diarized transcript. Read for evidence/quotes after finding via concepts.

Files are idempotent. Re-running a scan skips already-processed videos. Use --force on any command to regenerate.

video-intel

Popularity

Invocation

Context Preview

SKILL.md

video-intel

Popularity

Invocation

Context Preview

SKILL.md

Video Intel

What This Skill Does

Prerequisites

Important: These Commands Are Slow

Interpreting User Intent

Channel name resolution

When to pause and confirm

When processing fails

Model Selection

Query Workflow (Different Skill)

How to Use

Scan channels for new videos

Transcribe a specific video

Process a local video (full pipeline on one upload)

Build the search index

Generate a catch-up briefing (unseen videos)

Extract and normalize concepts

Clean up title-rotation duplicates

Prune YouTube Shorts

Skip a single mode on one video (per-mode skip)

Backfill identity-less metas (repair-metas)

Manage channels

Configuration

Prompt files

Output Structure

Similar Skills

Video Intel

What This Skill Does

Prerequisites

Important: These Commands Are Slow

Interpreting User Intent

Channel name resolution

When to pause and confirm

When processing fails

Model Selection

Query Workflow (Different Skill)

How to Use

Scan channels for new videos

Transcribe a specific video

Process a local video (full pipeline on one upload)

Build the search index

Generate a catch-up briefing (unseen videos)

Extract and normalize concepts

Clean up title-rotation duplicates

Prune YouTube Shorts

Skip a single mode on one video (per-mode skip)

Backfill identity-less metas (repair-metas)

Manage channels

Configuration

Prompt files

Output Structure

Similar Skills