Audio/video transcription - Whisper, Deepgram, AssemblyAI comparison and usage
```shell
npx claudepluginhub willsigmon/sigstack --plugin media
```
Choose the right transcription service for your use case.
| Service | Price/min | Speed (1 hr audio) | Diarization | Real-time |
|---|---|---|---|---|
| Whisper API | $0.006 | Slow | No (needs separate tooling) | No |
| Deepgram | $0.0043 | ~20 s | Yes | Yes |
| AssemblyAI | $0.0025 | Fast | Yes (+$0.02/hr) | Yes |
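Since providers quote per-minute prices but audio is usually measured in hours, a quick sanity check of per-hour costs using the prices from the table above:

```python
# Per-minute prices from the comparison table above
prices_per_min = {
    "Whisper API": 0.006,
    "Deepgram": 0.0043,
    "AssemblyAI": 0.0025,
}

def cost_per_hour(price_per_min: float) -> float:
    """Convert a per-minute price to a per-hour cost."""
    return price_per_min * 60

for service, price in prices_per_min.items():
    print(f"{service}: ${cost_per_hour(price):.3f}/hr")
# Whisper API: $0.360/hr
# Deepgram: $0.258/hr
# AssemblyAI: $0.150/hr
```

At these rates, the per-minute differences add up quickly for long-form content like podcast back catalogs.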
Whisper (OpenAI):

```python
from openai import OpenAI

client = OpenAI()
with open("audio.mp3", "rb") as f:
    transcript = client.audio.transcriptions.create(
        model="whisper-1", file=f
    )
print(transcript.text)
```
Deepgram:

```python
from deepgram import DeepgramClient, PrerecordedOptions

dg = DeepgramClient(api_key="...")
options = PrerecordedOptions(model="nova-3", diarize=True)
with open("audio.mp3", "rb") as f:
    payload = {"buffer": f.read()}
response = dg.listen.rest.v("1").transcribe_file(payload, options)
print(response.results.channels[0].alternatives[0].transcript)
```
AssemblyAI:

```python
import assemblyai as aai

aai.settings.api_key = "..."
transcriber = aai.Transcriber()
transcript = transcriber.transcribe("audio.mp3")
print(transcript.text)
```
Speaker diarization by service:

```python
# Deepgram: built in; the response includes speaker labels automatically
options = PrerecordedOptions(diarize=True)

# AssemblyAI: enable speaker labels (+$0.02/hr additional)
config = aai.TranscriptionConfig(speaker_labels=True)

# Whisper: no built-in diarization; pair with a separate
# diarization service such as pyannote
from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization")
```
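Pairing Whisper with pyannote means merging two outputs: timestamped transcript segments and timestamped speaker turns. The merge itself is plain Python; the sketch below uses a hypothetical `assign_speakers` helper and assumes you already have Whisper segments (e.g. from a `verbose_json` response) and pyannote turns as `(start, end, speaker)` tuples:

```python
def assign_speakers(segments, turns):
    """Label each transcript segment with the speaker whose
    diarization turn overlaps it the most (hypothetical helper)."""
    labeled = []
    for seg in segments:
        best, best_overlap = None, 0.0
        for start, end, speaker in turns:
            # Overlap between [seg.start, seg.end] and [start, end]
            overlap = min(seg["end"], end) - max(seg["start"], start)
            if overlap > best_overlap:
                best, best_overlap = speaker, overlap
        labeled.append({**seg, "speaker": best})
    return labeled

# Example: two Whisper-style segments, two pyannote-style turns
segments = [
    {"start": 0.0, "end": 4.2, "text": "Hi, thanks for joining."},
    {"start": 4.2, "end": 7.0, "text": "Glad to be here."},
]
turns = [(0.0, 4.0, "SPEAKER_00"), (4.0, 7.5, "SPEAKER_01")]
for seg in assign_speakers(segments, turns):
    print(seg["speaker"], seg["text"])
# SPEAKER_00 Hi, thanks for joining.
# SPEAKER_01 Glad to be here.
```

Maximum-overlap assignment handles the usual case where segment and turn boundaries disagree by a fraction of a second.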
Batch transcription:

```python
import asyncio

async def transcribe_batch(files):
    # transcribe() stands for whichever async client call you use per file
    tasks = [transcribe(f) for f in files]
    return await asyncio.gather(*tasks)
```
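A bare `gather` launches every request at once, which can trip provider rate limits on large batches. A minimal sketch of a bounded variant, using a semaphore to cap concurrency (`transcribe` here is a stand-in for any of the client calls above):

```python
import asyncio

async def transcribe(path: str) -> str:
    """Stand-in for a real async API call from the examples above."""
    await asyncio.sleep(0.01)  # simulate network latency
    return f"transcript of {path}"

async def transcribe_batch(files, max_concurrent=5):
    sem = asyncio.Semaphore(max_concurrent)

    async def bounded(path):
        async with sem:  # at most max_concurrent requests in flight
            return await transcribe(path)

    return await asyncio.gather(*(bounded(f) for f in files))

results = asyncio.run(transcribe_batch([f"ep{i}.mp3" for i in range(10)]))
print(results[0])
# transcript of ep0.mp3
```

`gather` preserves input order, so results line up with the file list even though requests complete out of order.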
Use when: Podcast transcription, meeting notes, video subtitles, voice content indexing