ElevenLabs Speech-to-Text transcription workflows with Scribe v1 supporting 99 languages, speaker diarization, and Vercel AI SDK integration. Use when implementing audio transcription, building STT features, integrating speech-to-text, setting up Vercel AI SDK with ElevenLabs, or when user mentions transcription, STT, Scribe v1, audio-to-text, speaker diarization, or multi-language transcription.
/plugin marketplace add vanman2024/ai-dev-marketplace/plugin install elevenlabs@ai-dev-marketplaceThis skill is limited to using the following tools:
README.mdexamples/basic-stt/README.mdexamples/diarization/README.mdexamples/multi-language/README.mdexamples/vercel-ai-stt/README.mdexamples/webhook-integration/README.mdscripts/batch-transcribe.shscripts/setup-vercel-ai.shscripts/test-stt.shscripts/transcribe-audio.shscripts/validate-audio.shtemplates/api-transcribe.py.templatetemplates/api-transcribe.ts.templatetemplates/diarization-config.json.templatetemplates/stt-config.json.templatetemplates/vercel-ai-transcribe.py.templatetemplates/vercel-ai-transcribe.ts.templateThis skill provides comprehensive guidance for implementing ElevenLabs Speech-to-Text (STT) capabilities using the Scribe v1 model, which supports 99 languages with state-of-the-art accuracy, speaker diarization for up to 32 speakers, and seamless Vercel AI SDK integration.
# Install dependencies
bash scripts/setup-vercel-ai.sh
# Verify installation
npm list @ai-sdk/elevenlabs
# Transcribe a single audio file
bash scripts/transcribe-audio.sh path/to/audio.mp3 en
# Validate audio before transcription
bash scripts/validate-audio.sh path/to/audio.mp3
# Batch transcribe multiple files
bash scripts/batch-transcribe.sh path/to/audio/directory en
# Run comprehensive tests
bash scripts/test-stt.sh
// Read Vercel AI SDK template
Read: templates/vercel-ai-transcribe.ts.template
// Customize for your use case
// - Set language code
// - Configure diarization
// - Enable audio event tagging
// - Set timestamp granularity
# Basic STT example
Read: examples/basic-stt/README.md
# Vercel AI SDK example
Read: examples/vercel-ai-stt/README.md
# Speaker diarization example
Read: examples/diarization/README.md
30 languages including: English, French, German, Spanish, Italian, Japanese, Portuguese, Dutch, Polish, Russian
19 languages including: Bengali, Mandarin Chinese, Tamil, Telugu, Vietnamese, Turkish
30 languages including: Arabic, Korean, Thai, Indonesian, Hebrew, Czech
19 languages including: Amharic, Khmer, Lao, Burmese, Nepali
Use direct API or Vercel AI SDK for single-language, single-speaker transcription.
Enable diarization and set numSpeakers for interviews, meetings, podcasts.
Detect language automatically or specify when known for content in 99 languages.
Extract audio from video formats and transcribe with timestamps for subtitles.
Process long files asynchronously using webhook callbacks for results.
# Validate audio format
bash scripts/validate-audio.sh your-audio.mp3
All scripts are located in skills/stt-integration/scripts/:
All templates are located in skills/stt-integration/templates/:
All examples are located in skills/stt-integration/examples/:
Skill Location: plugins/elevenlabs/skills/stt-integration/
Version: 1.0.0
Last Updated: 2025-10-29