OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing.
/plugin marketplace add zechenzhangAGI/AI-research-SKILLs/plugin install whisper@zechenzhangAGI/AI-research-SKILLs