OpenAI Whisper Large v3
OpenAI Whisper is a state-of-the-art model for automatic speech recognition (ASR) and speech translation.
Model ID
openai/whisper-large-v3
Source
Modality
- Input: Audio (e.g.,
.mp3
,.mp4
,.wav
) - Output: Text
Endpoints
Context Limit
Whisper has a receptive field of 30-seconds.