Skip to main content
Version: Next

OpenAI Whisper Large v3

OpenAI Whisper is a state-of-the-art model for automatic speech recognition (ASR) and speech translation.

Model ID

openai/whisper-large-v3

Source

Hugging Face

Modality

  • Input: Audio (e.g., .mp3, .mp4, .wav)
  • Output: Text

Endpoints

Context Limit

Whisper has a receptive field of 30-seconds.