OpenAI
OpenAI

GPT-4o Transcribe

2024-06-01

GPT-4o Transcribe is OpenAI's full-featured speech-to-text model, built on the GPT-4o architecture for maximum transcription accuracy. It delivers lower word error rates and superior language recognition compared to both Whisper and GPT-4o Mini Transcribe, making it the best choice for high-accuracy transcription needs. The model supports real-time audio streaming via WebSocket connections, contextual prompts for specialized vocabulary, and log probability outputs for confidence scoring.

API|Proprietary Model
Knowledge Cutoff
2024-06-01
Input → Output Format
Context Memory
Cost/1M Words
$2.5IN$10OUT
Calculate Cost

AI Performance Evaluation

Output Speed
Standard Mode
31tok/s↓51
First Output 0.70s