1 / 3
Swipe to compare

GPT-4o Mini TTS is OpenAI's cost-efficient text-to-speech model, built on the GPT-4o Mini architecture. It converts written text into natural-sounding, expressive spoken audio with high steerability — developers can control speech characteristics such as tone, pacing, and emphasis through natural language instructions in the prompt. Compared to previous OpenAI TTS models, it delivers significantly lower word error rates and more natural prosody, making it ideal for voice agents, accessibility features, and audio content production at scale.

Author
OpenAIOpenAI
Release Date
2025-12-15
Knowledge Cutoff
Unknown
License
Proprietary
I/O Format
Context Length
2K
API I/O (1M)
$0.6 / $12
How to Use
API Access
Output Speed
Arena Overall
Intelligence Index
Coding Index
Math Index
LiveBench
ForecastBench
GPQA Diamond
HLE
MMLU-Pro
AIME 2025
MATH-500
LB Reasoning
LB Math
LB Data Analysis
LiveCodeBench
LB Coding
LB Agentic
TAU2
TerminalBench
SciCode
IFBench
AA-LCR
Hallucination (HHEM)
9.6%
Factual Consistency (HHEM)
90.4%
LB Language
LB Instruction Following