
ElevenLabs Flash v2.5 is an ultra-low-latency text-to-speech model that generates speech in under 75 milliseconds (model inference time, excluding network and application latency). It supports 32 languages and accepts up to 40,000 characters per request, which suits real-time voice agents, conversational AI, and other applications where response speed is critical. It is less expressive than the Multilingual v2 and v3 models, but its speed and efficiency make it a strong choice for latency-sensitive production deployments.
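
Because each request is capped at 40,000 characters, longer inputs have to be split before synthesis. The following is a minimal sketch of one way to do that, preferring sentence boundaries so chunks do not break mid-sentence. The function name `split_for_tts` and the regex-based sentence splitting are illustrative assumptions, not part of any ElevenLabs SDK.

```python
import re

MAX_CHARS = 40_000  # per-request character limit quoted in the description


def split_for_tts(text: str, limit: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most `limit` characters.

    Hypothetical helper: prefers sentence boundaries and falls back to a
    hard cut only when a single sentence exceeds the limit.
    """
    if limit <= 0:
        raise ValueError("limit must be positive")

    # Split after sentence-ending punctuation followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())

    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        # Hard-split any single sentence that is itself longer than the limit.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit])
            sentence = sentence[limit:]
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each returned chunk can then be sent as its own request. For real-time use, smaller chunks than the maximum are usually preferable, since the first audio can start playing sooner.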

Author
ElevenLabs
Release Date
2024-12-18
Knowledge Cutoff
Unknown
License
Proprietary
I/O Format
Context Length
40K
API I/O (1M)
$0.00005/character
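
At the listed rate, cost scales linearly with input length: 1,000,000 characters come to $50, and a single maximum-size request of 40,000 characters comes to $2. The sketch below does that arithmetic with `Decimal` to avoid floating-point rounding. The function name `estimate_cost` is hypothetical, and the rate is simply the figure listed above; actual billing may differ by plan.

```python
from decimal import Decimal

# Listed rate from this page; treat as an assumption, not a billing guarantee.
PRICE_PER_CHAR = Decimal("0.00005")


def estimate_cost(num_chars: int) -> Decimal:
    """Return the estimated cost in USD for synthesizing `num_chars` characters."""
    if num_chars < 0:
        raise ValueError("num_chars must be non-negative")
    return PRICE_PER_CHAR * num_chars
```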