1 / 3
Swipe to compare

Gemini 3 Flash is Google's high-speed reasoning model that combines near-Pro-level intelligence with the speed and cost efficiency of the Flash line. It outperforms Gemini 2.5 Pro on most benchmarks while running 3× faster and at a fraction of the cost, scoring 78% on SWE-bench Verified. The model supports a 1M-token context window, multimodal inputs (text, images, audio, video, PDFs), configurable thinking levels, and automatic context caching, making it well-suited for agentic workflows, multi-turn chat, and interactive coding assistance.

Author
GoogleGoogle
Release Date
2025-12-17
Knowledge Cutoff
2025-01-31
License
Proprietary
I/O Format
Context Length
1.0M / 66K
API I/O (1M)
$0.5 / $3
How to Use
Google AI Free or above / API Access
Output Speed
177 tok/s
Arena Overall
1474
Intelligence Index
46.4
Coding Index
42.6
Math Index
97.0
LiveBench
54.4
ForecastBench
58.7
GPQA Diamond
89.8%
HLE
34.7%
MMLU-Pro
89.0%
AIME 2025
97.0%
MATH-500
LB Reasoning
49.2
LB Math
68.1
LB Data Analysis
48.3
LiveCodeBench
90.8%
LB Coding
78.6
LB Agentic
43.3
TAU2
80.4%
TerminalBench
38.6%
SciCode
50.6%
IFBench
78.0%
AA-LCR
0.7
Hallucination (HHEM)
13.5%
Factual Consistency (HHEM)
86.5%
LB Language
78.7
LB Instruction Following
28.3