1 / 3
Swipe to compare

Gemini 3.1 Flash Lite is Google's high-efficiency model optimized for cost-sensitive, high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities including audio input, RAG snippet ranking, translation, data extraction, and code completion. It supports full thinking levels (minimal/low/medium/high) for fine-grained cost-performance trade-offs, and is priced at half the cost of Gemini 3 Flash.

Author
GoogleGoogle
Release Date
2026-03-03
Knowledge Cutoff
2025-01-31
License
Proprietary
I/O Format
Context Length
1.0M / 66K
API I/O (1M)
$0.25 / $1.5
How to Use
API Access
Output Speed
335 tok/s
Arena Overall
1439
Intelligence Index
33.5
Coding Index
30.1
Math Index
LiveBench
62.1
ForecastBench
GPQA Diamond
82.2%
HLE
16.2%
MMLU-Pro
AIME 2025
MATH-500
LB Reasoning
59.7
LB Math
73.6
LB Data Analysis
54.9
LiveCodeBench
LB Coding
68.5
LB Agentic
33.3
TAU2
31.3%
TerminalBench
24.2%
SciCode
41.9%
IFBench
77.2%
AA-LCR
0.7
Hallucination (HHEM)
8.2%
Factual Consistency (HHEM)
91.8%
LB Language
73.2
LB Instruction Following
68.6