AI Model Comparison

Gemini 3 Flash is Google's high-speed reasoning model, combining near-Pro-level intelligence with the speed and cost efficiency of the Flash line. It outperforms Gemini 2.5 Pro on most benchmarks while running 3× faster at a fraction of the cost, and it scores 78% on SWE-bench Verified. The model supports a 1M-token context window, multimodal inputs (text, images, audio, video, PDFs), configurable thinking levels, and automatic context caching, which makes it well suited to agentic workflows, multi-turn chat, and interactive coding assistance.

Author

Google

Release Date

2025-12-17

Knowledge Cutoff

2025-01-31

License

Proprietary

Context Length (input / output tokens)

1.0M / 66K

API Price, Input / Output (USD per 1M tokens)

$0.50 / $3.00
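
As a rough illustration of how the listed prices translate into a per-request cost, the sketch below applies the $0.50 input and $3.00 output rates per 1M tokens. It assumes flat per-token billing and ignores any cached-token discounts, tiered pricing, or separate accounting for thinking tokens, none of which this page specifies. The function name `estimate_cost` and the token counts are hypothetical.

```python
# Listed prices from this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.50
OUTPUT_PRICE_PER_M = 3.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD, assuming flat per-token pricing.

    Assumption: no caching discount or pricing tiers are applied.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 200K tokens of context in, 4K tokens out.
    print(f"${estimate_cost(200_000, 4_000):.4f}")  # prints $0.1120
```

Under these assumptions a long-context request is dominated by input cost, even though output tokens are six times more expensive per token.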

How to Use

Google AI Free or above / API Access

Output Speed

177 tok/s
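
The output speed can be turned into a rough generation-time estimate. The sketch below assumes the 177 tok/s figure is a sustained average and treats time to first token as a separate, caller-supplied value, since this page does not report latency. The function name `generation_seconds` is hypothetical.

```python
# Listed output speed from this page, in tokens per second.
OUTPUT_TOKENS_PER_SECOND = 177.0


def generation_seconds(output_tokens: int, time_to_first_token: float = 0.0) -> float:
    """Estimate wall-clock seconds to stream a response.

    Assumption: throughput is constant at the listed average speed.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return time_to_first_token + output_tokens / OUTPUT_TOKENS_PER_SECOND


if __name__ == "__main__":
    # A 1,770-token answer takes about 10 seconds at the listed speed.
    print(f"{generation_seconds(1_770):.1f} s")
```

At this rate, filling the full 66K-token output limit would take roughly 373 seconds, a little over six minutes.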

Arena Overall

1474

Intelligence Index

46.4

Coding Index

42.6

Math Index

97.0

LiveBench

54.4

ForecastBench

58.7

GPQA Diamond

89.8%

HLE

34.7%

MMLU-Pro

89.0%

AIME 2025

97.0%

MATH-500

—

LB Reasoning

49.2

LB Math

68.1

LB Data Analysis

48.3

LiveCodeBench

90.8%

LB Coding

78.6

LB Agentic

43.3

TAU2

80.4%

TerminalBench

38.6%

SciCode

50.6%

IFBench

78.0%

AA-LCR

0.7

Hallucination (HHEM)

13.5%

Factual Consistency (HHEM)

86.5%

LB Language

78.7

LB Instruction Following

28.3
