
Gemma 4 31B is Google DeepMind's most capable open-weight model: a dense multimodal model with 30.7 billion parameters, released under the Apache 2.0 license. It accepts text and image inputs with a 256K-token context window and supports a configurable thinking (reasoning) mode, native function calling, structured JSON output, and over 140 languages. It ranks among the top three open models on the Arena AI leaderboard and matches or exceeds much larger models such as Llama 4 and Qwen 3.5 on math, coding, and agentic tool use. When quantized, it can run on consumer GPUs with 24 GB of VRAM.
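The 24 GB claim can be sanity-checked with simple arithmetic on the weights alone. The sketch below is a rough estimate, not a measurement: the function name `weight_memory_gib` is illustrative, the bit widths are common quantization levels rather than formats confirmed for this model, and the figure ignores the KV cache, activations, and per-block quantization metadata, all of which add to real memory use.

```python
# Rough weight-only memory estimate for a dense model.
# Assumption: every parameter is stored at the same bit width, and no
# extra space is counted for KV cache, activations, or quantization scales.

GIB = 2**30  # bytes per GiB


def weight_memory_gib(n_params: float, bits_per_weight: float) -> float:
    """Return the approximate size of the weights in GiB."""
    return n_params * bits_per_weight / 8 / GIB


PARAMS = 30.7e9  # parameter count stated in the description above

if __name__ == "__main__":
    for bits in (16, 8, 4):
        size = weight_memory_gib(PARAMS, bits)
        fits = "fits" if size < 24 else "does not fit"
        print(f"{bits:>2}-bit weights: {size:5.1f} GiB ({fits} in 24 GiB)")
```

Under these assumptions, 4-bit weights come to roughly 14 GiB, which leaves headroom on a 24 GB card for the KV cache, while 8-bit and 16-bit weights alone already exceed 24 GiB. How much of the 256K context is usable in the remaining memory depends on the runtime and cache precision.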

Author
Google
Release Date
2026-04-02
Knowledge Cutoff
2025-01-01
License
Open Model
I/O Format
Context Length
262K / 131K
API I/O (1M)
$0.13 / $0.38
How to Use
API Access
Output Speed
14 tok/s
Arena Overall
1451
Intelligence Index
39.2
Coding Index
38.7
Math Index
LiveBench
62.4
ForecastBench
GPQA Diamond
85.7%
HLE
22.7%
MMLU-Pro
AIME 2025
MATH-500
LB Reasoning
59.4
LB Math
73.9
LB Data Analysis
58.8
LiveCodeBench
LB Coding
60.3
LB Agentic
40.0
TAU2
59.9%
TerminalBench
36.4%
SciCode
43.4%
IFBench
75.6%
AA-LCR
0.6
Hallucination (HHEM)
7.4%
Factual Consistency (HHEM)
92.6%
LB Language
71.3
LB Instruction Following
67.6
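As a rough guide to what the listed price and speed mean in practice, the sketch below estimates the cost and generation time of a single request. It assumes that "API I/O (1M)" means US dollars per one million input tokens and per one million output tokens, in that order, and that the listed output speed of 14 tok/s is sustained for the whole response. The function names are illustrative and do not belong to any provider's SDK.

```python
# Back-of-the-envelope cost and latency estimate from the listed figures.
# Assumptions: $0.13 per 1M input tokens, $0.38 per 1M output tokens,
# and a constant output speed of 14 tokens per second.

INPUT_USD_PER_M = 0.13
OUTPUT_USD_PER_M = 0.38
OUTPUT_TOK_PER_S = 14.0


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimated cost of one request in US dollars."""
    return (input_tokens * INPUT_USD_PER_M
            + output_tokens * OUTPUT_USD_PER_M) / 1_000_000


def generation_seconds(output_tokens: int) -> float:
    """Estimated time to stream the output, ignoring time to first token."""
    return output_tokens / OUTPUT_TOK_PER_S


if __name__ == "__main__":
    # Example: a 100K-token prompt with a 2K-token answer.
    cost = request_cost_usd(100_000, 2_000)
    secs = generation_seconds(2_000)
    print(f"cost: ${cost:.5f}, generation time: {secs:.0f} s")
```

Under these assumptions, long prompts dominate the cost while long outputs dominate the latency, so the two should be budgeted separately.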