Gemma 4 31B is Google DeepMind's most capable open-weight release: a 30.7-billion-parameter dense multimodal model published under the Apache 2.0 license. It accepts text and image inputs with a 256K-token context window and supports configurable thinking/reasoning modes, native function calling, structured JSON output, and more than 140 languages. Ranked among the top three open models on the Arena AI leaderboard, it matches or exceeds much larger models such as Llama 4 and Qwen 3.5 on math, coding, and agentic tool use, and it can run quantized on consumer GPUs with 24GB of VRAM.
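A quick back-of-envelope calculation shows why a 30.7B-parameter model can plausibly fit in 24GB of VRAM once quantized. The bit widths below are standard quantization formats, not official figures for any particular Gemma release, and the estimate covers weights only (KV cache and activations add further overhead):

```python
# Back-of-envelope VRAM estimate for a 30.7B-parameter dense model.
# Weights-only: KV cache, activations, and framework overhead are extra.

def weight_memory_gb(n_params: float, bits_per_param: float) -> float:
    """Memory needed just to hold the weights, in gigabytes."""
    return n_params * bits_per_param / 8 / 1e9

N_PARAMS = 30.7e9  # parameter count from the model description

for bits, label in [(16, "bf16 (unquantized)"), (8, "int8"), (4, "4-bit")]:
    gb = weight_memory_gb(N_PARAMS, bits)
    fits = "fits" if gb < 24 else "does not fit"
    print(f"{label:>20}: {gb:5.1f} GB of weights -> {fits} in 24 GB")
```

At 4 bits per parameter the weights take about 15.4 GB, leaving several gigabytes of headroom on a 24GB card for the KV cache; at bf16 the same weights need over 60 GB, which is why quantization is the practical route on consumer hardware.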