AI Model Comparison

MiMo-V2.5-Pro is Xiaomi’s flagship model, delivering strong performance in general agentic capabilities, complex software engineering, and long-horizon tasks, with top rankings on benchmarks such as ClawEval, GDPVal, and SWE-bench Pro. It can independently and autonomously complete professional tasks that would take human experts days or weeks, involving more than a thousand tool calls. Its context length of up to 1M makes it well suited for integration with a wide range of agent frameworks.

Author

Xiaomi

Release Date

2026-04-22

Knowledge Cutoff

—

License

Proprietary

I/O Format

Context Length

1.0M / 131K

API I/O (1M)

$1 / $3

How to Use

—

Output Speed

68 tok/s

Arena Overall

1463

Intelligence Index

53.8

Coding Index

45.5

Math Index

—

LiveBench

—

ForecastBench

—

GPQA Diamond

86.6%

HLE

33.8%

MMLU-Pro

—

AIME 2025

—

MATH-500

—

LB Reasoning

—

LB Math

—

LB Data Analysis

—

LiveCodeBench

—

LB Coding

—

LB Agentic

—

TAU2

94.2%

TerminalBench

43.2%

SciCode

50.2%

IFBench

79.9%

AA-LCR

0.7

Hallucination (HHEM)

—

Factual Consistency (HHEM)

—

LB Language

—

LB Instruction Following

—

Calculate Cost View Model Details

1 / 3

Swipe to compare