MiMo-V2.5-Pro is Xiaomi’s flagship model. It delivers strong performance in general agentic capabilities, complex software engineering, and long-horizon tasks, with top rankings on benchmarks such as ClawEval, GDPVal, and SWE-bench Pro. It can autonomously complete professional tasks that would take human experts days or weeks and that involve more than a thousand tool calls. Its context length of up to 1M tokens makes it well suited for integration with a wide range of agent frameworks.

Author: Xiaomi
Release Date: 2026-04-22
Knowledge Cutoff:
License: Proprietary
I/O Format:
Context Length: 1.0M / 131K
API I/O (1M): $1 / $3
How to Use:
Output Speed: 68 tok/s
Arena Overall: 1463
Intelligence Index: 53.8
Coding Index: 45.5
Math Index:
LiveBench:
ForecastBench:
GPQA Diamond: 86.6%
HLE: 33.8%
MMLU-Pro:
AIME 2025:
MATH-500:
LB Reasoning:
LB Math:
LB Data Analysis:
LiveCodeBench:
LB Coding:
LB Agentic:
TAU2: 94.2%
TerminalBench: 43.2%
SciCode: 50.2%
IFBench: 79.9%
AA-LCR: 0.7
Hallucination (HHEM):
Factual Consistency (HHEM):
LB Language:
LB Instruction Following:
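
The pricing and speed figures listed above can be turned into a rough per-request estimate. The sketch below is illustrative only and rests on two assumptions the listing does not spell out: that "$1 / $3" means USD per 1M input tokens and per 1M output tokens respectively, and that 68 tok/s applies to output tokens at a steady rate. The function names are hypothetical and are not part of any Xiaomi API.

```python
# Assumption: "API I/O (1M): $1 / $3" is USD per 1M input / output tokens.
INPUT_USD_PER_M = 1.0
OUTPUT_USD_PER_M = 3.0
# Assumption: "Output Speed: 68 tok/s" is a steady output-token rate.
OUTPUT_TOKENS_PER_SEC = 68.0


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Rough request cost in USD under the assumed per-1M-token prices."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


def estimate_generation_seconds(output_tokens: int) -> float:
    """Rough generation time in seconds, ignoring latency to first token."""
    return output_tokens / OUTPUT_TOKENS_PER_SEC


if __name__ == "__main__":
    # Hypothetical request: 200K tokens of context in, 10K tokens out.
    cost = estimate_cost_usd(200_000, 10_000)
    seconds = estimate_generation_seconds(10_000)
    print(f"cost ~ ${cost:.2f}, generation ~ {seconds:.0f} s")
```

Actual billing may differ, for example if pricing is tiered by context length or if cached input is discounted, so treat the result as an order-of-magnitude guide.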