1 / 3
Swipe to compare

MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding tasks. Its 1M context window supports complete documents, extended conversations, and complex task contexts in a single pass, making it ideal for integration with agent frameworks where strong reasoning, rich perception, and cost efficiency all matter.

Author
XiaomiXiaomi
Release Date
2026-04-22
Knowledge Cutoff
License
Proprietary
I/O Format
Context Length
1.0M / 131K
API I/O (1M)
$0.4 / $2
How to Use
Output Speed
Arena Overall
1424
Intelligence Index
49.0
Coding Index
42.1
Math Index
LiveBench
ForecastBench
GPQA Diamond
84.9%
HLE
25.2%
MMLU-Pro
AIME 2025
MATH-500
LB Reasoning
LB Math
LB Data Analysis
LiveCodeBench
LB Coding
LB Agentic
TAU2
90.6%
TerminalBench
41.7%
SciCode
43.1%
IFBench
67.1%
AA-LCR
0.6
Hallucination (HHEM)
Factual Consistency (HHEM)
LB Language
LB Instruction Following