MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding tasks. Its 1M-token context window can hold complete documents, extended conversations, and complex task contexts in a single pass, which suits it to agent frameworks where strong reasoning, rich perception, and cost efficiency all matter.
Reasoning | Proprietary Model
Knowledge Cutoff: Unknown
Input → Output Format
Context Memory: 1.0M in / 131K out
AI Performance Evaluation
Overall
  AA Intelligence Index: 49% (↑10%)
Reasoning & Math
  GPQA Diamond: 85% (↑3%)
  HLE: 25% (↑6%)
Coding
  AA Coding Index: 42% (↑5%)
  TAU2: 91% (↑11%)
  TerminalBench: 42% (↑7%)
  SciCode: 43% (↑1%)
Language & Instructions
  IFBench: 67% (↑9%)
  AA-LCR: 63% (↑0%)
Source: Artificial Analysis