1 / 3
Swipe to compare

MiniMax M2.5 is a frontier language model trained with reinforcement learning across hundreds of thousands of complex real-world environments, achieving state-of-the-art scores of 80.2% on SWE-Bench Verified, 51.3% on Multi-SWE-Bench, and 76.3% on BrowseComp. Building on the coding expertise of M2.1, it extends into general office productivity — generating and operating Word, Excel, and PowerPoint files, context-switching between diverse software environments, and collaborating across agent and human teams. It completes evaluations 37% faster than M2.1 while being cost-efficient enough to run continuously for $1 per hour.

Author
MiniMaxMiniMax
Release Date
2026-02-12
Knowledge Cutoff
Unknown
License
Open Model
I/O Format
Context Length
197K / 66K
API I/O (1M)
$0.15 / $1.15
How to Use
API Access
Output Speed
104 tok/s
Arena Overall
1400
Intelligence Index
41.9
Coding Index
37.4
Math Index
LiveBench
60.3
ForecastBench
GPQA Diamond
84.8%
HLE
19.1%
MMLU-Pro
AIME 2025
MATH-500
LB Reasoning
59.3
LB Math
77.4
LB Data Analysis
49.6
LiveCodeBench
LB Coding
70.7
LB Agentic
51.7
TAU2
95.3%
TerminalBench
34.8%
SciCode
42.6%
IFBench
71.6%
AA-LCR
0.7
Hallucination (HHEM)
Factual Consistency (HHEM)
LB Language
55.1
LB Instruction Following
57.2