AI Model Comparison

Our Story

MiniMax M2.5 is a frontier language model trained with reinforcement learning across hundreds of thousands of complex real-world environments, achieving state-of-the-art scores of 80.2% on SWE-Bench Verified, 51.3% on Multi-SWE-Bench, and 76.3% on BrowseComp. Building on the coding expertise of M2.1, it extends into general office productivity: generating and operating Word, Excel, and PowerPoint files, switching context between diverse software environments, and collaborating across agent and human teams. It completes evaluations 37% faster than M2.1 and is cost-efficient enough to run continuously for $1 per hour.

Author

MiniMax

Release Date

2026-02-12

Knowledge Cutoff

Unknown

License

Open Model

I/O Format

Context Length

197K / 66K

API Price, Input / Output (per 1M tokens)

$0.15 / $1.15
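
As a rough illustration of how these prices translate into the cost of a single request, the sketch below assumes the two figures are the input and output prices in US dollars per 1M tokens, in that order. The helper name `estimate_cost` and the token counts in the example are hypothetical and not part of any MiniMax API.

```python
# Prices taken from the card; assumed to be USD per 1M tokens (input / output).
INPUT_PRICE_PER_M = 0.15
OUTPUT_PRICE_PER_M = 1.15


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Illustrative request: a 20,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost(20_000, 2_000):.4f}")
```

Because output tokens are priced several times higher than input tokens here, long generations tend to dominate the bill even when prompts are large.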

How to Use

API Access

Output Speed

104 tok/s
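
To relate this output speed to generation time and spend, the sketch below assumes a constant 104 tokens per second and the output price of $1.15 per 1M tokens listed on this card. It ignores input-token charges, network latency, and any variation in speed, and the helper names are hypothetical.

```python
OUTPUT_TOKENS_PER_SECOND = 104  # output speed from the card, assumed constant
OUTPUT_PRICE_PER_M = 1.15       # assumed USD per 1M output tokens


def generation_seconds(output_tokens: int) -> float:
    """Seconds needed to stream `output_tokens` at the assumed speed."""
    return output_tokens / OUTPUT_TOKENS_PER_SECOND


def hourly_output_cost() -> float:
    """USD spent on output tokens when generating non-stop for one hour."""
    tokens_per_hour = OUTPUT_TOKENS_PER_SECOND * 3600
    return tokens_per_hour * OUTPUT_PRICE_PER_M / 1_000_000


if __name__ == "__main__":
    print(f"{generation_seconds(10_400):.0f} s for 10,400 tokens")
    print(f"${hourly_output_cost():.2f} per hour of continuous output")
```

This is an output-only estimate; a real workload would also pay for input tokens on every call.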

Arena Overall

1400

Intelligence Index

41.9

Coding Index

37.4

Math Index

—

LiveBench

60.3

ForecastBench

—

GPQA Diamond

84.8%

HLE

19.1%

MMLU-Pro

—

AIME 2025

—

MATH-500

—

LB Reasoning

59.3

LB Math

77.4

LB Data Analysis

49.6

LiveCodeBench

—

LB Coding

70.7

LB Agentic

51.7

TAU2

95.3%

TerminalBench

34.8%

SciCode

42.6%

IFBench

71.6%

AA-LCR

0.7

Hallucination (HHEM)

—

Factual Consistency (HHEM)

—

LB Language

55.1

LB Instruction Following

57.2
