1 / 3
Swipe to compare

LongCat Flash Chat is a large-scale Mixture-of-Experts model from Meituan with 560 billion total parameters, dynamically activating 18.6B to 31.3B (averaging ~27B) based on contextual demands. Its shortcut-connected MoE design achieves over 100 tokens per second during inference while supporting a 128K-token context window. The model delivers highly competitive performance in reasoning, coding, and instruction following, with exceptional strengths in agentic tasks and complex multi-step tool-use interactions.

Author
MeituanMeituan
Release Date
2025-09-09
Knowledge Cutoff
2025-03-31
License
Open Model
I/O Format
Context Length
131K / 131K
API I/O (1M)
$0.2 / $0.8
How to Use
Output Speed
115 tok/s
Arena Overall
1434
Intelligence Index
23.9
Coding Index
16.5
Math Index
LiveBench
ForecastBench
GPQA Diamond
63.6%
HLE
6.0%
MMLU-Pro
AIME 2025
MATH-500
LB Reasoning
LB Math
LB Data Analysis
LiveCodeBench
LB Coding
LB Agentic
TAU2
79.5%
TerminalBench
10.6%
SciCode
28.4%
IFBench
43.1%
AA-LCR
0.3
Hallucination (HHEM)
Factual Consistency (HHEM)
LB Language
LB Instruction Following