LongCat Flash Chat is a large-scale Mixture-of-Experts model from Meituan with 560 billion total parameters, dynamically activating 18.6B to 31.3B (averaging ~27B) based on contextual demands. Its shortcut-connected MoE design achieves over 100 tokens per second during inference while supporting a 128K-token context window. The model delivers highly competitive performance in reasoning, coding, and instruction following, with exceptional strengths in agentic tasks and complex multi-step tool-use interactions.
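The variable activation budget comes from the router choosing, per token, among experts that may or may not carry compute. As a minimal illustration (not LongCat's actual implementation), the sketch below assumes a pool of "real" experts plus zero-computation identity experts: tokens whose top-k picks land on identities skip FFN work, so the number of activated parameters varies with the input.

```python
import numpy as np

# Hypothetical sketch: top-k routing over real + zero-computation experts.
# Expert ids below num_real do real FFN work; the rest are identities,
# so each token's activated-parameter count depends on its routing.

def route_tokens(hidden, router_w, num_real, k=2):
    """Return each token's top-k expert ids and its count of real experts."""
    logits = hidden @ router_w                   # (tokens, total_experts)
    topk = np.argsort(logits, axis=-1)[:, -k:]   # top-k expert ids per token
    real_hits = (topk < num_real).sum(axis=-1)   # identity experts cost ~0
    return topk, real_hits

rng = np.random.default_rng(0)
hidden = rng.standard_normal((8, 16))            # 8 tokens, hidden dim 16
router_w = rng.standard_normal((16, 12))         # 8 real + 4 zero experts
topk, real_hits = route_tokens(hidden, router_w, num_real=8, k=2)
print(real_hits)  # per-token count of compute-bearing experts, 0..k
```

Because `real_hits` differs across tokens, average activated compute sits between the all-identity floor and the all-real ceiling, which is the mechanism behind an average budget (~27B here) lying strictly inside the 18.6B to 31.3B range.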