Overall Elo score aggregated from millions of real user preference votes using the Bradley-Terry model. Higher scores indicate models more preferred by actual users.
Anthropic
Claude Opus 4.7
Claude Opus 4.6
Google
Gemini 3.1 Pro
Meta
Muse Spark
Grok
Grok 4.20
Grok 4.20 (Reasoning)
OpenAI
GPT-5.4
GPT-5.4 Pro
Gemini 3 Flash
Claude Opus 4.5
Z.ai
GLM-5.1
DeepSeek
DeepSeek V4 Pro
Claude Sonnet 4.6
Moonshot AI
Kimi K2.6
GLM-5
GPT-5.4 Mini
Claude Sonnet 4.5
Gemma 4 31B
Baidu
ERNIE 5.0 Thinking
Kimi K2.5
Claude Opus 4.1
Gemini 2.5 Pro
Xiaomi
MiMo-V2-Pro
Alibaba
Qwen3.6 Plus
Qwen3.5 397B A17B
DeepSeek V4 Flash
Gemini 3.1 Flash Lite
Meituan
Longcat Flash Chat
GPT-5
Grok 4.1 Fast
Grok 4.1 Fast (Reasoning)
DeepSeek V3.2
Claude Opus 4
Gemini 2.5 Flash
Claude Haiku 4.5
GPT-5.4 Nano
MiniMax
MiniMax M2.7
MiniMax M2.5
Claude Sonnet 4
GPT-5 Mini
Gemini 2.5 Flash Lite
Arcee AI
Trinity Large Thinking
NVIDIA
Nemotron 3 Super
GPT OSS 120B
Amazon
Nova 2 Lite
GPT-5 Nano
Llama 4 Maverick
Llama 4 Scout
GPT-4.1