LiveBench coding category score (0 to 100). Evaluates algorithm implementation, bug fixing, code comprehension, and related tasks.
OpenAI
GPT-5.5
Google
Gemini 3 Flash
Moonshot AI
Kimi K2.6
Anthropic
Claude Opus 4.5
Claude Opus 4.6
Alibaba
Qwen3.6 Plus
Kimi K2.5
GPT-5.4
Claude Sonnet 4
Claude Opus 4.7
Gemini 3.1 Pro
Claude Sonnet 4.5
GPT-5 Mini
DeepSeek
DeepSeek V3.2
Gemini 2.5 Pro
Z.ai
GLM-5.1
GPT-5.4 Mini
Claude Opus 4.1
Claude Sonnet 4.6
GLM-5
Claude Haiku 4.5
GPT-5
MiniMax
MiniMax M2.5
Grok
Grok 4.1 Fast (Reasoning)
Xiaomi
MiMo-V2-Pro
Gemini 3.1 Flash Lite
GPT-5 Nano
Gemini 2.5 Flash Lite
Grok 4.20 (Reasoning)
Gemini 2.5 Flash
Arcee AI
Trinity Large Thinking
GPT-5.4 Nano
Gemma 4 31B
GPT OSS 120B
Grok 4.20
MiniMax M2.7
Grok 4.1 Fast
NVIDIA
Nemotron 3 Super