Mistral AI
Mistral AI

Mistral Small 4

2026-03-16

Mistral Small 4 is the first Mistral model to unify three flagship capabilities into a single system: strong reasoning from Magistral, multimodal understanding from Pixtral, and agentic coding from Devstral. Built on a 119B-parameter Mixture-of-Experts architecture activating just 6.5B parameters per token, it offers a 256K-token context window with configurable reasoning effort — from lightweight instant responses to deep step-by-step analysis. Fully open source, it delivers a 40% reduction in latency and 3× throughput improvement over Mistral Small 3, making it a versatile and efficient choice for coding, analysis, and vision tasks.

API|VisionReasoning|Open ModelApache 2.0
Knowledge Cutoff
2025
Input → Output Format
Context Memory
262KIN262KOUT
Cost/1M Words
$0.15IN$0.6OUT
Calculate Cost

AI Performance Evaluation

Overall
AA Intelligence Index
28%↓11%
Reasoning & Math
GPQA Diamond
77%↓4%
HLE
9.5%↓8%
MMLU-Pro
42%↓40%
MATH-500
56%↓37%
Coding
AA Coding Index
24%↓10%
LiveCodeBench
11%↓54%
TAU2
41%↓32%
TerminalBench
17%↓14%
SciCode
38%↓3%
Language & Instructions
IFBench
48%↓9%
AA-LCR
45%↓17%
Output Speed
Standard Mode
151tok/s↑69
First Output 0.52s
Reasoning Mode
164tok/s↑76
First Output 12.79s