Grok 3 Mini

xAI Best for: coding
Compare this model →
Categories
MATH / AIME 95.8%
reasoning
LiveCodeBench 80.4%
coding
Chatbot Arena (LMSYS) 1366
general
SimpleQA 21.7%
general
SWE-bench
agenticcoding
HumanEval / MBPP
coding
MMLU
general
GPQA (Diamond)
reasoning
TAU-bench
agenticmultiagent
GAIA
agenticmultiagent
WebArena
agentic
MT-Bench
general
AgentBench
multiagent
IFEval
generalagentic

Data unavailable

Data unavailable

Context window
Max output tokens
Input modalities
Output modalities
Supports reasoning
Supports tool use