Grok 4 Fast (Non-reasoning)
Model Type
Text
Max Context
2,000,000 tokens
Max Output
30,000 tokens
Parameter Scale
-
Grok 4 Fast (Non-reasoning) has a 2,000,000-token context window and a maximum output of 30,000 tokens. The figures below include independent benchmarks from Artificial Analysis (AA).
Input Modality
text, image, file
Output Modality
text
Inference Speed
79.156 tokens/s
Latest Release Date
9/19/2025
SDK Ecosystem
-
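The two limits listed above can be turned into a simple pre-flight check before sending a request. The sketch below is illustrative only: the function name `fits_budget` is hypothetical, and it assumes that prompt tokens and output tokens share the same 2,000,000-token window, which the listing does not state explicitly.

```python
# Limits copied from the listing above.
MAX_CONTEXT_TOKENS = 2_000_000
MAX_OUTPUT_TOKENS = 30_000


def fits_budget(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within the listed limits.

    Assumption (not confirmed by the listing): the prompt and the
    generated output both count against the same context window.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # The output cap applies regardless of how small the prompt is.
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + requested_output_tokens <= MAX_CONTEXT_TOKENS


if __name__ == "__main__":
    print(fits_budget(1_000_000, 30_000))
    print(fits_budget(1_980_000, 30_000))
```

Token counts depend on the tokenizer, so in practice the prompt size would come from the provider's own token counter rather than from a character-based estimate.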
Name
Grok 4 Fast (Non-reasoning)
Release date
9/19/2025
Performance
79.156 tokens/s
Model Creator
xAI
AA Intelligence Index
23.1
AA Coding Index
19
AA Math Index
41.3
MMLU Pro
0.73
GPQA
0.606
HLE
0.05
LiveCodeBench
0.401
SciCode
0.329
Math-500
-
AIME
-
AIME 2025
0.413
IFBench
0.377
LCR
0.2
TerminalBench Hard
0.121
TAU2
0.637
Output Speed (tokens/s)
79.156
Time to First Token, TTFT (s)
0.411
First Answer Token (s)
0.411
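As a rough illustration of what the speed and latency figures imply, total response time can be approximated as the time to first token plus the output length divided by the output speed. This is a back-of-the-envelope sketch, not a measurement: it assumes the listed output speed is sustained for the whole response, and the function name `estimate_response_seconds` is hypothetical.

```python
# Figures copied from the listing above.
OUTPUT_SPEED_TOK_PER_S = 79.156
TTFT_S = 0.411


def estimate_response_seconds(output_tokens: int) -> float:
    """Approximate wall-clock time for a response of the given length.

    Assumes a constant output speed after the first token; real
    throughput varies with load, prompt size, and provider.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_S + output_tokens / OUTPUT_SPEED_TOK_PER_S


if __name__ == "__main__":
    for n in (500, 5_000, 30_000):
        print(f"{n:>6} tokens: about {estimate_response_seconds(n):.1f} s")
```

Under these assumptions, a response at the 30,000-token output cap would take roughly 379 seconds, a little over six minutes.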

