DeepSeek R1 Distill Qwen 32B
Last updated: Recently
Model Type
Text
Max Context
32.8K
Max Output
32,768 tokens
Parameter Scale
-
DeepSeek R1 Distill Qwen 32B is distilled large language model based on Qwen 2.5 32B, using outputs from DeepSeek R1. 32,768 token context window, maximum output of 32,768 tokens. Includes independent benchmarks from Artificial Analysis.
Input Modality
text
Output Modality
text
Inference Speed
-
Latest Release Date
1/20/2025
SDK Ecosystem
-
name
DeepSeek R1 Distill Qwen 32B
Release date
1/20/2025
Performance
-
model_creator.name
DeepSeek
AA Intelligence Index
17.2
AA Coding Index
-
AA Math Index
63
MMLU Pro
0.739
GPQA
0.615
HLE
0.055
LiveCodeBench
0.27
SciCode
0.376
Math-500
0.940666666666667
AIME
0.686666666666667
AIME 2025
0.63
IFBench
0.229251700680272
LCR
0.0966666666666667
TerminalBench Hard
-
TAU2
-
Output Speed (tokens/s)
-
TTFT (s)
-
First Answer Token (s)
-

