DeepSeek R1 Distill Qwen 1.5B
Model Type
Text
Max Context
64K
Max Output
16,000 tokens
Parameter Scale
1.5B
DeepSeek R1 is here: performance on par with OpenAI o1, but open source and with fully open reasoning tokens. The context window is 64,000 tokens and the maximum output is 16,000 tokens. Two providers serve the model for higher uptime. The listing includes independent benchmarks from Artificial Analysis (AA).
Input Modality
text
Output Modality
text
Inference Speed
-
Latest Release Date
1/20/2025
SDK Ecosystem
-
Name
DeepSeek R1 Distill Qwen 1.5B
Release date
1/20/2025
Performance
-
Model Creator
DeepSeek
AA Intelligence Index
9.1
AA Coding Index
-
AA Math Index
22
MMLU Pro
0.269
GPQA
0.098
HLE
0.033
LiveCodeBench
0.07
SciCode
0.066
Math-500
0.687
AIME
0.177
AIME 2025
0.22
IFBench
0.132
LCR
0.003
TerminalBench Hard
-
TAU2
-
Output Speed (tokens/s)
-
TTFT (s)
-
First Answer Token (s)
-
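
The context and output limits listed above (64,000 and 16,000 tokens) can be turned into a simple request-budget check. The following is a minimal sketch, not a provider API: the function name `output_budget` is hypothetical, and it assumes that prompt and output tokens share the same context window, which the listing does not state.

```python
# Limits quoted in the listing above.
MAX_CONTEXT_TOKENS = 64_000
MAX_OUTPUT_TOKENS = 16_000


def output_budget(prompt_tokens: int, requested_output: int) -> int:
    """Return how many output tokens can be requested for a given prompt.

    Assumption (not stated in the listing): prompt and output tokens
    count against the same 64,000-token context window.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    remaining = MAX_CONTEXT_TOKENS - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt fills or exceeds the context window")
    # The request is capped by both the output limit and the space left.
    return min(requested_output, MAX_OUTPUT_TOKENS, remaining)
```

Under that assumption, a 60,000-token prompt leaves room for at most 4,000 output tokens, while a short prompt is capped by the 16,000-token output limit.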

