DeepSeek R1 Distill Llama 8B
Last updated: Recently
Model Type
Text
Max Context
64K
Max Output
16,000 tokens
Parameter Scale
-
DeepSeek R1 is here: Performance on par with OpenAI o1, but open sourced and with fully open reasoning tokens. 64,000 token context window, maximum output of 16,000 tokens. Higher uptime with 2 providers. Includes independent benchmarks from Artificial Analysis.
Input Modality
text
Output Modality
text
Inference Speed
-
Latest Release Date
1/20/2025
SDK Ecosystem
-
name
DeepSeek R1 Distill Llama 8B
Release date
1/20/2025
Performance
-
model_creator.name
DeepSeek
AA Intelligence Index
12.1
AA Coding Index
-
AA Math Index
41.3
MMLU Pro
0.543
GPQA
0.302
HLE
0.042
LiveCodeBench
0.233
SciCode
0.119
Math-500
0.852666666666667
AIME
0.333333333333333
AIME 2025
0.413333333333333
IFBench
0.175510204081633
LCR
0
TerminalBench Hard
-
TAU2
-
Output Speed (tokens/s)
-
TTFT (s)
-
First Answer Token (s)
-

