DeepSeek R1 Distill Llama 8B

Last updated: Recently

Model Type

Text

Max Context

64K

Max Output

16,000 tokens

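Parameter Scale

8B

Description

DeepSeek R1 Distill Llama 8B is an open-weights model in the DeepSeek R1 family with fully open reasoning tokens. The source description credits DeepSeek R1 with performance on par with OpenAI o1; the benchmark scores listed on this page are the figures for this 8B distilled model. It has a 64,000-token context window and a maximum output of 16,000 tokens, and is served by 2 providers for higher uptime. Independent benchmarks from Artificial Analysis are included.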
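The context and output limits listed above can be turned into a simple prompt budget. The following is a minimal sketch, not a provider API: the function name `max_prompt_tokens` is hypothetical, and it assumes that prompt and output tokens share the same 64,000-token window, which this page does not confirm.

```python
# Limits as listed on this model card.
MAX_CONTEXT_TOKENS = 64_000
MAX_OUTPUT_TOKENS = 16_000


def max_prompt_tokens(requested_output: int = MAX_OUTPUT_TOKENS) -> int:
    """Return the largest prompt size that still leaves room for the output.

    Assumption (not stated on this page): prompt tokens and output tokens
    count against the same context window.
    """
    if not 0 < requested_output <= MAX_OUTPUT_TOKENS:
        raise ValueError(
            f"requested_output must be between 1 and {MAX_OUTPUT_TOKENS}"
        )
    return MAX_CONTEXT_TOKENS - requested_output


print(max_prompt_tokens())       # full 16,000-token output reserved
print(max_prompt_tokens(4_000))  # smaller output leaves more room for the prompt
```

Under that assumption, reserving the full output allowance leaves 48,000 tokens for the prompt. Because reasoning tokens are emitted as part of the output, a generous output reservation is likely the safer default for this model.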

Input Modality

text

Output Modality

text

Inference Speed

Latest Release Date

1/20/2025

SDK Ecosystem

artificial-analysis, manufacturer, aa-bootstrap

Model Overview Fields

name

DeepSeek R1 Distill Llama 8B

Release date

1/20/2025

Performance

model_creator.name

DeepSeek

AA Evaluation Scores

AA Intelligence Index

12.1

AA Coding Index

AA Math Index

41.3

MMLU Pro

0.543

GPQA

0.302

HLE

0.042

LiveCodeBench

0.233

SciCode

0.119

Math-500

0.853

AIME

0.333

AIME 2025

0.413

IFBench

0.176

LCR

TerminalBench Hard

TAU2
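The per-benchmark scores above appear to be fractions between 0 and 1, while the AA indices use a 0 to 100 scale. As a minimal sketch, assuming that reading is correct, the fractions can be shown as percentages for easier comparison. The helper name `as_percent` is hypothetical, and only a handful of the listed scores are included.

```python
# A few benchmark scores copied from this page (assumed to be fractions of 1).
scores = {
    "MMLU Pro": 0.543,
    "GPQA": 0.302,
    "HLE": 0.042,
    "LiveCodeBench": 0.233,
    "SciCode": 0.119,
}


def as_percent(score: float, digits: int = 1) -> str:
    """Format a 0..1 score as a percentage string."""
    if not 0.0 <= score <= 1.0:
        raise ValueError("score must be a fraction between 0 and 1")
    return f"{score * 100:.{digits}f}%"


for name, score in scores.items():
    print(f"{name:<14} {as_percent(score)}")
```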

Output Speed (tokens/s)

TTFT (s)

First Answer Token (s)

DeepSeek

DeepSeek R1 Distill Llama 8B

$0.00/M, $0.00/M

