DeepSeek R1 Distill Qwen 1.5B

Last updated: Recently

Model Type

Text

Max Context

64K

Max Output

16,000 tokens

Parameter Scale

DeepSeek R1 is here: Performance on par with OpenAI o1, but open sourced and with fully open reasoning tokens. 64,000 token context window, maximum output of 16,000 tokens. Higher uptime with 2 providers. Includes independent benchmarks from Artificial Analysis.

Input Modality

text

Output Modality

text

Inference Speed

Latest Release Date

1/20/2025

SDK Ecosystem

artificial-analysismanufactureraa-bootstrap

Model Overview Fields

name

DeepSeek R1 Distill Qwen 1.5B

Release date

1/20/2025

Performance

model_creator.name

DeepSeek

AA Evaluation Scores

AA Intelligence Index

9.1

AA Coding Index

AA Math Index

MMLU Pro

0.269

GPQA

0.098

HLE

0.033

LiveCodeBench

0.07

SciCode

0.066

Math-500

0.687333333333333

AIME

0.176666666666667

AIME 2025

0.22

IFBench

0.131972789115646

LCR

0.00333333333333333

TerminalBench Hard

TAU2

Output Speed (tokens/s)

TTFT (s)

First Answer Token (s)

DeepSeek

DeepSeek R1 Distill Qwen 1.5B

$0.00/M$0.00/M

There are no reviews for this model yet.

artificial-analysismanufactureraa-bootstrap

0.0