DeepSeek R1 Distill Qwen 32B

Model Type

Text

Max Context

32,768 tokens

Max Output

32,768 tokens

Parameter Scale

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on Qwen 2.5 32B, using outputs from DeepSeek R1. It has a 32,768-token context window and a maximum output of 32,768 tokens. This page includes independent benchmarks from Artificial Analysis.
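
Because the context window and the maximum output are both listed as 32,768 tokens, a request can only use the full output limit when the prompt is nearly empty. The sketch below estimates how many completion tokens remain for a given prompt size. It assumes that prompt and completion tokens share the same context window, which is common but not confirmed on this page; the function name max_completion_tokens is hypothetical and not part of any SDK.

```python
# Limits copied from this page.
CONTEXT_WINDOW = 32_768  # total tokens the model can attend to
MAX_OUTPUT = 32_768      # listed maximum output tokens


def max_completion_tokens(prompt_tokens: int,
                          context_window: int = CONTEXT_WINDOW,
                          max_output: int = MAX_OUTPUT) -> int:
    """Estimate the completion budget left after the prompt.

    Assumption (not stated on this page): prompt and completion
    tokens are counted against the same context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = context_window - prompt_tokens
    # Never exceed the listed output cap, never go below zero.
    return max(0, min(max_output, remaining))


if __name__ == "__main__":
    for n in (0, 2_768, 40_000):
        print(n, max_completion_tokens(n))
```

Under that assumption, a 2,768-token prompt leaves 30,000 tokens for the answer, and a prompt longer than the context window leaves none. Reasoning-style models tend to spend part of this budget on intermediate reasoning, so it is worth leaving headroom.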

Input Modality

text

Output Modality

text

Inference Speed

Latest Release Date

1/20/2025

SDK Ecosystem

artificial-analysis, manufacturer, aa-bootstrap

Model Overview Fields

name

DeepSeek R1 Distill Qwen 32B

Release date

1/20/2025

Performance

model_creator.name

DeepSeek

AA Evaluation Scores

AA Intelligence Index

17.2

AA Coding Index

AA Math Index

MMLU Pro

0.739

GPQA

0.615

HLE

0.055

LiveCodeBench

0.27

SciCode

0.376

Math-500

0.940666666666667

AIME

0.686666666666667

AIME 2025

0.63

IFBench

0.229251700680272

LCR

0.0966666666666667

TerminalBench Hard

TAU2

Output Speed (tokens/s)

TTFT (s)

First Answer Token (s)

DeepSeek

DeepSeek R1 Distill Qwen 32B

$0.00/M / $0.00/M
