Qwen3.5 4B (Non-reasoning)

Last updated: Recently

Model Type

Text

Max Context

Max Output

Parameter Scale

Qwen3.5 4B Non reasoning by Alibaba: Text model; TTFT 0.488s, 199.5 tok/s.

Input Modality

Output Modality

Inference Speed

199.514 tokens/s

Latest Release Date

3/2/2026

SDK Ecosystem

artificial-analysismanufactureraa-bootstrap

Model Overview Fields

name

Qwen3.5 4B (Non-reasoning)

Release date

3/2/2026

Performance

199.514 tokens/s

model_creator.name

Alibaba

AA Evaluation Scores

AA Intelligence Index

22.6

AA Coding Index

13.7

AA Math Index

MMLU Pro

GPQA

0.712

HLE

0.075

LiveCodeBench

SciCode

0.183

Math-500

AIME

AIME 2025

IFBench

0.333

LCR

0.283

TerminalBench Hard

0.114

TAU2

0.877

Output Speed (tokens/s)

199.514 tok/s

TTFT (s)

0.488s

First Answer Token (s)

0.488s

Alibaba

◎

Qwen3.5 4B (Non-reasoning)

$0.03/M$0.15/M

Qwen3.5 4B Non reasoning by Alibaba: Text model; TTFT 0.488s, 199.5 tok/s.

There are no reviews for this model yet.

artificial-analysismanufactureraa-bootstrap

0.0