crawler-status-logs-v1
AI API Hunt
Q

Qwen3.5 4B (Non-reasoning)

Last updated: Recently

Model Type

Text

Max Context

-

Max Output

-

Parameter Scale

-

Qwen3.5 4B Non reasoning by Alibaba: Text model; TTFT 0.488s, 199.5 tok/s.

Input Modality

-

Output Modality

-

Inference Speed

199.514 tokens/s

Latest Release Date

3/2/2026

SDK Ecosystem

-

artificial-analysismanufactureraa-bootstrap
Model Overview Fields

name

Qwen3.5 4B (Non-reasoning)

Release date

3/2/2026

Performance

199.514 tokens/s

model_creator.name

Alibaba

AA Evaluation Scores

AA Intelligence Index

22.6

AA Coding Index

13.7

AA Math Index

-

MMLU Pro

-

GPQA

0.712

HLE

0.075

LiveCodeBench

-

SciCode

0.183

Math-500

-

AIME

-

AIME 2025

-

IFBench

0.333

LCR

0.283

TerminalBench Hard

0.114

TAU2

0.877

Output Speed (tokens/s)

199.514 tok/s

TTFT (s)

0.488s

First Answer Token (s)

0.488s

Share:
Report / Feedback