
Alibaba | Qwen3.5 4B (Non-reasoning)
Qwen3.5 4B Non reasoning by Alibaba: Text model; TTFT 0.488s, 199.5 tok/s.
artificial-analysismanufactureraa-bootstrap
Latency
488ms
Throughput
-
Total Context
-
Max Output
-
Input Price
$0.03/M
Output Price
$0.15/M
API Parameters & Capabilities
Model Type
-
Parameter Size
-
Input Modality
-
Output Modality
-
Inference Speed
199.514 tokens/s
Success Rate
-
Peak Concurrency
-
Release Date
5/7/2026
Integration & Pricing Details
Pricing Mode
-
Free Tier
-
Supported Languages
-
SDK
-
API Key Acquisition
-
Rate Limit
-
User Reviews
0 verified user reviews
Loading reviews...
