
IBM | Granite 4.0 H Small
Granite 4.0 H Small by IBM: Text model; TTFT 8.716s, 502.3 tok/s.
artificial-analysismanufactureraa-bootstrap
Latency
8716ms
Throughput
-
Total Context
-
Max Output
-
Input Price
$0.06/M
Output Price
$0.25/M
API Parameters & Capabilities
Model Type
-
Parameter Size
-
Input Modality
-
Output Modality
-
Inference Speed
502.349 tokens/s
Success Rate
-
Peak Concurrency
-
Release Date
5/7/2026
Integration & Pricing Details
Pricing Mode
-
Free Tier
-
Supported Languages
-
SDK
-
API Key Acquisition
-
Rate Limit
-
User Reviews
0 verified user reviews
Loading reviews...
