
Nous Research | Hermes 4 - Llama-3.1 405B (Non-reasoning)
Hermes 4 - Llama-3.1 405B (Non-reasoning) by Nous Research: text model; time to first token (TTFT) 0.744 s, output speed 33.7 tokens/s.
Latency
744ms
Throughput
-
Total Context
-
Max Output
-
Input Price
$1/M
Output Price
$3/M
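As a rough illustration of how the listed figures combine, the sketch below estimates the cost and wall-clock time of a single request. It assumes strictly linear per-token pricing (no caching discounts or minimum charges) and a constant output speed after the first token. Neither assumption is confirmed by this listing, and the function names and token counts are hypothetical.

```python
# Figures copied from the listing; everything else here is an illustrative assumption.
INPUT_PRICE_PER_M = 1.00   # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 3.00  # USD per 1M output tokens
TTFT_S = 0.744             # time to first token, in seconds
TOKENS_PER_S = 33.66       # output speed, in tokens per second


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost, assuming linear per-token pricing."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def estimate_latency_s(output_tokens: int) -> float:
    """Estimate response time as TTFT plus generation time at a constant speed."""
    return TTFT_S + output_tokens / TOKENS_PER_S


if __name__ == "__main__":
    # Hypothetical request: 2,000 prompt tokens and 500 generated tokens.
    print(f"cost:    ${estimate_cost_usd(2000, 500):.4f}")
    print(f"latency: {estimate_latency_s(500):.1f} s")
```

Under these assumptions, a 2,000-token prompt with a 500-token reply would cost about $0.0035 and take roughly 15.6 seconds; measured latency will vary with provider load and prompt length.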
API Parameters & Capabilities
Model Type
-
Parameter Size
-
Input Modality
-
Output Modality
-
Inference Speed
33.660 tokens/s
Success Rate
-
Peak Concurrency
-
Release Date
5/7/2026
Integration & Pricing Details
Pricing Mode
-
Free Tier
-
Supported Languages
-
SDK
-
API Key Acquisition
-
Rate Limit
-
User Reviews
0 verified user reviews
