
Nous Research | Hermes 4 - Llama-3.1 70B (Non-reasoning)
Hermes 4 Llama 3.1 70B Non reasoning by Nous Research: Text model; TTFT 0.613s, 76.7 tok/s.
artificial-analysismanufactureraa-bootstrap
Latency
613ms
Throughput
-
Total Context
-
Max Output
-
Input Price
$0.13/M
Output Price
$0.4/M
API Parameters & Capabilities
Model Type
-
Parameter Size
-
Input Modality
-
Output Modality
-
Inference Speed
76.742 tokens/s
Success Rate
-
Peak Concurrency
-
Release Date
5/7/2026
Integration & Pricing Details
Pricing Mode
-
Free Tier
-
Supported Languages
-
SDK
-
API Key Acquisition
-
Rate Limit
-
User Reviews
0 verified user reviews
Loading reviews...
