Hermes 4 – Llama-3.1 70B (Reasoning)
Analysis Summary
Hermes 4 – Llama-3.1 70B (Reasoning) sits in the Efficient tier on our leaderboard, ranked #206 of 557 published models on overall intelligence. At $0.130 input and $0.400 output per 1M tokens, it is among the most expensive on the market.
Editorial notes
Hermes 4 Llama-3.1 70B Reasoning offers solid math and coding benchmark scores at low cost, but the intelligence index is limited, capping its usefulness for complex reasoning tasks.
Assessed May 14, 2026
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
Performance Indices
Source: Artificial Analysis
Benchmark Scores
Intelligence
Technical
Content
Benchmark data from Artificial Analysis and Hugging Face
How does Hermes 4 – Llama-3.1 70B (Reasoning) stack up?
Compare side-by-side with other efficient models.
Model Information
| Provider | Nous Research |
| Release Date | August 27, 2025 |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.13 | $0.000130 |
| Output | $0.40 | $0.000400 |
Explore Related Models
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: May 20, 2026 8:38 pm