Hermes 4 – Llama-3.1 405B (Non-reasoning)
Analysis Summary
Hermes 4 – Llama-3.1 405B (Non-reasoning) sits in the Efficient tier on our leaderboard, ranked #212 of 557 published models on overall intelligence. At $1.00 input and $3.00 output per 1M tokens, it is among the most expensive on the market.
Editorial notes
Hermes 4 Llama-3.1 405B Non-reasoning delivers reasonable coding and content capability on a large open-weight base, but intelligence and agentic scores are low relative to its $1/1M input price.
Assessed May 14, 2026
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
Performance Indices
Source: Artificial Analysis
Benchmark Scores
Intelligence
Technical
Content
Benchmark data from Artificial Analysis and Hugging Face
How does Hermes 4 – Llama-3.1 405B (Non-reasoning) stack up?
Compare side-by-side with other efficient models.
Model Information
| Provider | Nous Research |
| Release Date | August 27, 2025 |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $1.00 | $0.001000 |
| Output | $3.00 | $0.003000 |
Leaderboard Categories
Explore Related Models
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: May 20, 2026 8:38 pm