Hermes 4 – Llama-3.1 70B (Non-reasoning)
Analysis Summary
Hermes 4 on Llama-3.1 70B (Non-reasoning) is a cost-efficient fine-tune from Nous Research targeting general instruction following at the 70B scale. Its MMLU-Pro of 0.664 and GPQA of 0.491 are respectable for the model size, though its LiveCodeBench score of 0.269 and low long-context reliability limit its technical utility.
For businesses, this model is best suited to lighter content tasks: drafting, summarisation, and structured text generation where budget is a constraint. Its agentic index is low, and its coding capability falls short of what most software teams would require. The very low price of $0.13 per million input tokens makes it attractive for high-volume, lower-complexity workloads.
Teams running cost-sensitive content pipelines or needing a capable general-purpose model without a large budget will find value here, but should look to the 405B or reasoning variants for more demanding tasks.
Assessed June 6, 2026
Editorial notes
Hermes 4 – Llama-3.1 70B (Non-reasoning) from Nous Research offers moderate general capability at a low price of $0.13 input and $0.40 output per million tokens, with limited coding and agentic depth.
Rankings consider pricing, capabilities, benchmarks, and real-world applicability, and are refreshed as new models launch.
Performance Profile
How Hermes 4 – Llama-3.1 70B (Non-reasoning) compares
Hermes 4 – Llama-3.1 70B (Non-reasoning) ranks #275 of 378 AI models we track for overall intelligence, #249 of 315 for coding, and #170 of 289 for agentic tasks. At $0.13 per million input tokens, it is cheaper than 59% of comparable models.
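To put the three rankings on a common scale, each rank can be converted into the share of tracked models that rank below it. The snippet below is a minimal sketch of that arithmetic using only the figures quoted above; the function name `share_ranked_below` is hypothetical, and it assumes ranks are unique (no ties), which the page does not state.

```python
def share_ranked_below(rank: int, total: int) -> float:
    """Fraction of tracked models with a worse rank (assumes no ties)."""
    return (total - rank) / total


# (rank, number of models tracked) as quoted in the comparison above
rankings = {
    "overall intelligence": (275, 378),
    "coding": (249, 315),
    "agentic tasks": (170, 289),
}

for category, (rank, total) in rankings.items():
    share = share_ranked_below(rank, total)
    print(f"{category}: ahead of {share:.0%} of tracked models")
```

Read this way, the model sits ahead of roughly 27% of tracked models on overall intelligence, 21% on coding, and 41% on agentic tasks.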
[Charts: Performance Indices (source: Artificial Analysis) and Benchmark Scores by category (Intelligence, Technical, Content)]
Benchmark data from Artificial Analysis and Hugging Face
Model Information
| Field | Value |
|---|---|
| Provider | Nous Research |
| Release Date | August 27, 2025 |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.13 | $0.000130 |
| Output | $0.40 | $0.000400 |
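For budgeting, the per-token prices in the table translate into a workload cost with simple arithmetic. The sketch below assumes the listed prices apply uniformly with no caching discounts, minimum charges, or provider-specific fees; the function name `estimate_cost_usd` and the example workload (10,000 requests per day, 2,000 input and 500 output tokens each) are hypothetical illustrations, not figures from this page.

```python
# Prices from the pricing table, in USD per 1M tokens
INPUT_USD_PER_MILLION = 0.13
OUTPUT_USD_PER_MILLION = 0.40


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate spend in USD for a given number of input and output tokens."""
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


# Hypothetical workload: 10,000 requests/day, 2,000 input + 500 output tokens each
requests_per_day = 10_000
daily_cost = estimate_cost_usd(
    input_tokens=requests_per_day * 2_000,
    output_tokens=requests_per_day * 500,
)
print(f"Estimated daily cost: ${daily_cost:.2f}")
```

Under these assumptions the workload comes to 20M input tokens ($2.60) plus 5M output tokens ($2.00), or about $4.60 per day.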
Frequently asked questions about Hermes 4 – Llama-3.1 70B (Non-reasoning)
How much does Hermes 4 – Llama-3.1 70B (Non-reasoning) cost?
Hermes 4 – Llama-3.1 70B (Non-reasoning) costs $0.13 per million input tokens and $0.40 per million output tokens.
Is Hermes 4 – Llama-3.1 70B (Non-reasoning) good for coding?
On our coding benchmark index, Hermes 4 – Llama-3.1 70B (Non-reasoning) ranks #249 of 315 models, placing it in the bottom quarter of the field for code generation and debugging.
Who created Hermes 4 – Llama-3.1 70B (Non-reasoning)?
Hermes 4 – Llama-3.1 70B (Non-reasoning) is developed by Nous Research and was released on August 27, 2025.
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: June 11, 2026 8:38 pm