Hermes 4 – Llama-3.1 70B (Non-reasoning)

Hermes 4 – Llama-3.1 70B (Non-reasoning)

Nous Research · Released Aug 27, 2025 Professional
Intelligence #10 / 576
82.0 Our Score
Speed #136 / 271
95.3 tokens / sec
Input #235 / 577
$0.130 per 1M tokens
Output #226 / 577
$0.400 per 1M tokens
Context
— Not reported

Analysis Summary

Hermes 4 on Llama-3.1 70B (Non-reasoning) is a cost-efficient fine-tune from Nous Research targeting general instruction following at the 70B scale. Its MMLU-Pro of 0.664 and GPQA of 0.491 are respectable for the model size, though its LiveCodeBench score of 0.269 and low long-context reliability limit its technical utility.

For businesses, this model is best suited to lighter content tasks: drafting, summarisation, and structured text generation where budget is a constraint. Its agentic index is low, and its coding capability falls short of what most software teams would require. The very low pricing at $0.13 input makes it attractive for high-volume, lower-complexity workloads.

Teams running cost-sensitive content pipelines or needing a capable general-purpose model without a large budget will find value here, but should look to the 405B or reasoning variants for more demanding tasks.

Assessed June 6, 2026

Editorial notes

Hermes 4 Llama-3.1 70B (Non-reasoning) from Nous Research offers moderate general capability at a low price of $0.13/$0.40 per million tokens, with limited coding and agentic depth.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence2.5Technical2.1Value7Content1.7
Intelligence 2.5/10
Technical 2.1/10
Content 1.7/10
Value 7/10

How Hermes 4 – Llama-3.1 70B (Non-reasoning) compares

Hermes 4 – Llama-3.1 70B (Non-reasoning) ranks #275 of 378 AI models we track for overall intelligence, #249 of 315 for coding, #170 of 289 for agentic tasks. At $0.13 per million input tokens it is cheaper than 59% of comparable models.

Performance Indices

Source: Artificial Analysis

12.6 Intelligence Index
9.2 Coding Index
21.6 Agentic Index
11.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 49.1% Graduate-level scientific reasoning
HLE 3.6% Humanity's Last Exam
MMLU Pro 66.4% Multi-task language understanding
AIME 2025 11.3% Competition mathematics (2025)
SciCode 27.7% Scientific computing

Technical

LiveCodeBench 26.9% Live coding evaluation
τ²-Bench 21.6% Conversational agent benchmark

Content

IFBench 29% Instruction following
LCR 2% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Hermes 4 – Llama-3.1 70B (Non-reasoning) stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

ProviderNous Research
Release Date August 27, 2025
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.13 $0.000130
Output $0.40 $0.000400

Leaderboard Categories

Frequently asked questions about Hermes 4 – Llama-3.1 70B (Non-reasoning)

How much does Hermes 4 – Llama-3.1 70B (Non-reasoning) cost?

Hermes 4 – Llama-3.1 70B (Non-reasoning) costs $0.13 per million input tokens and $0.40 per million output tokens.

Is Hermes 4 – Llama-3.1 70B (Non-reasoning) good for coding?

On our coding benchmark index, Hermes 4 – Llama-3.1 70B (Non-reasoning) ranks #249 of 315 models, placing it in the broader range of the field for code generation and debugging.

Who created Hermes 4 – Llama-3.1 70B (Non-reasoning)?

Hermes 4 – Llama-3.1 70B (Non-reasoning) is developed by Nous Research and was released on August 27, 2025.