Hermes 4 – Llama-3.1 70B (Non-reasoning)

Hermes 4 – Llama-3.1 70B (Non-reasoning)

Nous Research · Released Aug 27, 2025 Efficient
Intelligence #232 / 557
37.0 Our Score
Speed #174 / 259
60.3 tokens / sec
Input #227 / 561
$0.130 per 1M tokens
Output #217 / 561
$0.400 per 1M tokens
Context
Not reported

Analysis Summary

Hermes 4 – Llama-3.1 70B (Non-reasoning) sits in the Efficient tier on our leaderboard, ranked #232 of 557 published models on overall intelligence. At $0.130 input and $0.400 output per 1M tokens, it is among the most expensive on the market.

Editorial notes

Hermes 4 Llama-3.1 70B Non-reasoning is affordably priced at $0.13 input but scores poorly on intelligence and coding benchmarks, limiting it to basic content and summarisation tasks.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence2.8Technical2.2Value7Content3
Intelligence 2.8/10
Technical 2.2/10
Content 3/10
Value 7/10

Performance Indices

Source: Artificial Analysis

12.6 Intelligence Index
9.2 Coding Index
21.6 Agentic Index
11.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 49.1% Graduate-level scientific reasoning
HLE 3.6% Humanity's Last Exam
MMLU Pro 66.4% Multi-task language understanding
AIME 2025 11.3% Competition mathematics (2025)
SciCode 27.7% Scientific computing

Technical

LiveCodeBench 26.9% Live coding evaluation
τ²-Bench 21.6% Conversational agent benchmark

Content

IFBench 29% Instruction following
LCR 2% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Hermes 4 – Llama-3.1 70B (Non-reasoning) stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

ProviderNous Research
Release Date August 27, 2025
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.13 $0.000130
Output $0.40 $0.000400