Hermes 4 – Llama-3.1 405B (Non-reasoning)

Hermes 4 – Llama-3.1 405B (Non-reasoning)

Nous Research · Released Aug 27, 2025 Professional
Intelligence #14 / 590
82.0 Our Score
Speed #244 / 279
41.2 tokens / sec
Input #443 / 592
$1.00 per 1M tokens
Output #438 / 592
$3.00 per 1M tokens
Context
— Not reported

Analysis Summary

Hermes 4 405B (Non-reasoning) is Nous Research's fine-tune of Meta's Llama 3.1 405B base, targeting general-purpose instruction following and content tasks. Despite the large parameter count, its intelligence index sits in the lower range of benchmarked models, and its agentic score is weak, limiting its usefulness for autonomous or multi-step workflows.

For businesses, it is best suited to straightforward content generation, summarisation, and light Q&A tasks where the 405B scale may provide some quality lift over smaller models. Coding capability is moderate (livecodebench 0.546), but without strong reasoning or agentic reliability it is not a first choice for software engineering pipelines. Instruction following (ifbench 0.35) is below average.

At $1/$3 per million tokens it is not cheap for its capability tier. Teams needing a large open-weight model for content tasks may find better value in more recent fine-tunes or base models at lower price points.

Assessed June 30, 2026

Editorial notes

Hermes 4 405B (Non-reasoning) from Nous Research offers broad general capability on a large parameter base, with moderate instruction following, but limited reasoning depth and low agentic performance keep it in the mid-tier.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence2.2Technical2.5Value6Content3.1
Intelligence 2.2/10
Technical 2.5/10
Content 3.1/10
Value 6/10

How Hermes 4 – Llama-3.1 405B (Non-reasoning) compares

Hermes 4 – Llama-3.1 405B (Non-reasoning) ranks #253 of 385 AI models we track for overall intelligence, #195 of 293 for agentic tasks. At $1.00 per million input tokens it is cheaper than 25% of comparable models.

Performance Indices

Source: Artificial Analysis

8.8 Intelligence Index
18.2 Agentic Index
15.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 53.6% Graduate-level scientific reasoning
HLE 4.2% Humanity's Last Exam
MMLU Pro 72.9% Multi-task language understanding
AIME 2025 15.3% Competition mathematics (2025)
SciCode 34.6% Scientific computing

Technical

LiveCodeBench 54.6% Live coding evaluation
TerminalBench Hard 9.8% Agentic terminal tasks
τ²-Bench 26.6% Conversational agent benchmark

Content

IFBench 34.8% Instruction following
LCR 20% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Hermes 4 – Llama-3.1 405B (Non-reasoning) stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

ProviderNous Research
Release Date August 27, 2025
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $1.00 $0.001000
Output $3.00 $0.003000

Leaderboard Categories

Frequently asked questions about Hermes 4 – Llama-3.1 405B (Non-reasoning)

How much does Hermes 4 – Llama-3.1 405B (Non-reasoning) cost?

Hermes 4 – Llama-3.1 405B (Non-reasoning) costs $1.00 per million input tokens and $3.00 per million output tokens.

Who created Hermes 4 – Llama-3.1 405B (Non-reasoning)?

Hermes 4 – Llama-3.1 405B (Non-reasoning) is developed by Nous Research and was released on August 27, 2025.