Nous: Hermes 3 70B Instruct

Nous: Hermes 3 70B Instruct

nousresearch · Released Aug 18, 2024 Efficient
Intelligence #263 / 551
32.2 Our Score
Speed #248 / 257
31.1 tokens / sec
Input #315 / 552
$0.300 per 1M tokens
Output #199 / 552
$0.300 per 1M tokens
Context #206 / 552
131,072 tokens

Analysis Summary

Nous: Hermes 3 70B Instruct sits in the Efficient tier on our leaderboard, ranked #263 of 551 published models on overall intelligence. At $0.300 input and $0.300 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window.

Editorial notes

Nous Hermes 3 70B Instruct shows limited benchmark scores with an intelligence index of 10.6, offering a 131K context window at moderate pricing but insufficient for demanding business tasks.

Assessed May 5, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence2.4Technical0Value7.8Content3.5
Intelligence 2.4/10
Technical 0/10
Content 3.5/10
Value 7.8/10

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the..

70B Parameters

Architecture Detail

Instruct Typechatml

Performance Indices

Source: Artificial Analysis

10.6 Intelligence Index

Benchmark Scores

Intelligence

GPQA Diamond 40.1% Graduate-level scientific reasoning
HLE 4.1% Humanity's Last Exam
MMLU Pro 57.1% Multi-task language understanding
MATH 500 53.8% Mathematical problem-solving
AIME 2.3% Competition mathematics
SciCode 23.1% Scientific computing

Technical

LiveCodeBench 18.8% Live coding evaluation

Benchmark data from Artificial Analysis and Hugging Face

How does Nous: Hermes 3 70B Instruct stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID nousresearch/hermes-3-llama-3.1-70b
Providernousresearch
Release Date August 18, 2024
Context Length131,072 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.30 $0.000300
Output $0.30 $0.000300

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
558ms
Best Latency (TTFT)
25 tok/s
Best Throughput
1/1
Active Endpoints
Available via: DeepInfra

Leaderboard Categories