Nous: Hermes 4 405B

Nous: Hermes 4 405B

nousresearch · Released Aug 26, 2025
32
Our Score

Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with.. traces or respond directly, offering flexibility between speed and depth. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs The model is instruction-tuned with an expanded post-training corpus (~60B tokens) emphasizing reasoning traces, improving performance in math, code, STEM, and logical reasoning, while retaining broad assistant utility. It also supports structured outputs, including JSON mode, schema adherence, function calling, and tool use. Hermes 4 is trained for steerability, lower refusal rates, and alignment toward neutral, user-directed behavior.

$1.00 / 1M Input Price
$3.00 / 1M Output Price
131,072 tokens Context Window
405B Parameters

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerOther
Parameters405B

Model Information

OpenRouter ID nousresearch/hermes-4-405b
Providernousresearch
Release Date August 26, 2025
Context Length131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $1.00 $0.001000
Output $3.00 $0.003000

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

420ms
Best Latency (TTFT)
27 tok/s
Best Throughput
0/1
Active Endpoints
Available via: Nebius