NVIDIA: Nemotron 3 Super

NVIDIA: Nemotron 3 Super

nvidia · Released Mar 11, 2026 Specialist
Intelligence #67 / 525
66.7 Our Score
Speed #48 / 244
160.7 tokens / sec
Input #168 / 525
$0.090 per 1M tokens
Output #228 / 525
$0.450 per 1M tokens
Context #80 / 525
262,144 tokens

Analysis Summary

NVIDIA: Nemotron 3 Super sits in the Specialist tier on our leaderboard, ranked #67 of 525 published models on overall intelligence. At $0.090 input and $0.450 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use and function calling.

Editorial notes

NVIDIA's Nemotron 3 Super offers a solid balance of reasoning and instruction-following capability at very competitive pricing ($0.09/$0.45 per million tokens) with a large 262K context window — a cost-effective choice for businesses running high-volume content or tool-use workflows.

Assessed April 23, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence6.4Technical5.9Value8Content6.5
Intelligence 6.4/10
Technical 5.9/10
Content 6.5/10
Value 8/10

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer..

120B Parameters

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

36 Intelligence Index
31.2 Coding Index
48.3 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 80% Graduate-level scientific reasoning
HLE 19.2% Humanity's Last Exam
SciCode 36% Scientific computing

Technical

TerminalBench Hard 28.8% Agentic terminal tasks
τ²-Bench 67.8% Conversational agent benchmark

Content

IFBench 71.5% Instruction following
LCR 60% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does NVIDIA: Nemotron 3 Super stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID nvidia/nemotron-3-super-120b-a12b
Providernvidia
Release Date March 11, 2026
Context Length262,144 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.09 $0.000090
Output $0.45 $0.000450

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.4%
Avg Uptime
750ms
Best Latency (TTFT)
91 tok/s
Best Throughput
3/3
Active Endpoints
Available via: DekaLLM, DeepInfra, Nebius

Leaderboard Categories