NVIDIA: Nemotron 3 Super

NVIDIA: Nemotron 3 Super

nvidia · Released Mar 11, 2026 Professional
Intelligence #62 / 557
70.4 Our Score
Speed #39 / 259
173.1 tokens / sec
Input #185 / 560
$0.090 per 1M tokens
Output #244 / 560
$0.450 per 1M tokens
Context #49 / 560
1M tokens

Analysis Summary

NVIDIA: Nemotron 3 Super sits in the Professional tier on our leaderboard, ranked #62 of 557 published models on overall intelligence. At $0.090 input and $0.450 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use and function calling.

Editorial notes

Nemotron 3 Super from NVIDIA offers good reasoning and a 1M token context window with tool use and function calling at low cost, though coding and agentic scores sit below the top tier.

Assessed May 17, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence6.4Technical5.9Value8.3Content6.5
Intelligence 6.4/10
Technical 5.9/10
Content 6.5/10
Value 8.3/10

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer..

120B Parameters

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

36 Intelligence Index
31.2 Coding Index
48.3 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 80% Graduate-level scientific reasoning
HLE 19.2% Humanity's Last Exam
SciCode 36% Scientific computing

Technical

TerminalBench Hard 28.8% Agentic terminal tasks
τ²-Bench 67.8% Conversational agent benchmark

Content

IFBench 71.5% Instruction following
LCR 60% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does NVIDIA: Nemotron 3 Super stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID nvidia/nemotron-3-super-120b-a12b
Providernvidia
Release Date March 11, 2026
Context Length1,000,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.09 $0.000090
Output $0.45 $0.000450

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

98.8%
Avg Uptime
1,053ms
Best Latency (TTFT)
107 tok/s
Best Throughput
3/3
Active Endpoints
Available via: DekaLLM, DeepInfra, Nebius

Leaderboard Categories