Home > AI Models > NVIDIA: Nemotron 3 Super

NVIDIA: Nemotron 3 Super

Name: NVIDIA: Nemotron 3 Super Review
Item: NVIDIA: Nemotron 3 Super
Author: Design for Online Editorial

NVIDIA: Nemotron 3 Super

nvidia · Released Mar 11, 2026 Professional

Intelligence #10 / 576

82.0 Our Score

Speed #42 / 271

184.7 tokens / sec

Input #188 / 577

$0.090 per 1M tokens

Output #251 / 577

$0.450 per 1M tokens

Context #51 / 577

1M tokens

NVIDIA Nemotron 3 Super is a cost-efficient model with an intelligence index of 36.0 and a coding index of 31.2, placing it in the good-to-strong range for general reasoning tasks. Its agentic index of 48.3 is moderate, and it supports tool use and function calling across a 1M token context window. Pricing at $0.09 input and $0.45 output per million tokens is very competitive, making it one of the better value options for teams with budget constraints.

For businesses, Nemotron 3 Super is a practical choice for high-volume automation, structured content generation, and lightweight agentic tasks where cost efficiency is a priority. The 1M context window is a genuine advantage for long-document workflows. Coding and agentic performance are adequate but not class-leading, so it is better suited to supporting roles than frontier reasoning tasks.

At this price point, it is worth considering for teams running large volumes of moderate-complexity tasks. Pair it with a stronger model for high-stakes reasoning or complex coding work.

Assessed June 6, 2026

Editorial notes

NVIDIA Nemotron 3 Super offers good reasoning with a 36.0 intelligence index, tool use, function calling, a 1M token context, and very low pricing at $0.09 input per million tokens.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: No
Input
Output
Context: 1M tokens
Tokenizer: Other
Released: Mar 11, 2026

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

How NVIDIA: Nemotron 3 Super compares

NVIDIA: Nemotron 3 Super ranks #77 of 378 AI models we track for overall intelligence, #84 of 315 for coding, #89 of 289 for agentic tasks. Its 1M-token context window is larger than 91% of the models we list. At $0.09 per million input tokens it is cheaper than 67% of comparable models.

About NVIDIA: Nemotron 3 Super

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer..

120B Parameters

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

36 Intelligence Index

31.2 Coding Index

48.3 Agentic Index

Benchmark Scores

GPQA Diamond 80% Graduate-level scientific reasoning

HLE 19.2% Humanity's Last Exam

SciCode 36% Scientific computing

TerminalBench Hard 28.8% Agentic terminal tasks

τ²-Bench 67.8% Conversational agent benchmark

IFBench 71.5% Instruction following

LCR 60% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does NVIDIA: Nemotron 3 Super stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID	`nvidia/nemotron-3-super-120b-a12b`
Provider	nvidia
Release Date	March 11, 2026
Context Length	1,000,000 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.09	$0.000090
Output	$0.45	$0.000450

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

96.4%

Avg Uptime

1,072ms

Best Latency (TTFT)

98.5 tok/s

Best Throughput

4/4

Active Endpoints

Available via: DekaLLM, DeepInfra, DigitalOcean, Nebius

Leaderboard Categories

Coding Tool Use

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

Frequently asked questions about NVIDIA: Nemotron 3 Super

How much does NVIDIA: Nemotron 3 Super cost?

NVIDIA: Nemotron 3 Super costs $0.09 per million input tokens and $0.45 per million output tokens.

What is the context window of NVIDIA: Nemotron 3 Super?

NVIDIA: Nemotron 3 Super has a context window of 1,000,000 tokens (1M).

Is NVIDIA: Nemotron 3 Super good for coding?

On our coding benchmark index, NVIDIA: Nemotron 3 Super ranks #84 of 315 models, placing it in the broader range of the field for code generation and debugging.

What can NVIDIA: Nemotron 3 Super do?

NVIDIA: Nemotron 3 Super supports tool use and function calling.

Who created NVIDIA: Nemotron 3 Super?

NVIDIA: Nemotron 3 Super is developed by NVIDIA and was released on March 11, 2026.

NVIDIA: Nemotron 3 Super

NVIDIA: Nemotron 3 Super

Analysis Summary

Performance Profile

How NVIDIA: Nemotron 3 Super compares

About NVIDIA: Nemotron 3 Super

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Frequently asked questions about NVIDIA: Nemotron 3 Super

How much does NVIDIA: Nemotron 3 Super cost?

What is the context window of NVIDIA: Nemotron 3 Super?

Is NVIDIA: Nemotron 3 Super good for coding?

What can NVIDIA: Nemotron 3 Super do?

Who created NVIDIA: Nemotron 3 Super?

NVIDIA: Nemotron 3 Super

Performance Profile

How NVIDIA: Nemotron 3 Super compares

About NVIDIA: Nemotron 3 Super

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Explore Related Models

Frequently asked questions about NVIDIA: Nemotron 3 Super

How much does NVIDIA: Nemotron 3 Super cost?

What is the context window of NVIDIA: Nemotron 3 Super?

Is NVIDIA: Nemotron 3 Super good for coding?

What can NVIDIA: Nemotron 3 Super do?

Who created NVIDIA: Nemotron 3 Super?