NVIDIA: Nemotron 3 Super (free)

NVIDIA: Nemotron 3 Super (free)

nvidia · Released Mar 11, 2026 Emerging
Intelligence #525 / 557
8.1 Our Score
Speed
Not reported
Input
Not priced
Output
Not priced
Context #49 / 560
1M tokens

Analysis Summary

NVIDIA: Nemotron 3 Super (free) sits in the Emerging tier on our leaderboard, ranked #525 of 557 published models on overall intelligence. At $0.000 input and $0.000 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use and function calling.

Editorial notes

Nemotron 3 Super (free) shares the same architecture as the paid variant but has no independent benchmark data listed; the 1M context window and tool use are useful features pending verified scores.

Assessed May 17, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence0Technical0Value0Content3
Intelligence 0/10
Technical 0/10
Content 3/10
Value 0/10

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer..

120B Parameters

Capabilities

Tool Use Function Calling

How does NVIDIA: Nemotron 3 Super (free) stack up?

Compare side-by-side with other emerging models.

Compare Models

Model Information

OpenRouter ID nvidia/nemotron-3-super-120b-a12b:free
Providernvidia
Release Date March 11, 2026
Context Length1,000,000 tokens
Max Completion262,144 tokens
Status Active

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

98.4%
Avg Uptime
15,200ms
Best Latency (TTFT)
21 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Nvidia