NVIDIA: Nemotron 3 Super (free)

NVIDIA: Nemotron 3 Super (free)

nvidia · Released Mar 11, 2026 Emerging
Intelligence #539 / 556
5.1 Our Score
Speed
Not reported
Input
Not priced
Output
Not priced
Context #49 / 557
1M tokens

Analysis Summary

NVIDIA: Nemotron 3 Super (free) sits in the Emerging tier on our leaderboard, ranked #539 of 556 published models on overall intelligence. At $0.000 input and $0.000 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use and function calling.

Editorial notes

NVIDIA Nemotron 3 Super (free) offers a 262K context with tool use and function calling at no cost, but lacks its own benchmark data for a reliable capability assessment.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence0Technical0Value0Content3
Intelligence 0/10
Technical 0/10
Content 3/10
Value 0/10

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer..

120B Parameters

Capabilities

Tool Use Function Calling

How does NVIDIA: Nemotron 3 Super (free) stack up?

Compare side-by-side with other emerging models.

Compare Models

Model Information

OpenRouter ID nvidia/nemotron-3-super-120b-a12b:free
Providernvidia
Release Date March 11, 2026
Context Length1,000,000 tokens
Max Completion262,144 tokens
Status Active

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.2%
Avg Uptime
15,138ms
Best Latency (TTFT)
22 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Nvidia