NVIDIA: Nemotron 3 Nano 30B A3B

NVIDIA: Nemotron 3 Nano 30B A3B

nvidia · Released Dec 14, 2025 Specialist
Intelligence #128 / 556
53.8 Our Score
Speed #94 / 257
122.9 tokens / sec
Input #150 / 557
$0.050 per 1M tokens
Output #170 / 557
$0.200 per 1M tokens
Context #98 / 557
262,144 tokens

Analysis Summary

NVIDIA: Nemotron 3 Nano 30B A3B sits in the Specialist tier on our leaderboard, ranked #128 of 556 published models on overall intelligence. At $0.050 input and $0.200 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use and function calling.

Editorial notes

NVIDIA Nemotron 3 Nano 30B A3B is very affordable with a 262K context window, strong math and instruction-following scores, and tool use support, but its intelligence index of 24.3 limits broader business applicability.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence4.8Technical4Value8.3Content4.5
Intelligence 4.8/10
Technical 4/10
Content 4.5/10
Value 8.3/10

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully..

30B Parameters

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

24.3 Intelligence Index
19 Coding Index
27.3 Agentic Index
91 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 75.7% Graduate-level scientific reasoning
HLE 10.2% Humanity's Last Exam
MMLU Pro 79.4% Multi-task language understanding
AIME 2025 91% Competition mathematics (2025)
SciCode 29.6% Scientific computing

Technical

LiveCodeBench 74.1% Live coding evaluation
TerminalBench Hard 13.6% Agentic terminal tasks
τ²-Bench 40.9% Conversational agent benchmark

Content

IFBench 71.1% Instruction following
LCR 33.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does NVIDIA: Nemotron 3 Nano 30B A3B stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID nvidia/nemotron-3-nano-30b-a3b
Providernvidia
Release Date December 14, 2025
Context Length262,144 tokens
Max Completion228,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.05 $0.000050
Output $0.20 $0.000200

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
1,745ms
Best Latency (TTFT)
64 tok/s
Best Throughput
1/1
Active Endpoints
Available via: DeepInfra

Leaderboard Categories