NVIDIA: Nemotron 3 Nano Omni (free)

NVIDIA: Nemotron 3 Nano Omni (free)

nvidia · Released Apr 28, 2026 Emerging New
Intelligence #314 / 561
27.6 Our Score
Speed #10 / 260
302.6 tokens / sec
Input
Not priced
Output
Not priced
Context #158 / 561
256,000 tokens

Analysis Summary

NVIDIA: Nemotron 3 Nano Omni (free) sits in the Emerging tier on our leaderboard, ranked #314 of 561 published models on overall intelligence. At $0.000 input and $0.000 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use, function calling, vision, and reasoning.

Editorial notes

NVIDIA Nemotron 3 Nano Omni is a free multimodal model supporting image, audio, and video with a 256K context; benchmark scores are limited but the free tier and broad modality support add value.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence3.6Technical2.9Value0Content4
Intelligence 3.6/10
Technical 2.9/10
Content 4/10
Value 0/10

NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and..

30B Parameters

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

21.4 Intelligence Index
14.8 Coding Index
26.8 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 46.9% Graduate-level scientific reasoning
HLE 5.3% Humanity's Last Exam
SciCode 27.8% Scientific computing

Technical

TerminalBench Hard 8.3% Agentic terminal tasks
τ²-Bench 45.3% Conversational agent benchmark

Content

IFBench 63.2% Instruction following
LCR 35.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does NVIDIA: Nemotron 3 Nano Omni (free) stack up?

Compare side-by-side with other emerging models.

Compare Models

Model Information

OpenRouter ID nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free
Providernvidia
Release Date April 28, 2026
Context Length256,000 tokens
Max Completion65,536 tokens
Status Active

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.8%
Avg Uptime
686ms
Best Latency (TTFT)
178 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Nvidia

Leaderboard Categories