NVIDIA: Nemotron 3 Nano Omni (free)

NVIDIA: Nemotron 3 Nano Omni (free)

nvidia · Released Apr 28, 2026 New
Intelligence
Awaiting review
Speed
Not reported
Input
Not priced
Output
Not priced
Context #136 / 538
256,000 tokens

Analysis Summary

At $0.000 input and $0.000 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use, function calling, vision, and reasoning.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and..

30B Parameters

Capabilities

Tool Use Function Calling Vision

How does NVIDIA: Nemotron 3 Nano Omni (free) stack up?

Compare side-by-side with other similar models.

Compare Models

Model Information

OpenRouter ID nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free
Providernvidia
Release Date April 28, 2026
Context Length256,000 tokens
Max Completion65,536 tokens
Status Active

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
401ms
Best Latency (TTFT)
202 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Nvidia