NVIDIA: Nemotron Nano 12B 2 VL
Analysis Summary
NVIDIA: Nemotron Nano 12B 2 VL sits in the Efficient tier on our leaderboard, ranked #196 of 557 published models on overall intelligence. At $0.200 input and $0.600 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports vision.
Editorial notes
NVIDIA Nemotron Nano 12B 2 VL supports vision and video input with tool use at low pricing, but its intelligence and coding indices are limited, restricting it to lighter multimodal tasks.
Assessed May 14, 2026
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s..
Capabilities
Performance Indices
Source: Artificial Analysis
Benchmark Scores
Intelligence
Technical
Content
Benchmark data from Artificial Analysis and Hugging Face
How does NVIDIA: Nemotron Nano 12B 2 VL stack up?
Compare side-by-side with other efficient models.
Model Information
| OpenRouter ID |
nvidia/nemotron-nano-12b-v2-vl
|
| Provider | nvidia |
| Release Date | October 28, 2025 |
| Context Length | 131,072 tokens |
| Max Completion | 16,384 tokens |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.20 | $0.000200 |
| Output | $0.60 | $0.000600 |
External Resources
Explore Related Models
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: May 17, 2026 8:40 pm