Home > AI Models > NVIDIA: Nemotron Nano 12B 2 VL

NVIDIA: Nemotron Nano 12B 2 VL

Name: NVIDIA: Nemotron Nano 12B 2 VL Review
Item: NVIDIA: Nemotron Nano 12B 2 VL
Rating: 4.2
Author: Design for Online

NVIDIA: Nemotron Nano 12B 2 VL

nvidia · Released Oct 28, 2025 Efficient

Intelligence #196 / 557

41.6 Our Score

Speed #18 / 257

234.1 tokens / sec

Input #264 / 557

$0.200 per 1M tokens

Output #261 / 557

$0.600 per 1M tokens

Context #220 / 557

131,072 tokens

NVIDIA: Nemotron Nano 12B 2 VL sits in the Efficient tier on our leaderboard, ranked #196 of 557 published models on overall intelligence. At $0.200 input and $0.600 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports vision.

Editorial notes

NVIDIA Nemotron Nano 12B 2 VL supports vision and video input with tool use at low pricing, but its intelligence and coding indices are limited, restricting it to lighter multimodal tasks.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: No
Input
Output
Context: 131,072 tokens
Max output: 16,384 tokens
Tokenizer: Other
Released: Oct 28, 2025

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s..

12B Parameters

Capabilities

Vision

Performance Indices

Source: Artificial Analysis

14.9 Intelligence Index

11.7 Coding Index

12.9 Agentic Index

75 Math Index

Benchmark Scores

GPQA Diamond 57.2% Graduate-level scientific reasoning

HLE 5.3% Humanity's Last Exam

MMLU Pro 75.9% Multi-task language understanding

AIME 2025 75% Competition mathematics (2025)

SciCode 26.2% Scientific computing

LiveCodeBench 69.4% Live coding evaluation

TerminalBench Hard 4.5% Agentic terminal tasks

τ²-Bench 21.3% Conversational agent benchmark

IFBench 31.9% Instruction following

LCR 40% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does NVIDIA: Nemotron Nano 12B 2 VL stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID	`nvidia/nemotron-nano-12b-v2-vl`
Provider	nvidia
Release Date	October 28, 2025
Context Length	131,072 tokens
Max Completion	16,384 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.20	$0.000200
Output	$0.60	$0.000600

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

NVIDIA: Nemotron Nano 12B 2 VL

NVIDIA: Nemotron Nano 12B 2 VL

Analysis Summary

Performance Profile

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

External Resources

NVIDIA: Nemotron Nano 12B 2 VL

Performance Profile

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

External Resources

Explore Related Models