Nemotron Nano 12B 2 VL pricing
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s... This page tracks one listing. The lowest listed prices are $0.200 per million input tokens and $0.600 per million output tokens, both from Openrouter.
Pricing across providers
All figures are list prices in US dollars per million tokens unless a column says otherwise. One offer is listed for Nemotron Nano 12B 2 VL, so the lowest input price in this view is Openrouter's.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
| Openrouter | $0.200 | $0.600 | — | — |
Input vs output · 1M tokens
Cost calculator
Pick any provider row and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for Nemotron Nano 12B 2 VL.
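The calculator's arithmetic can be sketched in a few lines. This is a minimal illustration, not the page's actual implementation: the function names `request_cost_usd` and `projected_costs` are hypothetical, and it assumes the list prices from the Openrouter row with no cached-input or batch discount.

```python
from decimal import Decimal

# List prices from the table above (USD per 1M tokens, Openrouter row).
INPUT_USD_PER_M = Decimal("0.200")
OUTPUT_USD_PER_M = Decimal("0.600")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Cost in USD for the given token counts at list price."""
    total = (Decimal(input_tokens) * INPUT_USD_PER_M
             + Decimal(output_tokens) * OUTPUT_USD_PER_M)
    return total / TOKENS_PER_M


def projected_costs(input_per_day: int, output_per_day: int) -> dict:
    """Rough daily, weekly, and yearly totals from a daily token volume."""
    daily = request_cost_usd(input_per_day, output_per_day)
    # Assumes a 7-day week and a 365-day year of constant usage.
    return {"day": daily, "week": daily * 7, "year": daily * 365}
```

For example, `projected_costs(5_000_000, 1_000_000)` works out to $1.60 per day, $11.20 per week, and $584.00 per year at these list prices.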
Model specifications
Quick spec sheet for Nemotron Nano 12B 2 VL before you dive back into pricing. Reported under Nvidia.
- Context window: 131,072 tokens
- Max output: N/A
- Vision (images): Yes
- Tool / function calling: No
- Streaming: Yes
- Released: Oct 2025
- Primary provider: Nvidia
- Model family: N/A
Compare Nemotron Nano 12B 2 VL
Open a pair page to see Nemotron Nano 12B 2 VL next to another model with a shared provider matrix. 6 shortcuts below.
- Nemotron Nano 12B 2 VL vs GPT-4o
- Nemotron Nano 12B 2 VL vs GPT-4o mini
- Nemotron Nano 12B 2 VL vs Claude Sonnet 4.6
- Nemotron Nano 12B 2 VL vs Gemini 2.0 Flash
- Nemotron Nano 12B 2 VL vs o3
- Nemotron Nano 12B 2 VL vs Llama 3.1 70B
Frequently asked questions
Common questions about Nemotron Nano 12B 2 VL pricing and limits.
Also from Nvidia
Other models by Nvidia with live pricing in our catalog.