Nemotron Nano 9B V2 pricing
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and... Below you will find 1 current row with input and output dollars per million. Right now the lowest input is $0.040 and the lowest output is $0.160.
Pricing across providers
Every row is a seller of Nemotron Nano 9B V2 with token pricing we track. The cheapest input in this snapshot is from Openrouter. The bar chart shows the same input and output dollars per million for a quick scan.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O Openrouter | $0.040 | $0.160 | — | — |
Input vs output · 1M tokens
Cost calculator
Pick any provider row and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for Nemotron Nano 9B V2.
0.004000¢ / req
0.008000¢ / req
Model specifications
Quick spec sheet for Nemotron Nano 9B V2 before you dive back into pricing. Reported under Nvidia.
- Context window
- 131,072 tokens
- Max output
- N/A
- Vision (images)
- No
- Tool / function calling
- Yes
- Streaming
- Yes
- Released
- Sep 2025
- Primary provider
- Nvidia
- Model family
- N/A
Compare Nemotron Nano 9B V2
Open a pair page to see Nemotron Nano 9B V2 next to another model with a shared provider matrix. 6 shortcuts below.
Locked
Compare with
Pick a model on both sides.
Popular Nemotron Nano 9B V2 comparisons
- Nemotron Nano 9B V2 vs GPT-4o
Compare pricing side by side
- Nemotron Nano 9B V2 vs GPT-4o mini
Compare pricing side by side
- Nemotron Nano 9B V2 vs Claude Sonnet 4.6
Compare pricing side by side
- Nemotron Nano 9B V2 vs Gemini 2.0 Flash
Compare pricing side by side
- Nemotron Nano 9B V2 vs o3
Compare pricing side by side
- Nemotron Nano 9B V2 vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Answers pull from the same numbers you see on this page. The short model note from our index: NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...
Also from Nvidia
Other models by Nvidia with live pricing in our catalog.