Nemotron 3 Ultra (free) pricing
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it... This page tracks 1 listing in total. Highlighted lows are $0.0000 per million input and $0.0000 per million output (see table for which seller matches each).
Pricing across providers
Every row is a seller of Nemotron 3 Ultra (free) with token pricing we track. The cheapest input in this snapshot is from Openrouter. The bar chart shows the same input and output dollars per million for a quick scan.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O Openrouter | $0.0000 | $0.0000 | — | — |
Input vs output · 1M tokens
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Nemotron 3 Ultra (free) cost scales with traffic.
Model specifications
These fields describe Nemotron 3 Ultra (free) as we store it (source: Nvidia). They sit next to price so buyers can check limits and tools in one place.
- Context window
- 1,000,000 tokens
- Max output
- 65,536 tokens
- Vision (images)
- No
- Tool / function calling
- Yes
- Streaming
- Yes
- Released
- Jun 2026
- Primary provider
- Nvidia
- Model family
- N/A
Compare Nemotron 3 Ultra (free)
Open a pair page to see Nemotron 3 Ultra (free) next to another model with a shared provider matrix. 6 shortcuts below.
Locked
Compare with
Pick a model on both sides.
Popular Nemotron 3 Ultra (free) comparisons
- Nemotron 3 Ultra (free) vs GPT-4o
Compare pricing side by side
- Nemotron 3 Ultra (free) vs GPT-4o mini
Compare pricing side by side
- Nemotron 3 Ultra (free) vs Claude Sonnet 4.6
Compare pricing side by side
- Nemotron 3 Ultra (free) vs Gemini 2.0 Flash
Compare pricing side by side
- Nemotron 3 Ultra (free) vs o3
Compare pricing side by side
- Nemotron 3 Ultra (free) vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Answers pull from the same numbers you see on this page. The short model note from our index: NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...
Also from Nvidia
Other models by Nvidia with live pricing in our catalog.