Mistral Nemo pricing
A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese,... Below you will find 3 current rows with input and output dollars per million. Right now the lowest input is $0.020 and the lowest output is $0.030.
Pricing across providers
Use this table to read Mistral Nemo list prices. We show 3 sources right now. Lowest input in the grid: Openrouter. The chart below the table helps when output prices are much higher than input prices.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O Openrouter | $0.020 | $0.030 | — | — |
A Azure | $0.150 | $0.150 | — | — |
N Novita | $0.040 | $0.170 | — | — |
Input vs output · per provider
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Mistral Nemo cost scales with traffic.
Provider
0.002000¢ / req
0.001500¢ / req
Model specifications
Quick spec sheet for Mistral Nemo before you dive back into pricing. Reported under Mistral AI.
- Context window
- 131,072 tokens
- Max output
- 4,096 tokens
- Vision (images)
- No
- Tool / function calling
- Yes
- Streaming
- No
- Released
- Jul 2024
- Primary provider
- Mistral AI
- Model family
- N/A
Compare Mistral Nemo
Jump into a comparison when you want one table for two models instead of two tabs. 6 curated matches for Mistral Nemo.
- Mistral Nemo vs Mistral 7B
Compare pricing side by side
- Mistral Nemo vs Mistral Large
Compare pricing side by side
- Mistral Nemo vs Mixtral 8x7B
Compare pricing side by side
- Mistral Nemo vs GPT-4o
Compare pricing side by side
- Mistral Nemo vs GPT-4o mini
Compare pricing side by side
- Mistral Nemo vs Claude Sonnet 4.6
Compare pricing side by side
Frequently asked questions
Read these after the table if you want plain language around Mistral Nemo rates. The short model note from our index: A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese,...
Also from Mistral AI
Other models by Mistral AI with live pricing in our catalog.