R1 Distill Qwen 32B pricing
DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.\n\nOther benchmark results include:\n\n- AIME 2024 pass@1: 72.6\n- MATH-500 pass@1: 94.3\n- CodeForces Rating: 1691\n\nThe model leverages fine-tuning from DeepSeek R1's outputs, enabling competit. Below you will find 5 current rows with input and output dollars per million. Right now the lowest input is $0.150 and the lowest output is $0.150.
Pricing across providers
Use this table to read R1 Distill Qwen 32B list prices. We show 5 sources right now. Lowest input in the grid: Nscale. The chart below the table helps when output prices are much higher than input prices.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
N Nscale | $0.150 | $0.150 | — | — |
N Novita | $0.300 | $0.300 | — | — |
O Openrouter | $0.290 | $0.290 | — | — |
D DeepInfra | $0.270 | $0.270 | — | — |
FA Fireworks AI | $0.900 | $0.900 | — | — |
Input vs output · per provider
Cost calculator
Use this block to stress test R1 Distill Qwen 32B cost without a spreadsheet. All estimates come from public list rates in this page.
Provider
0.015000¢ / req
0.007500¢ / req
Model specifications
Quick spec sheet for R1 Distill Qwen 32B before you dive back into pricing. Family: DeepSeek R1. Reported under Alibaba.
- Context window
- 131,072 tokens
- Max output
- 131,072 tokens
- Vision (images)
- No
- Tool / function calling
- Yes
- Streaming
- No
- Released
- Jan 2025
- Primary provider
- Alibaba
- Model family
- DeepSeek R1
Compare R1 Distill Qwen 32B
Open a pair page to see R1 Distill Qwen 32B next to another model with a shared provider matrix. 6 shortcuts below.
- R1 Distill Qwen 32B vs GPT-4o
Compare pricing side by side
- R1 Distill Qwen 32B vs GPT-4o mini
Compare pricing side by side
- R1 Distill Qwen 32B vs Claude Sonnet 4.6
Compare pricing side by side
- R1 Distill Qwen 32B vs Gemini 2.0 Flash
Compare pricing side by side
- R1 Distill Qwen 32B vs o3
Compare pricing side by side
- R1 Distill Qwen 32B vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Quick frequently asked items for R1 Distill Qwen 32B pricing and limits. The short model note from our index: DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini ...
Also from Alibaba
Other models by Alibaba with live pricing in our catalog.