QwQ 32B pricing
QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ can think and reason, which significantly improves its performance on downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model and performs competitively against state-of-the-art reasoning models such as DeepSeek-R1 and o1-mini. This page tracks 7 listings in total. The lowest prices are $0.150 per million input tokens and $0.200 per million output tokens (see the table for which sellers match each).
Pricing across providers
Every row is a seller of QwQ 32B whose token pricing we track. The cheapest input price in this snapshot is $0.150 per million tokens, shared by Openrouter, Nebius, and DeepInfra. The bar chart shows the same input and output prices in dollars per million tokens for a quick scan.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
| Nscale | $0.180 | $0.200 | — | — |
| Sambanova | $0.500 | $1.00 | — | — |
| Openrouter | $0.150 | $0.580 | — | — |
| Hyperbolic | $0.200 | $0.200 | — | — |
| Nebius | $0.150 | $0.450 | — | — |
| Fireworks AI | $0.900 | $0.900 | — | — |
| DeepInfra | $0.150 | $0.400 | — | — |
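
To check which sellers match the lowest input and output prices, the table can be scanned programmatically. The following is a minimal sketch that copies the seven rows into a dictionary. The names `PRICES` and `cheapest` are illustrative and not part of any provider API, and the figures reflect this snapshot only.

```python
# Prices in USD per 1M tokens as (input, output), copied from the table.
PRICES = {
    "Nscale": (0.180, 0.200),
    "Sambanova": (0.500, 1.00),
    "Openrouter": (0.150, 0.580),
    "Hyperbolic": (0.200, 0.200),
    "Nebius": (0.150, 0.450),
    "Fireworks AI": (0.900, 0.900),
    "DeepInfra": (0.150, 0.400),
}


def cheapest(prices, column):
    """Return (lowest price, sorted provider names) for one price column.

    column 0 is the input price, column 1 is the output price.
    Ties are kept, so several providers can share the low.
    """
    low = min(p[column] for p in prices.values())
    names = sorted(name for name, p in prices.items() if p[column] == low)
    return low, names


cheapest_input = cheapest(PRICES, 0)
cheapest_output = cheapest(PRICES, 1)
print("Lowest input:", cheapest_input)
print("Lowest output:", cheapest_output)
```

Keeping ties matters here: three sellers share the $0.150 input low and two share the $0.200 output low, so a lookup that returns a single name would hide equally priced options.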
Input vs output · per provider
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how QwQ 32B cost scales with traffic.
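
The arithmetic behind such a calculator is a weighted sum of token counts and per-million prices. The sketch below assumes the Nscale row of the table and a hypothetical workload of 1,000 input and 500 output tokens per request at 10,000 requests per day. The function names and the workload figures are illustrative assumptions, not values taken from the page's calculator.

```python
# Prices in USD per 1M tokens, taken from the Nscale row of the table.
INPUT_PER_M = 0.180
OUTPUT_PER_M = 0.200


def request_cost_usd(input_tokens, output_tokens,
                     input_per_m=INPUT_PER_M, output_per_m=OUTPUT_PER_M):
    """Cost of one request in USD, given token counts and per-million prices."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


def monthly_cost_usd(requests_per_day, input_tokens, output_tokens,
                     days=30, **prices):
    """Scale the per-request cost to a month of steady traffic."""
    per_request = request_cost_usd(input_tokens, output_tokens, **prices)
    return requests_per_day * days * per_request


# Hypothetical workload: 1,000 input and 500 output tokens per request.
per_request = request_cost_usd(1_000, 500)
per_month = monthly_cost_usd(10_000, 1_000, 500)
print(f"${per_request:.6f} per request, ${per_month:.2f} per month")
```

Under these assumptions a request costs about $0.00028, or roughly $84 for a 30-day month. Passing `input_per_m` and `output_per_m` from another row of the table compares providers on the same workload.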
Model specifications
Quick spec sheet for QwQ 32B before you dive back into pricing. Reported under Alibaba.
- Context window: 131,072 tokens
- Max output: 131,072 tokens
- Vision (images): No
- Tool / function calling: Yes
- Streaming: No
- Released: Mar 2025
- Primary provider: Alibaba
- Model family: N/A
Compare QwQ 32B
Open a pair page to see QwQ 32B next to another model with a shared provider matrix. 6 shortcuts below.
Frequently asked questions
Read these after the table if you want plain language around QwQ 32B rates.
Also from Alibaba
Other models by Alibaba with live pricing in our catalog.