Alibaba · Tool use

QwQ 32B pricing

QwQ is the reasoning model of the Qwen series. Unlike conventional instruction-tuned models, QwQ can think and reason, which significantly improves its performance on downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model and is competitive with state-of-the-art reasoning models such as DeepSeek-R1 and o1-mini. This page tracks 7 listings in total. The lowest prices are $0.150 per million input tokens and $0.200 per million output tokens (see the table for which sellers offer each).

131K context · 7 providers · verified Apr 7, 2026
Best input: $0.150 per 1M tokens · Openrouter (tied with Nebius and DeepInfra)
Best output: $0.200 per 1M tokens · Nscale (tied with Hyperbolic)

Pricing across providers

Every row is a seller of QwQ 32B with token pricing we track. The lowest input price in this snapshot, $0.150 per million tokens, is shared by Openrouter, Nebius, and DeepInfra. The bar chart shows the same input and output prices in dollars per million tokens for a quick scan.

Provider        Input / 1M    Output / 1M
Nscale          $0.180        $0.200
Sambanova       $0.500        $1.00
Openrouter      $0.150        $0.580
Hyperbolic      $0.200        $0.200
Nebius          $0.150        $0.450
Fireworks AI    $0.900        $0.900
DeepInfra       $0.150        $0.400
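
As a rough illustration of how the listing above can be compared, the sketch below finds the lowest input and output prices (including ties) and ranks providers by a blended price. The prices are copied from this snapshot; the function names and the 75% input share are hypothetical choices for the example, not something this page defines.

```python
# Prices in USD per 1M tokens, copied from the listing above: (input, output).
PRICES = {
    "Nscale": (0.180, 0.200),
    "Sambanova": (0.500, 1.00),
    "Openrouter": (0.150, 0.580),
    "Hyperbolic": (0.200, 0.200),
    "Nebius": (0.150, 0.450),
    "Fireworks AI": (0.900, 0.900),
    "DeepInfra": (0.150, 0.400),
}


def cheapest(prices, index):
    """Return (lowest price, sorted providers at that price).

    index 0 selects input prices, index 1 selects output prices.
    """
    low = min(pair[index] for pair in prices.values())
    names = sorted(name for name, pair in prices.items() if pair[index] == low)
    return low, names


def blended_price(input_price, output_price, input_share=0.75):
    """Weighted price per 1M tokens.

    input_share is an assumed fraction of traffic that is input tokens;
    0.75 is an illustrative default, not a measured value.
    """
    return input_share * input_price + (1 - input_share) * output_price


def rank_by_blended(prices, input_share=0.75):
    """Providers sorted from cheapest to most expensive blended price."""
    return sorted(
        prices,
        key=lambda name: blended_price(*prices[name], input_share=input_share),
    )


best_input = cheapest(PRICES, 0)
best_output = cheapest(PRICES, 1)
ranking = rank_by_blended(PRICES)

print("Lowest input:", best_input)
print("Lowest output:", best_output)
print("Blended ranking:", ranking)
```

A blended price matters because the provider with the lowest input price is not necessarily the cheapest overall once output tokens are counted; the ranking shifts with the input share you assume.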

[Bar chart: input vs output price per provider]

Cost calculator

The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how QwQ 32B cost scales with traffic.

Provider: Nscale (In: $0.180/M · Out: $0.200/M)

Input: 0.018000¢ / req
Output: 0.010000¢ / req

Daily: $2.80
Monthly: $84
Annual: $1.0K
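
The arithmetic behind a calculator like this can be sketched as follows. The slider values are not recorded on this page, so 1,000 input tokens and 500 output tokens per request, and 10,000 requests per day, are assumptions, chosen only because they reproduce the figures shown at $0.180 / $0.200 per million tokens. The 30-day month and 365-day year are also assumptions, and the function names are hypothetical.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)


def cost_per_request(input_tokens, output_tokens, input_usd_per_m, output_usd_per_m):
    """Dollar cost of one request at per-million-token prices.

    Prices are passed as strings so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) * Decimal(input_usd_per_m) / MILLION
    output_cost = Decimal(output_tokens) * Decimal(output_usd_per_m) / MILLION
    return input_cost + output_cost


def projected_spend(per_request, requests_per_day):
    """Scale a per-request cost to daily, monthly, and annual totals.

    Assumes a 30-day month and a 365-day year.
    """
    daily = per_request * requests_per_day
    return {"daily": daily, "monthly": daily * 30, "annual": daily * 365}


# Assumed traffic: 1,000 input tokens and 500 output tokens per request,
# 10,000 requests per day, at $0.180 input / $0.200 output per 1M tokens.
per_request = cost_per_request(1_000, 500, "0.180", "0.200")
spend = projected_spend(per_request, 10_000)

print(f"Per request: ${per_request:.5f}")
print(f"Daily:   ${spend['daily']:.2f}")
print(f"Monthly: ${spend['monthly']:.2f}")
print(f"Annual:  ${spend['annual']:.2f}")
```

Because cost is linear in both token counts and request volume, doubling any one of them doubles the corresponding share of the bill.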

Model specifications

A quick spec sheet for QwQ 32B. The model is listed under Alibaba.

Context window: 131,072 tokens
Max output: 131,072 tokens
Vision (images): No
Tool / function calling: Yes
Streaming: No
Released: Mar 2025
Primary provider: Alibaba
Model family: N/A

Frequently asked questions

Read these after the table if you want plain-language answers about QwQ 32B rates.

QwQ 32B is available from seven providers: Nscale, Sambanova, Openrouter, Hyperbolic, Nebius, Fireworks AI, and DeepInfra.
