Is R1 Distill Qwen 32B available on other providers?

Name: R1 Distill Qwen 32B
Price: 0.2900 USD
Author: Alibaba

Alibaba · DeepSeek R1 · Tool use

R1 Distill Qwen 32B pricing

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.

Other benchmark results include:

- AIME 2024 pass@1: 72.6
- MATH-500 pass@1: 94.3
- CodeForces Rating: 1691

The model leverages fine-tuning from DeepSeek R1's outputs, enabling competit...

Below you will find 5 current rows with input and output prices in dollars per million tokens. Right now the lowest input price is $0.150 and the lowest output price is $0.150.

131K context·5 providers·verified Apr 7, 2026

Best input: $0.150 per 1M tokens · Nscale

Best output: $0.150 per 1M tokens · Nscale

Pricing across providers

The table below shows R1 Distill Qwen 32B list prices from 5 providers. Nscale has the lowest input price in the grid. The chart below the table helps when output prices are much higher than input prices.

Provider	Input / 1M	Output / 1M	Cached input	Batch
Openrouter	$0.290	$0.290	—	—
Novita	$0.300	$0.300	—	—
Nscale	$0.150	$0.150	—	—
Fireworks AI	$0.900	$0.900	—	—
DeepInfra	$0.270	$0.270	—	—
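
To compare the rows above for a specific workload, a small script can help. The following is a minimal sketch, not part of this page's tooling: the `PRICES` mapping and the `cheapest` helper are illustrative names, and the rates are copied by hand from the table (USD per 1M tokens).

```python
# List prices in USD per 1M tokens, copied from the provider table.
# Each value is (input_price, output_price).
PRICES = {
    "Openrouter": (0.290, 0.290),
    "Novita": (0.300, 0.300),
    "Nscale": (0.150, 0.150),
    "Fireworks AI": (0.900, 0.900),
    "DeepInfra": (0.270, 0.270),
}


def cheapest(prices, input_tokens, output_tokens):
    """Return (provider, cost_usd) with the lowest cost for the given token mix."""

    def cost(rates):
        input_price, output_price = rates
        return (input_tokens * input_price + output_tokens * output_price) / 1_000_000

    provider = min(prices, key=lambda name: cost(prices[name]))
    return provider, cost(prices[provider])


# Example: 1M input tokens plus 1M output tokens.
provider, usd = cheapest(PRICES, 1_000_000, 1_000_000)
print(f"{provider}: ${usd:.3f}")
```

Because every provider listed here charges the same rate for input and output, the ranking does not change with the input/output mix. The helper only matters once those rates diverge.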

Input vs output · per provider

Cost calculator

Use this block to stress-test R1 Distill Qwen 32B cost without a spreadsheet. All estimates come from the public list rates on this page.

Rate: In $0.290/M · Out $0.290/M
Input cost (avg input tokens): 0.029000¢ / req
Output cost (avg output tokens): 0.014500¢ / req
Daily: $4.35
Monthly: $131
Annual: $1.6K
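
The calculator figures can be reproduced with a few lines of arithmetic. The sketch below is an approximation under stated assumptions: the workload values (10,000 requests per day, 1,000 input tokens and 500 output tokens per request) are not shown on the page but are inferred from the displayed per-request and daily costs. The 30-day month and 365-day year are also assumptions, and `estimate` is an illustrative name.

```python
def estimate(requests_per_day, avg_input_tokens, avg_output_tokens,
             input_per_m, output_per_m):
    """Project costs in USD from per-1M-token list rates."""
    input_cost = avg_input_tokens * input_per_m / 1_000_000    # USD per request
    output_cost = avg_output_tokens * output_per_m / 1_000_000  # USD per request
    daily = requests_per_day * (input_cost + output_cost)
    return {
        "input_cents_per_request": input_cost * 100,
        "output_cents_per_request": output_cost * 100,
        "daily": daily,
        "monthly": daily * 30,   # assumes a 30-day month
        "annual": daily * 365,   # assumes a 365-day year
    }


# Workload values are inferred, not stated on the page.
est = estimate(10_000, 1_000, 500, input_per_m=0.290, output_per_m=0.290)
for key, value in est.items():
    print(f"{key}: {value:.6f}")
```

Under these assumptions the results come to about $4.35 per day, $130.50 per month, and $1,587.75 per year, which the page displays rounded as $131 and $1.6K.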

Model specifications

Quick spec sheet for R1 Distill Qwen 32B before you dive back into pricing. Family: DeepSeek R1. Reported under Alibaba.

Context window: 131,072 tokens
Max output: 131,072 tokens
Vision (images): No
Tool / function calling: Yes
Streaming: No
Released: Jan 2025
Primary provider: Alibaba
Model family: DeepSeek R1
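
When sizing requests against these limits, a quick budget check can be useful. This is a minimal sketch that assumes the prompt and the completion share the 131,072-token context window. That is a common convention but is not stated on this page, and individual providers may enforce lower limits. `max_completion_tokens` is a hypothetical helper name.

```python
CONTEXT_WINDOW = 131_072  # tokens, from the spec sheet
MAX_OUTPUT = 131_072      # tokens, from the spec sheet


def max_completion_tokens(prompt_tokens,
                          context_window=CONTEXT_WINDOW,
                          max_output=MAX_OUTPUT):
    """Return how many output tokens remain for a prompt of the given size.

    Assumes prompt and completion share one context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = context_window - prompt_tokens
    return max(0, min(remaining, max_output))


print(max_completion_tokens(100_000))
```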

Compare R1 Distill Qwen 32B

Open a pair page to see R1 Distill Qwen 32B next to another model with a shared provider matrix.

Frequently asked questions

Quick answers to common questions about R1 Distill Qwen 32B pricing and limits. The short model note from our index: DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini ...

Is R1 Distill Qwen 32B available on other providers? Yes. R1 Distill Qwen 32B is available on Openrouter, Novita, Nscale, Fireworks AI, and DeepInfra.