Is Qwen3 32B available on other providers?

Name: Qwen3 32B
Price: 0.0800 USD per 1M input tokens (lowest listed)
Author: Alibaba

Alibaba · Qwen · Tool use

Qwen3 32B pricing

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, coding, and logical inference, and a "non-thinking" mode for faster, general-purpose conversation. The model demonstrates strong performance in instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles...

This page lists 7 current offers with input and output prices in dollars per million tokens. The lowest input price is currently $0.080 and the lowest output price is $0.230.

41K context · 7 providers · verified May 8, 2026

Best input: $0.080 per 1M tokens · Openrouter

Best output: $0.230 per 1M tokens · Ovhcloud

Pricing across providers

All figures are list prices per million tokens unless a column says otherwise. 7 offers are listed for Qwen3 32B. Best input in this view: Openrouter at $0.080, tied with Ovhcloud. Best output: Ovhcloud at $0.230.

Provider	Input / 1M	Output / 1M	Cached input	Batch
Openrouter	$0.080	$0.280	—	—
Sambanova	$0.400	$0.800	—	—
Ovhcloud	$0.080	$0.230	—	—
Nebius	$0.100	$0.300	—	—
Fireworks AI	$0.900	$0.900	—	—
Groq	$0.290	$0.590	—	—
DeepInfra	$0.100	$0.280	—	—
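
The "best input" and "best output" picks can be reproduced from the table with a few lines of Python. This is a minimal sketch, not the page's own logic: the helper names `cheapest` and `blended` are hypothetical, and the 75% input share used for the blended ranking is an illustrative assumption, not a figure from this page.

```python
# Rows copied from the provider table: (provider, input $/1M, output $/1M).
OFFERS = [
    ("Openrouter", 0.080, 0.280),
    ("Sambanova", 0.400, 0.800),
    ("Ovhcloud", 0.080, 0.230),
    ("Nebius", 0.100, 0.300),
    ("Fireworks AI", 0.900, 0.900),
    ("Groq", 0.290, 0.590),
    ("DeepInfra", 0.100, 0.280),
]


def cheapest(offers, column):
    """Return (price, [providers]) for column 1 (input) or 2 (output), keeping ties."""
    best = min(row[column] for row in offers)
    return best, [row[0] for row in offers if row[column] == best]


def blended(row, input_share=0.75):
    """Hypothetical blended $/1M price for a workload that is `input_share` input tokens."""
    _, input_price, output_price = row
    return input_share * input_price + (1 - input_share) * output_price


best_input = cheapest(OFFERS, 1)    # lowest input price and every provider offering it
best_output = cheapest(OFFERS, 2)   # lowest output price and every provider offering it
ranked = sorted(OFFERS, key=blended)  # cheapest blended price first

print("Best input:", best_input)
print("Best output:", best_output)
print("Blended ranking:", [row[0] for row in ranked])
```

Keeping ties matters here because two providers share the lowest input price, so a single "best input" label hides one of them. Which provider is cheapest overall depends on the input/output mix of the workload, which is why the blended ranking takes the share as a parameter.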

Input vs output · per provider

Cost calculator

Use this block to stress-test Qwen3 32B cost without a spreadsheet. All estimates come from the public list rates on this page.

Provider: Openrouter (In: $0.080/M · Out: $0.280/M)
Requests / day: 10,000
Avg input tokens: 1,000 (0.008000¢ / req)
Avg output tokens: 500 (0.014000¢ / req)
Daily: $2.20
Monthly: $66
Annual: $803
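
The arithmetic behind these estimates can be sketched as follows. The workload values (10,000 requests per day, 1,000 input tokens and 500 output tokens per request) are not printed as inputs on the page; they are inferred from the per-request and daily figures at the $0.080 / $0.280 rates. The 30-day month and 365-day year are likewise assumptions that happen to match the $66 and $803 totals. The function name `estimate_cost` is hypothetical.

```python
def estimate_cost(requests_per_day, avg_input_tokens, avg_output_tokens,
                  input_per_million, output_per_million,
                  days_per_month=30, days_per_year=365):
    """Estimate spend in USD from list prices quoted per one million tokens."""
    input_per_req = avg_input_tokens * input_per_million / 1_000_000
    output_per_req = avg_output_tokens * output_per_million / 1_000_000
    daily = requests_per_day * (input_per_req + output_per_req)
    return {
        "input_cents_per_req": input_per_req * 100,
        "output_cents_per_req": output_per_req * 100,
        "daily": daily,
        "monthly": daily * days_per_month,   # assumed 30-day month
        "annual": daily * days_per_year,     # assumed 365-day year
    }


# Inferred workload at the $0.080 input / $0.280 output list rates.
estimate = estimate_cost(
    requests_per_day=10_000,
    avg_input_tokens=1_000,
    avg_output_tokens=500,
    input_per_million=0.080,
    output_per_million=0.280,
)

for name, value in estimate.items():
    print(f"{name}: {value:.6f}")
```

Swapping in another provider's rates from the pricing table only changes the last two arguments, which makes it easy to compare the same workload across offers.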

Model specifications

These fields describe Qwen3 32B as we store it (family: Qwen; source: Alibaba). They sit next to the prices so buyers can check limits and tool support in one place.

Context window: 40,960 tokens
Max output: 40,960 tokens
Vision (images): No
Tool / function calling: Yes
Streaming: No
Released: Apr 2025
Primary provider: Alibaba
Model family: Qwen
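
One practical use of these limits is checking how many output tokens a request can still ask for. The sketch below assumes that prompt and completion share the 40,960-token context window, which is how most providers apply such limits but is not stated on this page; individual providers may also enforce lower caps. The function name `max_completion_tokens` is hypothetical.

```python
CONTEXT_WINDOW = 40_960  # "Context window" from the specification list
MAX_OUTPUT = 40_960      # "Max output" from the specification list


def max_completion_tokens(prompt_tokens, requested=None):
    """Return the largest completion budget that fits next to the prompt.

    Assumes prompt and completion tokens share one context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    if prompt_tokens >= CONTEXT_WINDOW:
        raise ValueError("prompt alone fills or exceeds the context window")
    room = min(CONTEXT_WINDOW - prompt_tokens, MAX_OUTPUT)
    return room if requested is None else min(requested, room)


print(max_completion_tokens(1_000))          # room left after a 1,000-token prompt
print(max_completion_tokens(40_000, 2_000))  # request clipped to the remaining room
```

Because the stored output limit equals the context window, the window is the binding constraint in practice: any non-empty prompt reduces the usable output budget below the listed maximum.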

Frequently asked questions

Answers pull from the same numbers you see on this page.

Is Qwen3 32B available on other providers?

Yes. Qwen3 32B is available on Openrouter, Sambanova, Ovhcloud, Nebius, Fireworks AI, Groq, and DeepInfra.