Qwen3 32B pricing

Qwen3-32B is a dense 32.8B-parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, coding, and logical inference, and a "non-thinking" mode for faster, general-purpose conversation. The model demonstrates strong performance in instruction following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. Below are 7 current provider rows with input and output prices in US dollars per million tokens. The lowest input price is $0.080 and the lowest output price is $0.230.

41K context · 7 providers · verified Apr 7, 2026
Best input: $0.080 per 1M tokens · Ovhcloud
Best output: $0.230 per 1M tokens · Ovhcloud

Pricing across providers

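All figures are list prices in US dollars per million tokens unless a column says otherwise. 7 offers are listed for Qwen3 32B. Best input price in this view: Ovhcloud, tied with Openrouter at $0.080. Ovhcloud also has the lowest output price at $0.230.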

| Provider     | Input / 1M | Output / 1M |
|--------------|------------|-------------|
| Nebius       | $0.100     | $0.300      |
| Ovhcloud     | $0.080     | $0.230      |
| Sambanova    | $0.400     | $0.800      |
| Openrouter   | $0.080     | $0.240      |
| Groq         | $0.290     | $0.590      |
| Fireworks AI | $0.900     | $0.900      |
| DeepInfra    | $0.100     | $0.280      |
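Because input and output tokens are priced separately, which provider is cheapest for a given workload depends on the mix of the two. The following is a minimal sketch, not part of the page's own tooling, that ranks the seven rows above by a blended price. The names `PRICES`, `blended_price`, `rank_providers`, and the `input_share` parameter are hypothetical, and the 75% input share in the usage example is an illustrative assumption.

```python
# List prices in USD per 1M tokens, copied from the provider rows on this page.
PRICES = {
    "Nebius": (0.100, 0.300),
    "Ovhcloud": (0.080, 0.230),
    "Sambanova": (0.400, 0.800),
    "Openrouter": (0.080, 0.240),
    "Groq": (0.290, 0.590),
    "Fireworks AI": (0.900, 0.900),
    "DeepInfra": (0.100, 0.280),
}


def blended_price(provider: str, input_share: float) -> float:
    """USD per 1M tokens when `input_share` of all tokens are input tokens."""
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    price_in, price_out = PRICES[provider]
    return input_share * price_in + (1.0 - input_share) * price_out


def rank_providers(input_share: float) -> list[tuple[str, float]]:
    """Return (provider, blended price) pairs, cheapest first."""
    ranked = [(name, blended_price(name, input_share)) for name in PRICES]
    # sorted() is stable, so providers with equal prices keep page order.
    return sorted(ranked, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Assumed workload: 75% of tokens are input (prompt), 25% are output.
    for name, price in rank_providers(input_share=0.75):
        print(f"{name:<13} ${price:.4f} per 1M blended tokens")
```

At these list prices Ovhcloud ranks first for every mix, since it has the lowest price on both sides. The ordering of the remaining providers can shift as the output share grows.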

[Chart: input vs output price per provider]

Cost calculator

Use this block to stress-test Qwen3 32B cost without a spreadsheet. All estimates come from the public list rates on this page.

Provider rates: In: $0.100/M · Out: $0.300/M

Input cost: 0.010000¢ / req
Output cost: 0.015000¢ / req

Daily: $2.50
Monthly: $75
Annual: $913
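The page does not show the calculator's inputs, so the sketch below reconstructs the arithmetic under stated assumptions. The figures above are consistent with 1,000 input tokens and 500 output tokens per request, 10,000 requests per day, a 30-day month, and a 365-day year at the $0.100 / $0.300 rates (the Nebius row). Those workload values are inferred from the numbers, not stated by the source, and the function name `estimate_cost` and its parameters are hypothetical.

```python
def estimate_cost(
    price_in_per_m: float,
    price_out_per_m: float,
    input_tokens: int,
    output_tokens: int,
    requests_per_day: int,
) -> dict[str, float]:
    """Estimate spend from list prices given in USD per 1M tokens."""
    # Per-request cost in USD for each side of the request.
    input_usd = input_tokens * price_in_per_m / 1_000_000
    output_usd = output_tokens * price_out_per_m / 1_000_000
    daily_usd = (input_usd + output_usd) * requests_per_day
    return {
        "input_cents_per_request": input_usd * 100,
        "output_cents_per_request": output_usd * 100,
        "daily_usd": daily_usd,
        "monthly_usd": daily_usd * 30,   # assumed 30-day month
        "annual_usd": daily_usd * 365,   # assumed 365-day year
    }


if __name__ == "__main__":
    # Assumed workload, chosen because it reproduces the figures on the page.
    result = estimate_cost(
        price_in_per_m=0.100,
        price_out_per_m=0.300,
        input_tokens=1_000,
        output_tokens=500,
        requests_per_day=10_000,
    )
    for key, value in result.items():
        print(f"{key}: {value:.6f}")
```

Under these assumptions the annual estimate is about $912.50, which agrees with the page's $913 after rounding to whole dollars.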

Model specifications

These fields describe Qwen3 32B as we store it (family: Qwen; source: Alibaba). They sit next to the prices so buyers can check limits and tool support in one place.

Context window: 40,960 tokens
Max output: 40,960 tokens
Vision (images): No
Tool / function calling: Yes
Streaming: No
Released: Apr 2025
Primary provider: Alibaba
Model family: Qwen

Frequently asked questions

Answers pull from the same numbers you see on this page.

Qwen3 32B is available from seven providers: Nebius, Ovhcloud, Sambanova, Openrouter, Groq, Fireworks AI, and DeepInfra.
