AlibabaQwenVisionTool use

Qwen3 VL 32B Instruct pricing

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text comprehension, enabling fine-grained spatial reasoning, document and scene analysis, and long-horizon video understanding.Robust OCR in 32 languages, and enhanced multimodal fusion through Interleaved-MRoPE and DeepStack architectures. Optimized for agentic. Below you will find 3 current rows with input and output dollars per million. Right now the lowest input is $0.104 and the lowest output is $0.416.

4K context·3 providers·verified Apr 7, 2026
Best input$0.104per 1M tokens · Openrouter
Best output$0.416per 1M tokens · Openrouter

Pricing across providers

Use this table to read Qwen3 VL 32B Instruct list prices. We show 3 sources right now. Lowest input in the grid: Openrouter. The chart below the table helps when output prices are much higher than input prices.

O
Openrouter
Input / 1M
$0.104
Output / 1M
$0.416
AC
Alibaba Cloud
Input / 1M
$0.160
Output / 1M
$0.640
FA
Fireworks AI
Input / 1M
$0.900
Output / 1M
$0.900

Input vs output · per provider

Cost calculator

Use this block to stress test Qwen3 VL 32B Instruct cost without a spreadsheet. All estimates come from public list rates in this page.

Provider

In: $0.104/M·Out: $0.416/M

0.010400¢ / req

0.020800¢ / req

Daily
$3.12
Monthly
$94
Annual
$1.1K

Model specifications

Context length, caps, and capability flags for Qwen3 VL 32B Instruct. Family: Qwen. Values follow the main provider (Alibaba) record in our index.

Context window
4,096 tokens
Max output
4,096 tokens
Vision (images)
Yes
Tool / function calling
Yes
Streaming
No
Released
Oct 2025
Primary provider
Alibaba
Model family
Qwen

Compare Qwen3 VL 32B Instruct

Jump into a comparison when you want one table for two models instead of two tabs. 6 curated matches for Qwen3 VL 32B Instruct.

Frequently asked questions

Read these after the table if you want plain language around Qwen3 VL 32B Instruct rates. The short model note from our index: Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual percepti...

Yes. Qwen3 VL 32B Instruct is available on Openrouter, Alibaba Cloud, Fireworks AI.

Also from Alibaba

Other models by Alibaba with live pricing in our catalog.