Qwen3 VL 32B Instruct pricing
Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text comprehension, enabling fine-grained spatial reasoning, document and scene analysis, and long-horizon video understanding. It offers robust OCR in 32 languages and enhanced multimodal fusion through the Interleaved-MRoPE and DeepStack architectures. Optimized for agentic... The table below has 3 current rows with input and output prices in US dollars per million tokens. Right now the lowest input price is $0.104 and the lowest output price is $0.416 per million tokens.
Pricing across providers
Use this table to read Qwen3 VL 32B Instruct list prices. We show 3 sources right now, and Openrouter has the lowest input price in the grid. The chart below the table helps when output prices are much higher than input prices.
| Provider | Input / 1M tokens | Output / 1M tokens | Cached input | Batch |
|---|---|---|---|---|
| Openrouter | $0.104 | $0.416 | — | — |
| Alibaba Cloud | $0.160 | $0.640 | — | — |
| Fireworks AI | $0.900 | $0.900 | — | — |
Input vs output · per provider
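The comparison the table and chart invite can be sketched in a few lines of Python. This is a minimal illustration, not part of the page's tooling: the `PRICES` dictionary simply copies the three list prices shown above, and the helper names `cheapest` and `output_to_input_ratio` are hypothetical.

```python
# List prices copied from the table above, in US dollars per 1M tokens.
PRICES = {
    "Openrouter": {"input": 0.104, "output": 0.416},
    "Alibaba Cloud": {"input": 0.160, "output": 0.640},
    "Fireworks AI": {"input": 0.900, "output": 0.900},
}


def cheapest(prices, kind):
    """Return the provider with the lowest price for `kind` ("input" or "output")."""
    return min(prices, key=lambda name: prices[name][kind])


def output_to_input_ratio(prices):
    """Map each provider to its output price divided by its input price."""
    return {name: p["output"] / p["input"] for name, p in prices.items()}


if __name__ == "__main__":
    print("Lowest input:", cheapest(PRICES, "input"))
    print("Lowest output:", cheapest(PRICES, "output"))
    for name, ratio in output_to_input_ratio(PRICES).items():
        print(f"{name}: output costs {ratio:.1f}x input")
```

A ratio near 1 means input and output tokens cost about the same at that provider, while a higher ratio means output-heavy workloads dominate the bill.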
Cost calculator
Use this block to stress test Qwen3 VL 32B Instruct cost without a spreadsheet. All estimates come from public list rates in this page.
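The arithmetic behind a per-request estimate is simple enough to reproduce by hand. The sketch below assumes a hypothetical workload of 1,000 input tokens and 500 output tokens per request and uses the lowest list rates from this page ($0.104 input, $0.416 output per million tokens); the function names are illustrative, not the calculator's actual code.

```python
def request_cost_usd(input_tokens, output_tokens, input_per_million, output_per_million):
    """Estimate one request's cost in US dollars from per-million-token list rates."""
    total = input_tokens * input_per_million + output_tokens * output_per_million
    return total / 1_000_000


def monthly_cost_usd(cost_per_request, requests_per_month):
    """Scale a per-request estimate to a monthly volume."""
    return cost_per_request * requests_per_month


if __name__ == "__main__":
    # Assumed workload: 1,000 prompt tokens and 500 completion tokens per request.
    per_request = request_cost_usd(1_000, 500, 0.104, 0.416)
    print(f"{per_request * 100:.6f}¢ / req")
    # Assumed volume: one million requests per month.
    print(f"${monthly_cost_usd(per_request, 1_000_000):,.2f} / month")
```

Swapping in another provider's rates from the table gives a like-for-like comparison for the same workload. These are list-price estimates only and ignore cached-input or batch discounts, which the table does not list for any provider.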
Model specifications
Context length, caps, and capability flags for Qwen3 VL 32B Instruct. Family: Qwen. Values follow the main provider (Alibaba) record in our index.
- Context window: 4,096 tokens
- Max output: 4,096 tokens
- Vision (images): Yes
- Tool / function calling: Yes
- Streaming: No
- Released: Oct 2025
- Primary provider: Alibaba
- Model family: Qwen
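The two token limits above interact when sizing a request. The following sketch assumes that prompt and completion tokens share the same context window, which this page does not state and which may differ by provider; the constants are copied from the list above and the helper name `max_completion_budget` is hypothetical.

```python
CONTEXT_WINDOW = 4_096  # tokens, from the specification list
MAX_OUTPUT = 4_096      # tokens, from the specification list


def max_completion_budget(prompt_tokens, context_window=CONTEXT_WINDOW, max_output=MAX_OUTPUT):
    """Return how many output tokens can still be requested for a given prompt size.

    Assumption: the prompt and the completion both count against one shared
    context window, so the budget is capped by whichever limit is tighter.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = context_window - prompt_tokens
    return max(0, min(max_output, remaining))


if __name__ == "__main__":
    for prompt in (0, 1_000, 5_000):
        print(prompt, "prompt tokens ->", max_completion_budget(prompt), "output tokens available")
```

Under that assumption, a prompt that already exceeds the context window leaves no room for output, so the function returns 0 rather than a negative number.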
Compare Qwen3 VL 32B Instruct
Jump into a comparison when you want one table for two models instead of two tabs. 6 curated matches for Qwen3 VL 32B Instruct.
- Qwen3 VL 32B Instruct vs GPT-4o
- Qwen3 VL 32B Instruct vs GPT-4o mini
- Qwen3 VL 32B Instruct vs Claude Sonnet 4.6
- Qwen3 VL 32B Instruct vs Gemini 2.0 Flash
- Qwen3 VL 32B Instruct vs o3
- Qwen3 VL 32B Instruct vs Llama 3.1 70B
Frequently asked questions
Read these after the table if you want plain language around Qwen3 VL 32B Instruct rates.
Also from Alibaba
Other models by Alibaba with live pricing in our catalog.