Alibaba · Qwen · Vision · Tool use

Qwen3 VL 235B A22B Instruct pricing

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table extraction, multilingual OCR). The series emphasizes robust perception (recognition of diverse real-world and synthetic categories), spatial understanding (2D/3D grounding), and long-form visual comprehension, with competitive results on public multimodal benchmarks. Below are four current price rows, each showing input and output cost in dollars per million tokens. The lowest input price is currently $0.200 and the lowest output price is $0.880.

262K context · 4 providers · verified Apr 7, 2026
Best input: $0.200 per 1M tokens · Openrouter
Best output: $0.880 per 1M tokens · Openrouter

Pricing across providers

All figures are list prices per million tokens unless a column says otherwise. 4 offers are listed for Qwen3 VL 235B A22B Instruct. Best input in this view: Openrouter.

Provider · Input / 1M · Output / 1M
Openrouter · $0.200 · $0.880
Novita · $0.300 · $1.50
Alibaba Cloud · $0.400 · $1.60
Fireworks AI · $0.220 · $0.880
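
The rows above can also be compared programmatically. The following is a minimal sketch that assumes only the list prices shown on this page. The `OFFERS` dictionary and the `cheapest` helper are illustrative names and are not part of any provider's API.

```python
# List prices in USD per 1M tokens, copied from the provider rows above.
OFFERS = {
    "Openrouter": {"input": 0.200, "output": 0.880},
    "Novita": {"input": 0.300, "output": 1.50},
    "Alibaba Cloud": {"input": 0.400, "output": 1.60},
    "Fireworks AI": {"input": 0.220, "output": 0.880},
}


def cheapest(kind):
    """Return (lowest price, sorted names of every provider at that price)."""
    low = min(prices[kind] for prices in OFFERS.values())
    names = sorted(name for name, prices in OFFERS.items() if prices[kind] == low)
    return low, names


print("input:", cheapest("input"))
print("output:", cheapest("output"))
```

Returning every provider at the lowest price, rather than a single name, makes ties visible: on output, Openrouter and Fireworks AI are both listed at $0.880.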

Cost calculator

Pick any of the providers above and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for Qwen3 VL 235B A22B Instruct.

Selected provider rates: In: $0.200/M · Out: $0.880/M

Per request: 0.020000¢ input · 0.044000¢ output

Daily: $6.40
Monthly: $192
Annual: $2.3K
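
The arithmetic behind totals like these can be sketched as follows. The workload used here (1,000 input tokens and 500 output tokens per request, 10,000 requests per day, a 30-day month, and a 365-day year) is an assumption, chosen because it reproduces the figures shown above. It is not a documented default of the calculator, and `estimate_cost` is a hypothetical helper.

```python
def estimate_cost(in_price, out_price, in_tokens, out_tokens, requests_per_day):
    """Rough dollar totals from list prices quoted in USD per 1M tokens."""
    in_per_req = in_tokens * in_price / 1_000_000     # USD per request, input side
    out_per_req = out_tokens * out_price / 1_000_000  # USD per request, output side
    daily = (in_per_req + out_per_req) * requests_per_day
    return {
        "input_cents_per_request": in_per_req * 100,
        "output_cents_per_request": out_per_req * 100,
        "daily": daily,
        "monthly": daily * 30,   # assumption: 30-day month
        "annual": daily * 365,   # assumption: 365-day year
    }


# Assumed workload at the $0.200 input / $0.880 output rates.
totals = estimate_cost(0.200, 0.880, 1_000, 500, 10_000)
for name, value in totals.items():
    print(f"{name}: {value:.4f}")
```

Under these assumptions the sketch gives about $6.40 per day, $192 per month, and $2,336 per year, in line with the rounded figures above.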

Model specifications

Quick spec sheet for Qwen3 VL 235B A22B Instruct before you dive back into pricing. Family: Qwen. Reported under Alibaba.

Context window: 262,144 tokens
Max output: 262,144 tokens
Vision (images): Yes
Tool / function calling: Yes
Streaming: No
Released: Sep 2025
Primary provider: Alibaba
Model family: Qwen

Frequently asked questions

Answers pull from the same numbers you see on this page.

Qwen3 VL 235B A22B Instruct is available from four providers: Openrouter, Novita, Alibaba Cloud, and Fireworks AI.
