Qwen3 VL 8B Thinking pricing
Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences. It integrates enhanced multimodal alignment and... This page tracks one listing. The lowest listed prices are $0.117 per million input tokens and $1.36 per million output tokens (see the table for which provider offers each).
Pricing across providers
This table shows Qwen3 VL 8B Thinking list prices per million tokens. We currently show one source, Openrouter, which therefore has the lowest input price in the grid. The chart below the table helps when output prices are much higher than input prices.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
| Openrouter | $0.117 | $1.36 | — | — |
Input vs output · 1M tokens
Cost calculator
Pick any provider row and enter how many tokens you expect to use per day, week, or year. We convert that into approximate dollar totals for Qwen3 VL 8B Thinking.
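The arithmetic behind such an estimate can be sketched in a few lines of Python. This is a minimal illustration, not the page's actual calculator: the function names are hypothetical, the prices are the Openrouter list prices from the table, and it assumes a daily token volume that is simply scaled to a week (7 days) and a year (365 days).

```python
# Openrouter list prices from the table, in US dollars per 1 million tokens.
INPUT_USD_PER_MTOK = 0.117
OUTPUT_USD_PER_MTOK = 1.36


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the list-price cost in USD for the given token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


def project_usage(daily_input_tokens: int, daily_output_tokens: int) -> dict:
    """Scale a daily volume to weekly and yearly totals (assumes steady usage)."""
    daily = estimate_cost_usd(daily_input_tokens, daily_output_tokens)
    return {"day": daily, "week": daily * 7, "year": daily * 365}


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500k output tokens per day.
    for period, usd in project_usage(2_000_000, 500_000).items():
        print(f"{period}: ${usd:,.2f}")
```

Because the output rate is more than ten times the input rate, the output token count usually dominates the total, so it is worth estimating it carefully.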
Model specifications
These fields describe Qwen3 VL 8B Thinking as we store it (family: Qwen; source: Alibaba). They sit next to the pricing so buyers can check limits and tool support in one place.
- Context window: 131,072 tokens
- Max output: 32,768 tokens
- Vision (images): Yes
- Tool / function calling: Yes
- Streaming: Yes
- Released: Oct 2025
- Primary provider: Alibaba
- Model family: Qwen
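As a rough illustration of how the two token limits interact, the sketch below checks whether a planned request stays within them. It assumes the prompt and the generated output share the 131,072-token context window, which is a common convention but is not stated on this page; the function names are hypothetical.

```python
# Limits from the specification list above.
CONTEXT_WINDOW_TOKENS = 131_072
MAX_OUTPUT_TOKENS = 32_768


def fits_limits(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Check a request against both limits.

    Assumption: prompt and output tokens count against the same context window.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW_TOKENS


def max_prompt_tokens(requested_output_tokens: int) -> int:
    """Largest prompt that still leaves room for the requested output."""
    if not 0 <= requested_output_tokens <= MAX_OUTPUT_TOKENS:
        raise ValueError("requested output exceeds the max output limit")
    return CONTEXT_WINDOW_TOKENS - requested_output_tokens


if __name__ == "__main__":
    print(fits_limits(100_000, 31_072))
    print(max_prompt_tokens(MAX_OUTPUT_TOKENS))
```

Under that assumption, reserving the full 32,768-token output leaves 98,304 tokens for the prompt, including any image tokens.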
Compare Qwen3 VL 8B Thinking
These links open full side-by-side comparison pages for Qwen3 VL 8B Thinking. We picked pairings that people often evaluate together; six are available.
Popular Qwen3 VL 8B Thinking comparisons
- Qwen3 VL 8B Thinking vs GPT-4o
- Qwen3 VL 8B Thinking vs GPT-4o mini
- Qwen3 VL 8B Thinking vs Claude Sonnet 4.6
- Qwen3 VL 8B Thinking vs Gemini 2.0 Flash
- Qwen3 VL 8B Thinking vs o3
- Qwen3 VL 8B Thinking vs Llama 3.1 70B
Frequently asked questions
Read these after the table if you want a plain-language explanation of Qwen3 VL 8B Thinking rates.
Also from Alibaba
Other models by Alibaba with live pricing in our catalog.