Qwen3 32b Fp8 pricing
Compare Qwen3 32b Fp8 API pricing across 2 listed sources. The best input rate we show is $0.050 per million tokens from Lambda. The best output rate is $0.100 per million tokens from Lambda.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. 2 offers are listed for Qwen3 32b Fp8. Best input in this view: Lambda.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
N Novita | $0.100 | $0.450 | — | — |
L Lambda | $0.050 | $0.100 | — | — |
Input vs output · per provider
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Qwen3 32b Fp8 cost scales with traffic.
Provider
0.010000¢ / req
0.022500¢ / req
Model specifications
Context length, caps, and capability flags for Qwen3 32b Fp8. Family: Qwen. Values follow the main provider (Alibaba) record in our index.
- Context window
- 131,072 tokens
- Max output
- 131,072 tokens
- Vision (images)
- No
- Tool / function calling
- Yes
- Streaming
- No
- Released
- N/A
- Primary provider
- Alibaba
- Model family
- Qwen
Compare Qwen3 32b Fp8
Jump into a comparison when you want one table for two models instead of two tabs. 6 curated matches for Qwen3 32b Fp8.
- Qwen3 32b Fp8 vs GPT-4o
Compare pricing side by side
- Qwen3 32b Fp8 vs GPT-4o mini
Compare pricing side by side
- Qwen3 32b Fp8 vs Claude Sonnet 4.6
Compare pricing side by side
- Qwen3 32b Fp8 vs Gemini 2.0 Flash
Compare pricing side by side
- Qwen3 32b Fp8 vs o3
Compare pricing side by side
- Qwen3 32b Fp8 vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Read these after the table if you want plain language around Qwen3 32b Fp8 rates.
Also from Alibaba
Other models by Alibaba with live pricing in our catalog.