Qwen3 4b Fp8 pricing
If you are budgeting for Qwen3 4b Fp8, start with the numbers below. We index 1 provider price. Cheapest input is $0.030 per million tokens. Cheapest output is $0.030 per million tokens. The model lists a 128K context window in our data.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. 1 offer is listed for Qwen3 4b Fp8. Best input in this view: Novita.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
N Novita | $0.030 | $0.030 | — | — |
Input vs output · 1M tokens
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Qwen3 4b Fp8 cost scales with traffic.
0.003000¢ / req
0.001500¢ / req
Model specifications
Quick spec sheet for Qwen3 4b Fp8 before you dive back into pricing. Family: Qwen. Reported under Alibaba.
- Context window
- 128,000 tokens
- Max output
- 20,000 tokens
- Vision (images)
- No
- Tool / function calling
- No
- Streaming
- No
- Released
- N/A
- Primary provider
- Alibaba
- Model family
- Qwen
Compare Qwen3 4b Fp8
Jump into a comparison when you want one table for two models instead of two tabs. 6 curated matches for Qwen3 4b Fp8.
Locked
Compare with
Pick a model on both sides.
Popular Qwen3 4b Fp8 comparisons
- Qwen3 4b Fp8 vs GPT-4o
Compare pricing side by side
- Qwen3 4b Fp8 vs GPT-4o mini
Compare pricing side by side
- Qwen3 4b Fp8 vs Claude Sonnet 4.6
Compare pricing side by side
- Qwen3 4b Fp8 vs Gemini 2.0 Flash
Compare pricing side by side
- Qwen3 4b Fp8 vs o3
Compare pricing side by side
- Qwen3 4b Fp8 vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Quick frequently asked items for Qwen3 4b Fp8 pricing and limits.
Also from Alibaba
Other models by Alibaba with live pricing in our catalog.