AlibabaQwen

Qwen3 4b Fp8 pricing

If you are budgeting for Qwen3 4b Fp8, start with the numbers below. We index 1 provider price. Cheapest input is $0.030 per million tokens. Cheapest output is $0.030 per million tokens. The model lists a 128K context window in our data.

128K context·1 provider·verified Apr 7, 2026
Best input$0.030per 1M tokens · Novita
Best output$0.030per 1M tokens · Novita

Pricing across providers

All figures are list prices per million tokens unless a column says otherwise. 1 offer is listed for Qwen3 4b Fp8. Best input in this view: Novita.

N
Novita
Input / 1M
$0.030
Output / 1M
$0.030

Input vs output · 1M tokens

Cost calculator

The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Qwen3 4b Fp8 cost scales with traffic.

In: $0.030/M·Out: $0.030/M

0.003000¢ / req

0.001500¢ / req

Daily
$0.45
Monthly
$14
Annual
$164

Model specifications

Quick spec sheet for Qwen3 4b Fp8 before you dive back into pricing. Family: Qwen. Reported under Alibaba.

Context window
128,000 tokens
Max output
20,000 tokens
Vision (images)
No
Tool / function calling
No
Streaming
No
Released
N/A
Primary provider
Alibaba
Model family
Qwen

Compare Qwen3 4b Fp8

Jump into a comparison when you want one table for two models instead of two tabs. 6 curated matches for Qwen3 4b Fp8.

Locked

Compare with

Pick a model on both sides.

Popular Qwen3 4b Fp8 comparisons

Frequently asked questions

Quick frequently asked items for Qwen3 4b Fp8 pricing and limits.

Yes. Qwen3 4b Fp8 is available on Novita.

Also from Alibaba

Other models by Alibaba with live pricing in our catalog.