Qwen3 4b Instruct 2507 pricing
Compare Qwen3 4b Instruct 2507 API pricing across 1 listed source. The best input rate we show is $0.200 per million tokens from Fireworks AI. The best output rate is $0.200 per million tokens from Fireworks AI.
Pricing across providers
Every row is a seller of Qwen3 4b Instruct 2507 with token pricing we track. The cheapest input in this snapshot is from Fireworks AI. The bar chart shows the same input and output dollars per million for a quick scan.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
FA Fireworks AI | $0.200 | $0.200 | — | — |
Input vs output · 1M tokens
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Qwen3 4b Instruct 2507 cost scales with traffic.
0.020000¢ / req
0.010000¢ / req
Model specifications
Context length, caps, and capability flags for Qwen3 4b Instruct 2507. Family: Qwen. Values follow the main provider (Alibaba) record in our index.
- Context window
- 262,144 tokens
- Max output
- 262,144 tokens
- Vision (images)
- No
- Tool / function calling
- No
- Streaming
- No
- Released
- N/A
- Primary provider
- Alibaba
- Model family
- Qwen
Compare Qwen3 4b Instruct 2507
Jump into a comparison when you want one table for two models instead of two tabs. 6 curated matches for Qwen3 4b Instruct 2507.
- Qwen3 4b Instruct 2507 vs GPT-4o
Compare pricing side by side
- Qwen3 4b Instruct 2507 vs GPT-4o mini
Compare pricing side by side
- Qwen3 4b Instruct 2507 vs Claude Sonnet 4.6
Compare pricing side by side
- Qwen3 4b Instruct 2507 vs Gemini 2.0 Flash
Compare pricing side by side
- Qwen3 4b Instruct 2507 vs o3
Compare pricing side by side
- Qwen3 4b Instruct 2507 vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Answers pull from the same numbers you see on this page.
Also from Alibaba
Other models by Alibaba with live pricing in our catalog.