Qwen3 4b Instruct 2507 Gguf pricing
Compare Qwen3 4b Instruct 2507 Gguf API pricing across 1 listed source. The best input rate we show is $0.0000 per million tokens from Lemonade. The best output rate is $0.0000 per million tokens from Lemonade.
Pricing across providers
Every row is a seller of Qwen3 4b Instruct 2507 Gguf with token pricing we track. The cheapest input in this snapshot is from Lemonade. The bar chart shows the same input and output dollars per million for a quick scan.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
L Lemonade | $0.0000 | $0.0000 | — | — |
Input vs output · 1M tokens
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Qwen3 4b Instruct 2507 Gguf cost scales with traffic.
Model specifications
These fields describe Qwen3 4b Instruct 2507 Gguf as we store it (Family: Qwen. source: Alibaba). They sit next to price so buyers can check limits and tools in one place.
- Context window
- 262,144 tokens
- Max output
- 32,768 tokens
- Vision (images)
- No
- Tool / function calling
- Yes
- Streaming
- No
- Released
- N/A
- Primary provider
- Alibaba
- Model family
- Qwen
Compare Qwen3 4b Instruct 2507 Gguf
Open a pair page to see Qwen3 4b Instruct 2507 Gguf next to another model with a shared provider matrix. 6 shortcuts below.
- Qwen3 4b Instruct 2507 Gguf vs GPT-4o
Compare pricing side by side
- Qwen3 4b Instruct 2507 Gguf vs GPT-4o mini
Compare pricing side by side
- Qwen3 4b Instruct 2507 Gguf vs Claude Sonnet 4.6
Compare pricing side by side
- Qwen3 4b Instruct 2507 Gguf vs Gemini 2.0 Flash
Compare pricing side by side
- Qwen3 4b Instruct 2507 Gguf vs o3
Compare pricing side by side
- Qwen3 4b Instruct 2507 Gguf vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Read these after the table if you want plain language around Qwen3 4b Instruct 2507 Gguf rates.
Also from Alibaba
Other models by Alibaba with live pricing in our catalog.