Is Qwen3 4b Instruct 2507 Gguf available on other providers?

Name: Qwen3 4b Instruct 2507 Gguf
Author: Alibaba

AlibabaQwenTool use

Qwen3 4b Instruct 2507 Gguf pricing

Compare Qwen3 4b Instruct 2507 Gguf API pricing across 1 listed source. The best input rate we show is $0.0000 per million tokens from Lemonade. The best output rate is $0.0000 per million tokens from Lemonade.

262K context·1 provider·verified Apr 7, 2026

Estimate cost Compare with GPT-4o Pick any other model

Best inputFreeper 1M tokens · Lemonade

Best outputFreeper 1M tokens · Lemonade

Pricing across providers

Every row is a seller of Qwen3 4b Instruct 2507 Gguf with token pricing we track. The cheapest input in this snapshot is from Lemonade. The bar chart shows the same input and output dollars per million for a quick scan.

Lemonade

Input / 1M

$0.0000

Output / 1M

$0.0000

Provider	Input / 1M	Output / 1M	Cached input	Batch
L Lemonade	$0.0000	$0.0000	—	—

Input vs output · 1M tokens

Cost calculator

The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Qwen3 4b Instruct 2507 Gguf cost scales with traffic.

In: $0.0000/M·Out: $0.0000/M

Requests / day

Avg input tokens

Avg output tokens

Daily

$0.0000

Monthly

$0.0000

Annual

$0.0000

Model specifications

These fields describe Qwen3 4b Instruct 2507 Gguf as we store it (Family: Qwen. source: Alibaba). They sit next to price so buyers can check limits and tools in one place.

Context window: 262,144 tokens
Max output: 32,768 tokens
Vision (images): No
Tool / function calling: Yes
Streaming: No
Released: N/A
Primary provider: Alibaba
Model family: Qwen