Qwen3 14B pricing

Qwen3-14B is a dense 14.8B-parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, programming, and logical inference, and a "non-thinking" mode for general-purpose conversation. The model is fine-tuned for instruction following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K-token contexts. Below are 4 current provider rows with input and output prices in dollars per million tokens. The lowest input price is currently $0.060 and the lowest output price is $0.200.

41K context · 4 providers · verified Apr 7, 2026
Best input: $0.060 per 1M tokens · Openrouter
Best output: $0.200 per 1M tokens · Fireworks AI

Pricing across providers

All figures are list prices per million tokens unless a column says otherwise. 4 offers are listed for Qwen3 14B. Best input in this view: Openrouter at $0.060, a price DeepInfra matches.

Provider        Input / 1M    Output / 1M
Openrouter      $0.060        $0.240
Nebius          $0.080        $0.240
Fireworks AI    $0.200        $0.200
DeepInfra       $0.060        $0.240
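The "best input" and "best output" figures can be rechecked from the rows above. The following is a minimal sketch, not the site's own logic: the `OFFERS` mapping and the `cheapest` helper are hypothetical names introduced here, and the prices are copied by hand from this page.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the provider rows on this page.
OFFERS = {
    "Openrouter": (Decimal("0.060"), Decimal("0.240")),
    "Nebius": (Decimal("0.080"), Decimal("0.240")),
    "Fireworks AI": (Decimal("0.200"), Decimal("0.200")),
    "DeepInfra": (Decimal("0.060"), Decimal("0.240")),
}


def cheapest(offers, column):
    """Return (lowest price, providers at that price).

    column 0 selects the input price, column 1 the output price.
    Ties are kept so that equal offers are all reported.
    """
    low = min(prices[column] for prices in offers.values())
    names = [name for name, prices in offers.items() if prices[column] == low]
    return low, names


best_input = cheapest(OFFERS, 0)
best_output = cheapest(OFFERS, 1)
print("Best input:", best_input)
print("Best output:", best_output)
```

Keeping ties matters here because two providers share the lowest input price, while only one offers the lowest output price.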

[Chart: input vs output price per provider]

Cost calculator

Use this block to stress-test Qwen3 14B cost without a spreadsheet. All estimates come from the public list rates on this page.

Provider rate: In: $0.060/M · Out: $0.240/M

Input cost: 0.006000¢ / req
Output cost: 0.012000¢ / req

Daily: $1.80
Monthly: $54
Annual: $657
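The arithmetic behind these estimates can be sketched in a few lines. The page does not state the workload behind the figures, so the values below (1,000 input tokens and 500 output tokens per request, 10,000 requests per day, a 30-day month) are assumptions chosen because they are consistent with the numbers shown; `request_cost_usd` is a hypothetical helper, not part of any provider API.

```python
def request_cost_usd(in_tokens, out_tokens, in_rate, out_rate):
    """Return (input cost, output cost) of one request in USD.

    Rates are list prices in USD per 1M tokens.
    """
    input_cost = in_tokens * in_rate / 1_000_000
    output_cost = out_tokens * out_rate / 1_000_000
    return input_cost, output_cost


# Rates shown in the calculator (USD per 1M tokens).
IN_RATE, OUT_RATE = 0.060, 0.240

# ASSUMED workload: not stated on the page, picked to match the displayed figures.
IN_TOKENS, OUT_TOKENS, REQUESTS_PER_DAY = 1_000, 500, 10_000

input_cost, output_cost = request_cost_usd(IN_TOKENS, OUT_TOKENS, IN_RATE, OUT_RATE)
daily = (input_cost + output_cost) * REQUESTS_PER_DAY
monthly = daily * 30   # assumes a 30-day month
annual = daily * 365   # assumes a 365-day year

# Per-request costs are printed in cents, totals in dollars.
print(f"{input_cost * 100:.6f}¢ in / req, {output_cost * 100:.6f}¢ out / req")
print(f"Daily ${daily:.2f}, monthly ${monthly:.0f}, annual ${annual:.0f}")
```

Under these assumptions the per-request, daily, monthly, and annual figures line up with the calculator values. Swapping in another provider's rates or a different request volume gives a quick comparison.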

Model specifications

These fields describe Qwen3 14B as we store it (family: Qwen; source: Alibaba). They sit next to the price so buyers can check limits and tool support in one place.

Context window: 40,960 tokens
Max output: 40,960 tokens
Vision (images): No
Tool / function calling: Yes
Streaming: No
Released: Apr 2025
Primary provider: Alibaba
Model family: Qwen
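The context window and max output limits can be used to budget a request before sending it. This is a rough sketch that assumes prompt and output tokens share one context window, which is common but not stated on this page; `max_completion_tokens` is a hypothetical helper, and real token counts depend on the model's tokenizer.

```python
CONTEXT_WINDOW = 40_960  # tokens, from the specifications on this page
MAX_OUTPUT = 40_960      # tokens, from the specifications on this page


def max_completion_tokens(prompt_tokens):
    """Largest output budget left for a prompt of the given size.

    ASSUMPTION: prompt and output tokens count against the same window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Never exceed the max output limit, and never go below zero.
    return max(0, min(MAX_OUTPUT, remaining))


budget = max_completion_tokens(32_000)
print(budget)
```

A result of zero means the prompt alone already fills the window under this assumption, so the prompt would need trimming.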

Frequently asked questions

Answers pull from the same numbers you see on this page.

Qwen3 14B is available on Openrouter, Nebius, Fireworks AI, and DeepInfra.
