Qwen3 14B pricing
Qwen3-14B is a dense 14.8B-parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, programming, and logical inference, and a "non-thinking" mode for general-purpose conversation. The model is fine-tuned for instruction following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K-token contexts. Below you will find four current provider offers, each with input and output prices in US dollars per million tokens. Right now the lowest input price is $0.060 and the lowest output price is $0.200.
Pricing across providers
All figures are list prices in US dollars per million tokens unless a column says otherwise. Four offers are listed for Qwen3 14B. Lowest input price in this view: $0.060, listed by both OpenRouter and DeepInfra.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
| OpenRouter | $0.060 | $0.240 | — | — |
| Nebius | $0.080 | $0.240 | — | — |
| Fireworks AI | $0.200 | $0.200 | — | — |
| DeepInfra | $0.060 | $0.240 | — | — |
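Which provider is cheapest depends on the mix of input and output tokens, since the lowest input price and the lowest output price come from different providers. The sketch below ranks the four listed offers by a blended price per million tokens. The rates are copied from the table; the names `PRICES`, `blended_price`, `rank_providers`, and the `input_share` parameter are illustrative assumptions, not part of any provider's API.

```python
# List prices in USD per 1M tokens (input, output), copied from the table above.
PRICES = {
    "OpenRouter": (0.060, 0.240),
    "Nebius": (0.080, 0.240),
    "Fireworks AI": (0.200, 0.200),
    "DeepInfra": (0.060, 0.240),
}


def blended_price(provider: str, input_share: float) -> float:
    """USD per 1M total tokens when `input_share` of all tokens are input tokens."""
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    input_rate, output_rate = PRICES[provider]
    return input_rate * input_share + output_rate * (1.0 - input_share)


def rank_providers(input_share: float) -> list[tuple[str, float]]:
    """Providers sorted from cheapest to most expensive for the given token mix."""
    ranked = [(name, blended_price(name, input_share)) for name in PRICES]
    return sorted(ranked, key=lambda item: item[1])


if __name__ == "__main__":
    # Hypothetical workload: 75% of tokens are input, 25% are output.
    for name, price in rank_providers(0.75):
        print(f"{name}: ${price:.3f} per 1M tokens")
```

With an output-only workload (`input_share=0.0`) the ranking is decided by output price alone, so the result can differ from the input-price ranking.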
[Chart: input vs. output price per provider]
Cost calculator
Use this block to stress-test Qwen3 14B costs without a spreadsheet. All estimates come from the public list rates on this page.
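The arithmetic behind a per-request estimate is straightforward: multiply each token count by its per-million rate and divide by one million. The following is a minimal sketch of that calculation; the function names and the example workload (1,000 input tokens, 500 output tokens, 10,000 requests per day, a 30-day month) are assumptions chosen for illustration, not values taken from the page's calculator.

```python
def request_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_per_million: float,
    output_per_million: float,
) -> float:
    """Cost of one request in USD, given list rates in USD per 1M tokens."""
    total = input_tokens * input_per_million + output_tokens * output_per_million
    return total / 1_000_000


def monthly_cost_usd(per_request_usd: float, requests_per_day: int, days: int = 30) -> float:
    """Projected spend over `days` days at a constant request volume."""
    return per_request_usd * requests_per_day * days


if __name__ == "__main__":
    # Rates from the table on this page: $0.060 input, $0.240 output per 1M tokens.
    cost = request_cost_usd(1_000, 500, 0.060, 0.240)
    print(f"per request: ${cost:.6f} ({cost * 100:.6f} cents)")
    print(f"per month:   ${monthly_cost_usd(cost, 10_000):.2f}")
```

For the assumed workload this comes to $0.00018 per request, or about $54 over a 30-day month at 10,000 requests per day.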
Model specifications
These fields describe Qwen3 14B as we store it (family: Qwen; source: Alibaba). They sit next to the prices so buyers can check limits and tool support in one place.
- Context window: 40,960 tokens
- Max output: 40,960 tokens
- Vision (images): No
- Tool / function calling: Yes
- Streaming: No
- Released: Apr 2025
- Primary provider: Alibaba
- Model family: Qwen
Compare Qwen3 14B
Open a pair page to see Qwen3 14B next to another model with a shared provider matrix. Each of the 6 shortcuts below compares pricing side by side.
- Qwen3 14B vs GPT-4o
- Qwen3 14B vs GPT-4o mini
- Qwen3 14B vs Claude Sonnet 4.6
- Qwen3 14B vs Gemini 2.0 Flash
- Qwen3 14B vs o3
- Qwen3 14B vs Llama 3.1 70B
Frequently asked questions
Answers draw on the same numbers you see on this page and on the short model note from our index, which matches the description at the top of the page.
Also from Alibaba
Other models by Alibaba with live pricing in our catalog.