Qwen3 Next 80B A3B Thinking pricing
Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic planning, and reports strong results across knowledge, reasoning, coding, alignment, and multilingual evaluations. Compared with prior Qwen3 variants, it emphasizes stability under long chains of thought and efficient scaling during inference, and it is tuned t. Below you will find 6 current rows with input and output dollars per million. Right now the lowest input is $0.098 and the lowest output is $0.780.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. 6 offers are listed for Qwen3 Next 80B A3B Thinking. Best input in this view: Openrouter.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
AC Alibaba Cloud | $0.150 | $1.20 | — | — |
N Novita | $0.150 | $1.50 | — | — |
O Openrouter | $0.098 | $0.780 | — | — |
D DeepInfra | $0.140 | $1.40 | — | — |
TA Together AI | $0.150 | $1.50 | — | — |
FA Fireworks AI | $0.900 | $0.900 | — | — |
Input vs output · per provider
Cost calculator
Pick any of the providers above and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for Qwen3 Next 80B A3B Thinking.
Provider
0.015000¢ / req
0.060000¢ / req
Model specifications
Context length, caps, and capability flags for Qwen3 Next 80B A3B Thinking. Family: Qwen. Values follow the main provider (Alibaba) record in our index.
- Context window
- 262,144 tokens
- Max output
- 262,144 tokens
- Vision (images)
- No
- Tool / function calling
- Yes
- Streaming
- No
- Released
- Sep 2025
- Primary provider
- Alibaba
- Model family
- Qwen
Compare Qwen3 Next 80B A3B Thinking
Jump into a comparison when you want one table for two models instead of two tabs. 6 curated matches for Qwen3 Next 80B A3B Thinking.
- Qwen3 Next 80B A3B Thinking vs GPT-4o
Compare pricing side by side
- Qwen3 Next 80B A3B Thinking vs GPT-4o mini
Compare pricing side by side
- Qwen3 Next 80B A3B Thinking vs Claude Sonnet 4.6
Compare pricing side by side
- Qwen3 Next 80B A3B Thinking vs Gemini 2.0 Flash
Compare pricing side by side
- Qwen3 Next 80B A3B Thinking vs o3
Compare pricing side by side
- Qwen3 Next 80B A3B Thinking vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Answers pull from the same numbers you see on this page. The short model note from our index: Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, ...
Also from Alibaba
Other models by Alibaba with live pricing in our catalog.