LongCat Flash Chat pricing
LongCat-Flash-Chat is a large-scale Mixture-of-Experts (MoE) model with 560B total parameters, of which 18.6B–31.3B (≈27B on average) are dynamically activated per input. It introduces a shortcut-connected MoE design to reduce... The table below lists 1 current offer, with input and output prices in dollars per million tokens. The lowest input price is currently $0.200 and the lowest output price is $0.800 per million tokens.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. One offer is listed for LongCat Flash Chat. Lowest input price in this view: OpenRouter.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
| OpenRouter | $0.200 | $0.800 | N/A | N/A |
Input vs output · 1M tokens
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how LongCat Flash Chat cost scales with traffic.
0.020000¢ / req
0.040000¢ / req
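The per-request arithmetic behind the calculator can be sketched as follows. This is a minimal illustration, not the page's own implementation: the function name `request_cost_cents` and the request size of 1,000 input tokens and 500 output tokens are assumptions. That request size happens to be consistent with the two per-request figures shown, but the page does not state the slider settings.

```python
from decimal import Decimal

# List prices from the table above, in USD per 1M tokens.
INPUT_USD_PER_M = Decimal("0.200")
OUTPUT_USD_PER_M = Decimal("0.800")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost_cents(input_tokens: int, output_tokens: int) -> tuple[Decimal, Decimal]:
    """Return (input_cost, output_cost) for one request, in US cents."""
    # tokens * (USD per 1M tokens) / 1M gives USD; multiply by 100 for cents.
    input_cents = Decimal(input_tokens) * INPUT_USD_PER_M / TOKENS_PER_M * 100
    output_cents = Decimal(output_tokens) * OUTPUT_USD_PER_M / TOKENS_PER_M * 100
    return input_cents, output_cents


# Hypothetical request size (an assumption, not stated on the page).
in_cents, out_cents = request_cost_cents(1_000, 500)
print(f"input: {in_cents:.6f} cents / req, output: {out_cents:.6f} cents / req")
```

Cost is linear in token count, so doubling traffic doubles the bill. `Decimal` is used here rather than floats so that very small per-request amounts are not distorted by binary rounding.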
Model specifications
Quick spec sheet for LongCat Flash Chat before you dive back into pricing. Specifications are as reported for Meituan.
- Context window: 131,072 tokens
- Max output: 131,072 tokens
- Vision (images): No
- Tool / function calling: Yes
- Streaming: Yes
- Released: Sep 2025
- Primary provider: Meituan
- Model family: N/A
Compare LongCat Flash Chat
These links open full side-by-side comparison pages for LongCat Flash Chat. The pairs are models that people often evaluate together. Six comparisons are available.
Popular LongCat Flash Chat comparisons
- LongCat Flash Chat vs GPT-4o
- LongCat Flash Chat vs GPT-4o mini
- LongCat Flash Chat vs Claude Sonnet 4.6
- LongCat Flash Chat vs Gemini 2.0 Flash
- LongCat Flash Chat vs o3
- LongCat Flash Chat vs Llama 3.1 70B
Frequently asked questions
Read these after the table if you want a plain-language explanation of LongCat Flash Chat rates.