GLM 4.6 pricing
Compared with GLM-4.5, this generation brings several key improvements:
- Longer context window: the context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks.
- Superior coding performance: the model achieves higher scores on code benchmarks and performs better in real-world applications such as Claude Code, Cline, Roo Code and Kilo Code, including generating visually polished front-end pages.
- Advanced reasoning.

Below you will find 6 current price rows, with input and output prices in US dollars per million tokens. Right now the lowest input price is $0.400 and the lowest output price is $1.75.
Pricing across providers
This table shows GLM 4.6 list prices from 6 providers, in US dollars per 1M tokens. OpenRouter currently has the lowest input price. The chart below the table helps compare providers when output prices are much higher than input prices.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
| OpenRouter | $0.400 | $1.75 | — | — |
| Baseten | $0.600 | $2.20 | — | — |
| Novita | $0.550 | $2.20 | $0.110 | — |
| Vercel AI Gateway | $0.450 | $1.80 | $0.110 | — |
| Together AI | $0.600 | $2.20 | — | — |
| Z Ainative | $0.600 | $2.20 | $0.110 | — |
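
As a rough sketch of how the table can be read programmatically, the snippet below copies the rows into a dictionary and looks up the cheapest provider for a given price column. The names `PRICES`, `cheapest`, and `output_to_input_ratio` are illustrative assumptions, not part of any provider API, and the figures are only as current as the table above.

```python
# List prices in USD per 1M tokens, copied by hand from the table above.
# None marks a cell the table leaves empty.
PRICES = {
    "OpenRouter": {"input": 0.400, "output": 1.75, "cached_input": None},
    "Baseten": {"input": 0.600, "output": 2.20, "cached_input": None},
    "Novita": {"input": 0.550, "output": 2.20, "cached_input": 0.110},
    "Vercel AI Gateway": {"input": 0.450, "output": 1.80, "cached_input": 0.110},
    "Together AI": {"input": 0.600, "output": 2.20, "cached_input": None},
    "Z Ainative": {"input": 0.600, "output": 2.20, "cached_input": 0.110},
}


def cheapest(kind):
    """Return (provider, price) with the lowest listed price for `kind`."""
    listed = {
        name: row[kind] for name, row in PRICES.items() if row[kind] is not None
    }
    name = min(listed, key=listed.get)
    return name, listed[name]


def output_to_input_ratio(provider):
    """How many times more an output token costs than an input token."""
    row = PRICES[provider]
    return row["output"] / row["input"]


print(cheapest("input"))
print(cheapest("output"))
print(output_to_input_ratio("OpenRouter"))
```

With the rows as listed, both lookups return OpenRouter, and its output tokens cost a little over four times as much as its input tokens.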
Input vs output · per provider
Cost calculator
Use this block to stress test GLM 4.6 cost without a spreadsheet. All estimates come from public list rates in this page.
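
The same estimate can be reproduced by hand. The sketch below assumes the usual per-token billing model (tokens times the per-1M list price, divided by one million); the function name `request_cost`, the workload of 1,000 input and 500 output tokens, and the volume of 100,000 requests are hypothetical values chosen for illustration.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def request_cost(input_tokens, output_tokens, input_price, output_price):
    """Cost in USD of one request, given list prices in USD per 1M tokens."""
    # Prices go through str() so that 0.400 becomes exactly Decimal("0.4").
    input_cost = Decimal(input_tokens) * Decimal(str(input_price)) / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * Decimal(str(output_price)) / TOKENS_PER_MILLION
    return input_cost + output_cost


# Hypothetical workload: 1,000 input tokens and 500 output tokens per request,
# priced at the lowest rates on this page ($0.400 input, $1.75 output).
per_request = request_cost(1_000, 500, 0.400, 1.75)

# Hypothetical volume: 100,000 requests.
per_batch = per_request * 100_000

print(f"per request: ${per_request}")
print(f"per 100,000 requests: ${per_batch}")
```

Under these assumptions a request costs $0.001275, or about $127.50 per 100,000 requests. Cached-input and batch discounts are not modeled here.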
Model specifications
These fields describe GLM 4.6 as we store it (source: Z Ai). They sit next to price so buyers can check limits and tools in one place.
- Context window: 202,800 tokens
- Max output: 131,000 tokens
- Vision (images): No
- Tool / function calling: Yes
- Streaming: No
- Released: Sep 2025
- Primary provider: Z Ai
- Model family: N/A
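
One way to use these limits when sizing a request is sketched below. It assumes that prompt and output tokens share the same context window, which is common but is not stated on this page; `output_budget` is a hypothetical helper name, and individual providers may enforce lower limits.

```python
CONTEXT_WINDOW = 202_800  # tokens, from the specifications above
MAX_OUTPUT = 131_000      # tokens, from the specifications above


def output_budget(prompt_tokens):
    """Largest output that still fits, assuming prompt and output share the window."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Never exceed the max-output limit, and never go below zero.
    return max(0, min(MAX_OUTPUT, remaining))


for prompt in (50_000, 150_000, 250_000):
    print(prompt, output_budget(prompt))
```

Under this assumption a 50,000-token prompt leaves the full 131,000-token output limit available, a 150,000-token prompt leaves 52,800 tokens, and a 250,000-token prompt does not fit at all.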
Compare GLM 4.6
These links open full side-by-side comparison pages for GLM 4.6. We picked pairs that people often compare when shopping. 6 comparisons are available.
- GLM 4.6 vs Glm 4 6:exacto
  Compare pricing side by side.
- GLM 4.6 vs Glm 4 5 Flash
  Compare pricing side by side.
- GLM 4.6 vs Glm 4 32b 0414 128k
  Glm 4 32b 0414 128k is 95% cheaper on output.
- GLM 4.6 vs Glm 4 5 Airx
  GLM 4.6 is 51% cheaper on output.
- GLM 4.6 vs GLM 4.5 Air
  GLM 4.5 Air is 50% cheaper on output.
- GLM 4.6 vs Glm 5 Code
  GLM 4.6 is 55% cheaper on output.
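
The "cheaper on output" figures are presumably relative differences between two output prices. The page does not state the formula, so the sketch below assumes the conventional one, `(1 - price / other_price) * 100`. Because the other models' prices are not listed here, the example uses two output prices from the provider table on this page; `percent_cheaper` is a hypothetical helper name.

```python
def percent_cheaper(price, other_price):
    """How much cheaper `price` is than `other_price`, as a percentage."""
    if other_price <= 0:
        raise ValueError("other_price must be positive")
    return (1 - price / other_price) * 100


# Two output prices from the provider table on this page:
# $1.75 versus $2.20 per 1M output tokens.
saving = percent_cheaper(1.75, 2.20)
print(f"{saving:.1f}% cheaper on output")
```

Here the lower price works out to roughly 20% cheaper on output. A negative result would mean the first price is the more expensive one.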
Frequently asked questions
Quick answers to frequently asked questions about GLM 4.6 pricing and limits.
Also from Z Ai
Other models by Z Ai with live pricing in our catalog.