Z AiTool use

GLM 4.6 pricing

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code、Cline、Roo Code and Kilo Code, including improvements in generating visually polished front-end pages. Advanced reasoning. Below you will find 6 current rows with input and output dollars per million. Right now the lowest input is $0.400 and the lowest output is $1.75.

203K context·6 providers·save up to 33% vs native via Openrouter·verified May 2, 2026
Best input$0.400per 1M tokens · Openrouter
Best output$1.75per 1M tokens · Openrouter

Pricing across providers

Use this table to read GLM 4.6 list prices. We show 6 sources right now. Lowest input in the grid: Openrouter. The chart below the table helps when output prices are much higher than input prices.

O
Openrouter
Input / 1M
$0.400
Output / 1M
$1.75
B
Baseten
Input / 1M
$0.600
Output / 1M
$2.20
N
Novita
Input / 1M
$0.550
Output / 1M
$2.20
Cached in: $0.110
VA
Vercel Ai Gateway
Input / 1M
$0.450
Output / 1M
$1.80
Cached in: $0.110
TA
Together AI
Input / 1M
$0.600
Output / 1M
$2.20
ZA
Z Ainative
Input / 1M
$0.600
Output / 1M
$2.20
Cached in: $0.110

Input vs output · per provider

Cost calculator

Use this block to stress test GLM 4.6 cost without a spreadsheet. All estimates come from public list rates in this page.

Provider

In: $0.400/M·Out: $1.75/M

0.040000¢ / req

0.087500¢ / req

Daily
$13
Monthly
$383
Annual
$4.7K
25% cheaper than the native API at this usage level

Model specifications

These fields describe GLM 4.6 as we store it (source: Z Ai). They sit next to price so buyers can check limits and tools in one place.

Context window
202,800 tokens
Max output
131,000 tokens
Vision (images)
No
Tool / function calling
Yes
Streaming
No
Released
Sep 2025
Primary provider
Z Ai
Model family
N/A

Compare GLM 4.6

These links open full side by side pages for GLM 4.6. We picked pairs that people often shop together. 6 ready to open.

Frequently asked questions

Quick frequently asked items for GLM 4.6 pricing and limits. The short model note from our index: Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Supe...

GLM 4.6 costs $0.60 per million input tokens and $2.20 per million output tokens via the native API. Prompt caching reduces input costs to $0.11/M tokens.

Also from Z Ai

Other models by Z Ai with live pricing in our catalog.