GLM 4.6V pricing
GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts... This page tracks 2 listings in total. Highlighted lows are $0.300 per million input and $0.900 per million output (see table for which seller matches each).
Pricing across providers
Use this table to read GLM 4.6V list prices. We show 2 sources right now. Lowest input in the grid: Openrouter. The chart below the table helps when output prices are much higher than input prices.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O Openrouter | $0.300 | $0.900 | — | — |
N Novita | $0.300 | $0.900 | $0.055 | — |
Input vs output · per provider
Cost calculator
Pick any of the providers above and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for GLM 4.6V.
Provider
0.030000¢ / req
0.045000¢ / req
Model specifications
Quick spec sheet for GLM 4.6V before you dive back into pricing. Reported under Z Ai.
- Context window
- 131,072 tokens
- Max output
- 32,768 tokens
- Vision (images)
- Yes
- Tool / function calling
- Yes
- Streaming
- No
- Released
- Dec 2025
- Primary provider
- Z Ai
- Model family
- N/A
Compare GLM 4.6V
These links open full side by side pages for GLM 4.6V. We picked pairs that people often shop together. 6 ready to open.
Locked
Compare with
Pick a model on both sides.
Popular GLM 4.6V comparisons
- GLM 4.6V vs Glm 4 5 Flash
Compare pricing side by side
- GLM 4.6V vs Glm 4 5 Airx
Compare pricing side by side
- GLM 4.6V vs Glm 4 32b 0414 128k
Compare pricing side by side
- GLM 4.6V vs GLM 4.5 Air
Compare pricing side by side
- GLM 4.6V vs Glm 4 5
Compare pricing side by side
- GLM 4.6V vs Glm 5 Code
Compare pricing side by side
Frequently asked questions
Read these after the table if you want plain language around GLM 4.6V rates. The short model note from our index: GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts...
Also from Z Ai
Other models by Z Ai with live pricing in our catalog.