GLM 4.7 Flash pricing
As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards. Live index: 1 priced offer. Best input $0.070 per million tokens from Openrouter. Best output $0.400 per million tokens from Openrouter.
Pricing across providers
Every row is a seller of GLM 4.7 Flash with token pricing we track. The cheapest input in this snapshot is from Openrouter. The bar chart shows the same input and output dollars per million for a quick scan.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O Openrouter | $0.070 | $0.400 | $0.0000 | — |
Input vs output · 1M tokens
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how GLM 4.7 Flash cost scales with traffic.
0.007000¢ / req
0.020000¢ / req
Model specifications
Context length, caps, and capability flags for GLM 4.7 Flash. Values follow the main provider (Z Ai) record in our index.
- Context window
- 200,000 tokens
- Max output
- 32,000 tokens
- Vision (images)
- Yes
- Tool / function calling
- Yes
- Streaming
- No
- Released
- Jan 2026
- Primary provider
- Z Ai
- Model family
- N/A
Compare GLM 4.7 Flash
Open a pair page to see GLM 4.7 Flash next to another model with a shared provider matrix. 6 shortcuts below.
- GLM 4.7 Flash vs Glm 4 5 Flash
Compare pricing side by side
- GLM 4.7 Flash vs Glm 4 5 Airx
Compare pricing side by side
- GLM 4.7 Flash vs Glm 4 32b 0414 128k
Compare pricing side by side
- GLM 4.7 Flash vs GLM 4.5 Air
Compare pricing side by side
- GLM 4.7 Flash vs Glm 4 5
Compare pricing side by side
- GLM 4.7 Flash vs Glm 5 Code
Compare pricing side by side
Frequently asked questions
Answers pull from the same numbers you see on this page. The short model note from our index: As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, ...
Also from Z Ai
Other models by Z Ai with live pricing in our catalog.