Zhipu AI API Pricing
Zhipu AI (Z.ai) is a Beijing-based lab spun out of Tsinghua University, building the open-weight GLM series. The GLM-4.5, 4.6, 4.7 and GLM-5 models are strong agentic and coding performers with vision (GLM-V) variants, popular for self-hosting at a fraction of frontier API prices.
Models
12 tracked
All tiers, latest pricing.
All Zhipu AI Models
| Model | Tier | Input / 1M | Output / 1M | Context |
|---|---|---|---|---|
| GLM 4.7 Flash | fast | $0.060 | $0.400 | 203K |
| GLM 4.5 Air | fast | $0.130 | $0.850 | 131K |
| GLM 4.6V | fast | $0.300 | $0.900 | 131K |
| GLM 4.7 | balanced | $0.400 | $1.75 | 203K |
| GLM 4.6 | balanced | $0.430 | $1.74 | 203K |
| GLM 5 | balanced | $0.600 | $1.92 | 203K |
| GLM 4.5V | balanced | $0.600 | $1.80 | 66K |
| GLM 4.5 | balanced | $0.600 | $2.20 | 131K |
| GLM 5.2 | balanced | $0.930 | $3.00 | 1M |
| GLM 5.1 | balanced | $0.975 | $4.30 | 203K |
| GLM 5V Turbo | balanced | $1.20 | $4.00 | 203K |
| GLM 5 Turbo | balanced | $1.20 | $4.00 | 262K |
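The table can also be read as a small dataset: filter by the context window a workload needs, then rank what remains by input price. The sketch below is illustrative only. The `Model` class and `models_fitting` helper are hypothetical names, only a subset of rows is copied in, and context sizes are taken as the rounded "K" figures shown above (with 1M treated as 1000K), not exact token limits.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    input_per_m: float   # USD per 1M input tokens, from the table
    output_per_m: float  # USD per 1M output tokens, from the table
    context_k: int       # context window in thousands of tokens (rounded, as listed)


# A subset of the rows above, copied by hand (assumption: 1M context == 1000K).
MODELS = [
    Model("GLM 4.7 Flash", 0.060, 0.400, 203),
    Model("GLM 4.5 Air", 0.130, 0.850, 131),
    Model("GLM 4.6V", 0.300, 0.900, 131),
    Model("GLM 4.7", 0.400, 1.75, 203),
    Model("GLM 4.5V", 0.600, 1.80, 66),
    Model("GLM 5.2", 0.930, 3.00, 1000),
    Model("GLM 5 Turbo", 1.20, 4.00, 262),
]


def models_fitting(min_context_k: int) -> list[Model]:
    """Return models whose listed context is at least min_context_k, cheapest input first."""
    fits = [m for m in MODELS if m.context_k >= min_context_k]
    return sorted(fits, key=lambda m: m.input_per_m)


# Example: which listed models can hold a prompt of roughly 250K tokens?
names = [m.name for m in models_fitting(250)]
print(names)
```

Sorting on input price alone is a simplification; output-heavy workloads may rank differently once the output rate is taken into account.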
Model Details
GLM 4.7 Flash
$0.060 / 1M input tokens
As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency.
GLM 4.5 Air
$0.130 / 1M input tokens
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications.
GLM 4.6V
$0.300 / 1M input tokens
GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media.
GLM 4.7
$0.400 / 1M input tokens
GLM-4.7 is Z.ai’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning and execution.
GLM 4.6
$0.430 / 1M input tokens
Compared with GLM-4.5, this generation brings several key improvements. Longer context window: the context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex…
GLM 5
$0.600 / 1M input tokens
GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows.
GLM 4.5V
$0.600 / 1M input tokens
GLM-4.5V is a vision-language foundation model for multimodal agent applications.
GLM 4.5
$0.600 / 1M input tokens
GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128K tokens.
GLM 5.2
$0.930 / 1M input tokens
GLM 5.2 is a balanced model from Zhipu AI that trades a little peak capability for much lower cost and faster responses. It costs $0.930 per million input tokens and has a 1M-token context window.
GLM 5.1
$0.975 / 1M input tokens
GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks.
GLM 5V Turbo
$1.20 / 1M input tokens
GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks.
GLM 5 Turbo
$1.20 / 1M input tokens
GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios.
Calculate Zhipu AI API Costs
Use the TokenRate calculator to estimate what Zhipu AI models will cost for your workload.
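For a rough sense of the arithmetic behind such an estimate, per-token pricing reduces to tokens divided by one million, multiplied by the listed rate, summed over input and output. The following is a minimal sketch, not the TokenRate calculator's actual logic: `estimate_cost` is a hypothetical helper, the workload figures are made up for illustration, and the rates are the GLM 4.7 prices listed on this page.

```python
def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
) -> float:
    """Estimate cost in USD from token counts and per-1M-token prices."""
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost + output_cost


# Hypothetical monthly workload: 50M input tokens and 10M output tokens on GLM 4.7
# ($0.400 per 1M input, $1.75 per 1M output, as listed on this page).
cost = estimate_cost(50_000_000, 10_000_000, 0.400, 1.75)
print(f"${cost:.2f}")  # 50 * 0.400 + 10 * 1.75 = $37.50
```

The sketch ignores anything the listing does not state, such as cached-input discounts or tiered pricing, so treat the result as a ballpark figure.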