TokenRate

Zhipu AI API Pricing

Zhipu AI (Z.ai) is a Beijing-based lab spun out of Tsinghua University, building the open-weight GLM series. The GLM-4.5, 4.6, 4.7 and GLM-5 models are strong agentic and coding performers with vision (GLM-V) variants, popular for self-hosting at a fraction of frontier API prices.

Official site: z.ai

Cheapest: GLM 4.7 Flash, $0.060 per 1M input tokens

Flagship: GLM 5 Turbo, $1.20 per 1M input tokens

Models: 12 tracked (all tiers, latest pricing)

All Zhipu AI Models

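| Model | Tier | Input (per 1M tokens) | Output (per 1M tokens) | Context (tokens) |
|---|---|---|---|---|
| GLM 4.7 Flash | fast | $0.060 | $0.400 | 203K |
| GLM 4.5 Air | fast | $0.130 | $0.850 | 131K |
| GLM 4.6V | fast | $0.300 | $0.900 | 131K |
| GLM 4.7 | balanced | $0.400 | $1.75 | 203K |
| GLM 4.6 | balanced | $0.430 | $1.74 | 203K |
| GLM 5 | balanced | $0.600 | $1.92 | 203K |
| GLM 4.5V | balanced | $0.600 | $1.80 | 66K |
| GLM 4.5 | balanced | $0.600 | $2.20 | 131K |
| GLM 5.2 | balanced | $0.930 | $3.00 | 1M |
| GLM 5.1 | balanced | $0.975 | $4.30 | 203K |
| GLM 5V Turbo | balanced | $1.20 | $4.00 | 203K |
| GLM 5 Turbo | balanced | $1.20 | $4.00 | 262K |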
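
As a rough illustration of how the table can be read, the sketch below picks the cheapest listed model (by input price) whose context window fits a given prompt. The prices and context sizes are copied from the rows above. The `MODELS` list and the `cheapest_fitting` helper are hypothetical names for this example, not part of any TokenRate or Z.ai API, and it assumes "203K" means 203,000 tokens and "1M" means 1,000,000 tokens.

```python
# (name, input $/1M, output $/1M, context tokens), copied from the table above.
# Assumption: "K" = 1,000 tokens and "M" = 1,000,000 tokens.
MODELS = [
    ("GLM 4.7 Flash", 0.060, 0.400, 203_000),
    ("GLM 4.5 Air", 0.130, 0.850, 131_000),
    ("GLM 4.6V", 0.300, 0.900, 131_000),
    ("GLM 4.7", 0.400, 1.75, 203_000),
    ("GLM 4.6", 0.430, 1.74, 203_000),
    ("GLM 5", 0.600, 1.92, 203_000),
    ("GLM 4.5V", 0.600, 1.80, 66_000),
    ("GLM 4.5", 0.600, 2.20, 131_000),
    ("GLM 5.2", 0.930, 3.00, 1_000_000),
    ("GLM 5.1", 0.975, 4.30, 203_000),
    ("GLM 5V Turbo", 1.20, 4.00, 203_000),
    ("GLM 5 Turbo", 1.20, 4.00, 262_000),
]


def cheapest_fitting(prompt_tokens):
    """Return the name of the lowest input-price model whose context
    window is at least prompt_tokens, or None if no listed model fits."""
    fitting = [m for m in MODELS if m[3] >= prompt_tokens]
    if not fitting:
        return None
    return min(fitting, key=lambda m: m[1])[0]


print(cheapest_fitting(150_000))
print(cheapest_fitting(500_000))
```

This ranks on input price only. A workload with long outputs would need the output price factored in as well.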

Model Details

GLM 4.7 Flash

$0.060 in

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency.

GLM 4.5 Air

$0.130 in

GLM-4.5-Air is the lightweight variant of Z.ai's GLM-4.5 flagship model family, also purpose-built for agent-centric applications.

GLM 4.6V

$0.300 in

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media.

GLM 4.7

$0.400 in

GLM-4.7 is Z.ai’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution.

GLM 4.6

$0.430 in

Compared with GLM-4.5, this generation brings several key improvements, including a longer context window, expanded from 128K to 200K tokens.

GLM 5

$0.600 in

GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows.

GLM 4.5V

$0.600 in

GLM-4.5V is a vision-language foundation model for multimodal agent applications.

GLM 4.5

$0.600 in

GLM-4.5 is a Z.ai flagship foundation model, purpose-built for agent-based applications. It uses a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128K tokens.

GLM 5.2

$0.930 in

GLM 5.2 is a balanced-tier Zhipu AI model that trades a little peak capability for much lower cost and faster responses. It costs $0.930 per million input tokens and has a 1M-token context window.

GLM 5.1

$0.975 in

GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks.

GLM 5V Turbo

$1.20 in

GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks.

GLM 5 Turbo

$1.20 in

GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios.

Calculate Zhipu AI API Costs

Use the TokenRate calculator to estimate exactly what Zhipu AI models will cost for your workload.
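
For a quick estimate by hand, the arithmetic is: tokens times the per-1M price, divided by one million, summed over input and output. The sketch below applies it to three models using the prices listed on this page. The `PRICES` dictionary, the `estimate_cost` function, and the workload figures are hypothetical, chosen only for illustration. It assumes flat per-token billing with no caching discounts or other adjustments, which may not match actual provider invoices.

```python
# USD per 1M tokens as (input, output), copied from the table on this page.
PRICES = {
    "GLM 4.7 Flash": (0.060, 0.400),
    "GLM 4.7": (0.400, 1.75),
    "GLM 5 Turbo": (1.20, 4.00),
}


def estimate_cost(model, input_tokens, output_tokens):
    """Estimated USD cost for the given token counts at the listed rates.
    Assumes flat per-token pricing (no caching or volume discounts)."""
    price_in, price_out = PRICES[model]
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000


# Hypothetical workload: 10,000 requests, each 2,000 input and 500 output tokens.
requests = 10_000
for model in PRICES:
    total = estimate_cost(model, requests * 2_000, requests * 500)
    print(f"{model}: ${total:.2f}")
```

At the listed rates this workload comes to roughly $3.20 on GLM 4.7 Flash, $16.75 on GLM 4.7, and $44.00 on GLM 5 Turbo, so the choice of tier changes the bill by more than an order of magnitude.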
