TokenRate

GLM 4.6 Pricing

Balanced

Zhipu AI · 203K tokens context

GLM 4.6 from Zhipu AI costs $0.430 per 1 million input tokens and $1.74 per 1 million output tokens as of July 2026 (live OpenRouter data). The model supports a 202,752-token context window (approximately 152,064 words) with a 131K-token maximum output. A typical 1,000-token request costs $0.0004 in input charges; a 10,000-token request costs $0.0043.

GLM 4.6 pricing and capability summary
Input price$0.430 / 1M tokens
Output price$1.74 / 1M tokens
Output / input ratio4.0×
Context window202,752 tokens (~152,064 words)
Maximum output131,072 tokens
Cost per 1K tokens (input)$0.0004
TierBalanced
Last verified

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex.

Live pricing from OpenRouter

Input Price

$0.430

per 1 million tokens

Output Price

$1.74

per 1 million tokens

Context Window

203K tokens

max 131K output

Cost Examples

Request TypeTokensInput CostOutput Cost
1,000 word article1,333$0.000573$0.000696
10-page document (2,500 words)3,333$0.00143$0.00174
1,000 lines of code5,000$0.00215$0.00261
100K token document100,000$0.043$0.0522

Output cost estimated at 30% of input token count. Use the calculator for exact figures.

Strengths

  • Affordable at $0.43/1M input tokens
  • Large 203K-token context window
  • Strong general-purpose performance

Limitations

  • Quality and availability can vary by hosting provider
  • Quality and availability can vary by hosting provider

Best Use Cases

Customer-facing AI apps
Code generation and review
Content creation at scale
Conversational interfaces

Calculate GLM 4.6 Costs

Use the TokenRate calculator to convert any budget, token count, or text into exact GLM 4.6 costs — and compare across all models.

Open Calculator →

GLM 4.6 — FAQ

Related Models

Related Guides