Z.ai: GLM 4.6

z-ai · Released Sep 30, 2025 · Specialist
Intelligence #90 / 561
61.8 Our Score
Speed #202 / 260
53.9 tokens / sec
Input #360 / 561
$0.430 per 1M tokens
Output #360 / 561
$1.74 per 1M tokens
Context #179 / 561
202,752 tokens

Analysis Summary

Z.ai: GLM 4.6 sits in the Specialist tier on our leaderboard, ranked #90 of 561 published models on overall intelligence. At $0.43 input and $1.74 output per 1M tokens, it is moderately priced, ranking #360 of 561 on both input and output cost. Its 202,752-token context window suits extended reasoning and code review, and it supports tool use and function calling.

Editorial notes

GLM 4.6 from Z.ai posts a strong Agentic Index of 47.7 and a solid Math Index of 86, with tool use, at moderate pricing. A regional accessibility penalty applies, and it lacks vision support.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch.

Performance Profile

Intelligence 6/10
Technical 5.9/10
Content 5/10
Value 7.5/10

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex..

Capabilities

Tool Use, Function Calling

Performance Indices

Source: Artificial Analysis

32.5 Intelligence Index
29.5 Coding Index
47.7 Agentic Index
86 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 78% Graduate-level scientific reasoning
HLE 13.3% Humanity's Last Exam
MMLU Pro 82.9% Multi-task language understanding
AIME 2025 86% Competition mathematics (2025)
SciCode 38.4% Scientific computing

Technical

LiveCodeBench 69.5% Live coding evaluation
TerminalBench Hard 25% Agentic terminal tasks
τ²-Bench 70.5% Conversational agent benchmark

Content

IFBench 43.4% Instruction following
LCR 54.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID z-ai/glm-4.6
Provider z-ai
Release Date September 30, 2025
Context Length 202,752 tokens
Max Completion 131,072 tokens
Status Active
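
The two token limits above interact when sizing a request. The sketch below is illustrative only: it assumes the prompt and the completion share the 202,752-token context window, which is a common convention but is not stated on this page, and the function name `max_output_tokens` is hypothetical.

```python
# Limits copied from the Model Information rows above.
CONTEXT_LENGTH = 202_752   # total context window, in tokens
MAX_COMPLETION = 131_072   # largest allowed completion, in tokens


def max_output_tokens(prompt_tokens: int) -> int:
    """Return the largest completion that still fits for a given prompt size.

    Assumption: prompt and completion share one context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_LENGTH - prompt_tokens
    # The completion is capped by both the remaining window and the hard limit.
    return max(0, min(MAX_COMPLETION, remaining))


if __name__ == "__main__":
    for prompt in (50_000, 150_000, 300_000):
        print(prompt, "->", max_output_tokens(prompt))
```

Under that assumption, a short prompt is bounded by the completion cap, while a prompt above roughly 71,680 tokens is bounded by the space left in the window.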

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.43 $0.000430
Output $1.74 $0.001740
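
As a rough illustration of how the per-token prices translate into a per-request cost, the sketch below multiplies token counts by the listed rates. The function name `request_cost_usd` and the example token counts are hypothetical, and the calculation ignores any provider-specific fees, caching discounts, or price differences between endpoints.

```python
# Prices copied from the pricing table above (USD per 1M tokens).
INPUT_USD_PER_M = 0.43
OUTPUT_USD_PER_M = 1.74


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of one request from its token counts."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 20,000 prompt tokens and 2,000 completion tokens.
    cost = request_cost_usd(20_000, 2_000)
    print(f"${cost:.5f}")
```

For the hypothetical request above, the input side contributes $0.00860 and the output side $0.00348, for about $0.01208 in total.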

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

Avg Uptime 97.9%
Best Latency (TTFT) 596ms
Best Throughput 35 tok/s
Active Endpoints 4/5
Available via: DeepInfra, Novita, Z.AI, AtlasCloud, Venice
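
The live metrics can be combined into a back-of-the-envelope response-time estimate: time to first token plus generation time at the listed throughput. This is a sketch under loose assumptions. The best latency and best throughput may come from different endpoints, real throughput varies with load, and the function name `estimated_response_seconds` is hypothetical.

```python
# Figures copied from the Live Performance metrics above.
TTFT_SECONDS = 0.596          # best time to first token (596 ms)
THROUGHPUT_TOK_PER_S = 35.0   # best throughput, in tokens per second


def estimated_response_seconds(output_tokens: int) -> float:
    """Estimate wall-clock time to stream a completion of the given length.

    Assumption: a single endpoint delivers both the best TTFT and the
    best throughput, so the result is an optimistic estimate.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / THROUGHPUT_TOK_PER_S


if __name__ == "__main__":
    # Hypothetical 700-token completion.
    print(f"{estimated_response_seconds(700):.3f} s")
```

At these figures, generation time dominates: a 700-token completion spends about 20 seconds streaming and well under a second waiting for the first token.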

Leaderboard Categories