Home > AI Models > Z.ai: GLM 4.6

Z.ai: GLM 4.6

Name: Z.ai: GLM 4.6 Review
Item: Z.ai: GLM 4.6
Rating: 6.2
Author: Design for Online

Z.ai: GLM 4.6

z-ai · Released Sep 30, 2025 Specialist

Intelligence #90 / 561

61.8 Our Score

Speed #202 / 260

53.9 tokens / sec

Input #360 / 561

$0.430 per 1M tokens

Output #360 / 561

$1.74 per 1M tokens

Context #179 / 561

202,752 tokens

Z.ai: GLM 4.6 sits in the Specialist tier on our leaderboard, ranked #90 of 561 published models on overall intelligence. At $0.430 input and $1.74 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use and function calling.

Editorial notes

GLM 4.6 from Z.ai posts a strong agentic index of 47.7 and solid math performance with tool use at moderate pricing, though a regional accessibility penalty applies and it lacks vision support.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: No
Input
Output
Context: 202,752 tokens
Max output: 131,072 tokens
Tokenizer: Other
Released: Sep 30, 2025

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex..

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

32.5 Intelligence Index

29.5 Coding Index

47.7 Agentic Index

86 Math Index

Benchmark Scores

GPQA Diamond 78% Graduate-level scientific reasoning

HLE 13.3% Humanity's Last Exam

MMLU Pro 82.9% Multi-task language understanding

AIME 2025 86% Competition mathematics (2025)

SciCode 38.4% Scientific computing

LiveCodeBench 69.5% Live coding evaluation

TerminalBench Hard 25% Agentic terminal tasks

τ²-Bench 70.5% Conversational agent benchmark

IFBench 43.4% Instruction following

LCR 54.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Z.ai: GLM 4.6 stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID	`z-ai/glm-4.6`
Provider	z-ai
Release Date	September 30, 2025
Context Length	202,752 tokens
Max Completion	131,072 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.43	$0.000430
Output	$1.74	$0.001740

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

97.9%

Avg Uptime

596ms

Best Latency (TTFT)

35 tok/s

Best Throughput

4/5

Active Endpoints

Available via: DeepInfra, Novita, Z.AI, AtlasCloud, Venice

Leaderboard Categories

AI Agents Tool Use

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

Z.ai: GLM 4.6

Z.ai: GLM 4.6

Analysis Summary

Performance Profile

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Z.ai: GLM 4.6

Performance Profile

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Explore Related Models