Home > AI Models > Z.ai: GLM 4.5

Z.ai: GLM 4.5

Name: Z.ai: GLM 4.5 Review
Item: Z.ai: GLM 4.5
Rating: 5.6
Author: Design for Online

Z.ai: GLM 4.5

z-ai · Released Jul 25, 2025 Specialist

Intelligence #118 / 556

55.5 Our Score

Speed #207 / 257

50.9 tokens / sec

Input #381 / 557

$0.600 per 1M tokens

Output #389 / 557

$2.20 per 1M tokens

Context #220 / 557

131,072 tokens

Z.ai: GLM 4.5 sits in the Specialist tier on our leaderboard, ranked #118 of 556 published models on overall intelligence. At $0.600 input and $2.20 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports tool use, function calling, and reasoning.

Editorial notes

GLM 4.5 from Z.ai shows competitive coding and math scores with tool use support at moderate pricing, but sits in the mid-tier on intelligence and agentic capability; -4 penalty applied for provider accessibility.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: Yes
Input
Output
Context: 131,072 tokens
Max output: 98,304 tokens
Tokenizer: Other
Released: Jul 25, 2025

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly..

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

26.4 Intelligence Index

26.3 Coding Index

32.5 Agentic Index

73.7 Math Index

Benchmark Scores

GPQA Diamond 78.2% Graduate-level scientific reasoning

HLE 12.2% Humanity's Last Exam

MMLU Pro 83.5% Multi-task language understanding

MATH 500 97.9% Mathematical problem-solving

AIME 87.3% Competition mathematics

AIME 2025 73.7% Competition mathematics (2025)

SciCode 34.8% Scientific computing

LiveCodeBench 73.8% Live coding evaluation

TerminalBench Hard 22% Agentic terminal tasks

τ²-Bench 43% Conversational agent benchmark

IFBench 44.1% Instruction following

LCR 48.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Z.ai: GLM 4.5 stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID	`z-ai/glm-4.5`
Provider	z-ai
Release Date	July 25, 2025
Context Length	131,072 tokens
Max Completion	98,304 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.60	$0.000600
Output	$2.20	$0.002200

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

98.4%

Avg Uptime

1,068ms

Best Latency (TTFT)

35 tok/s

Best Throughput

2/2

Active Endpoints

Available via: Novita, Z.AI

Leaderboard Categories

Coding Tool Use

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

Z.ai: GLM 4.5

Z.ai: GLM 4.5

Analysis Summary

Performance Profile

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Z.ai: GLM 4.5

Performance Profile

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Explore Related Models