Z.ai: GLM 4.5 Air

Z.ai: GLM 4.5 Air

z-ai · Released Jul 25, 2025 Specialist
Intelligence #118 / 544
55.0 Our Score
Speed #138 / 252
89.0 tokens / sec
Input #217 / 544
$0.130 per 1M tokens
Output #290 / 544
$0.850 per 1M tokens
Context #202 / 544
131,072 tokens

Analysis Summary

Z.ai: GLM 4.5 Air sits in the Specialist tier on our leaderboard, ranked #118 of 544 published models on overall intelligence. At $0.130 input and $0.850 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports tool use, function calling, and reasoning.

Editorial notes

GLM 4.5 Air delivers a 23.2 intelligence index with strong math (80.7) and reasonable agentic scores (33.5), tool use, and competitive pricing at $0.13/$0.85; a capable lightweight option though provider adoption outside its home market is limited.

Assessed May 5, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence4.6Technical4.7Value7.8Content5
Intelligence 4.6/10
Technical 4.7/10
Content 5/10
Value 7.8/10

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter..

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

23.2 Intelligence Index
23.8 Coding Index
33.5 Agentic Index
80.7 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 73.3% Graduate-level scientific reasoning
HLE 6.8% Humanity's Last Exam
MMLU Pro 81.5% Multi-task language understanding
MATH 500 96.5% Mathematical problem-solving
AIME 67.3% Competition mathematics
AIME 2025 80.7% Competition mathematics (2025)
SciCode 30.6% Scientific computing

Technical

LiveCodeBench 68.4% Live coding evaluation
TerminalBench Hard 20.5% Agentic terminal tasks
τ²-Bench 46.5% Conversational agent benchmark

Content

IFBench 37.6% Instruction following
LCR 43.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Z.ai: GLM 4.5 Air stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID z-ai/glm-4.5-air
Providerz-ai
Release Date July 25, 2025
Context Length131,072 tokens
Max Completion98,304 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.13 $0.000130
Output $0.85 $0.000850

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.8%
Avg Uptime
765ms
Best Latency (TTFT)
46 tok/s
Best Throughput
3/3
Active Endpoints
Available via: Novita, SiliconFlow, Z.AI

Leaderboard Categories