Z.ai: GLM 4.5 Air (free)

Z.ai: GLM 4.5 Air (free)

z-ai · Released Jul 25, 2025
30
Our Score

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the reasoning enabled boolean. Learn more in our docs

131,072 tokens Context Window
96,000 tokens Max Output

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerOther

Model Information

OpenRouter ID z-ai/glm-4.5-air:free
Providerz-ai
Release Date July 25, 2025
Context Length131,072 tokens
Max Completion96,000 tokens
Status Active

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99%
Avg Uptime
8,112ms
Best Latency (TTFT)
15 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Z.AI

Leaderboard Categories