Z.ai: GLM 4.6 (exacto)

Z.ai: GLM 4.6 (exacto)

z-ai · Released Sep 30, 2025 Legacy
Intelligence #477 / 579
23.8 Our Score
Speed
— Not reported
Input #372 / 579
$0.440 per 1M tokens
Output #376 / 579
$1.76 per 1M tokens
Context #189 / 579
204,800 tokens

Analysis Summary

GLM 4.6 (exacto) is a variant of Z.ai's GLM 4.6 model, listed with tool use and function calling support and a 204K context window. No independent benchmark data is available for this specific variant, so its performance relative to the base GLM 4.6 cannot be confirmed.

Without benchmark evidence, it must be treated as an unverified model regardless of the base model's scores. Businesses should use the benchmarked GLM 4.6 if they require verified performance, or wait for this variant's own evaluation data before adopting it in production.

Pricing is nearly identical to GLM 4.6 at $0.44 input and $1.76 output. The regional penalty also applies. There is no clear reason to prefer this variant over the benchmarked base model until independent data is available.

Assessed June 6, 2026

Editorial notes

Z.ai GLM 4.6 (exacto) is a variant of GLM 4.6 with no independent benchmark data; tool use and function calling are supported but performance cannot be verified separately.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence0Technical0Value7.5Content3
Intelligence 0/10
Technical 0/10
Content 3/10
Value 7.5/10

How Z.ai: GLM 4.6 (exacto) compares

Its 205K-token context window is larger than 67% of the models we list. At $0.44 per million input tokens it is cheaper than 36% of comparable models.

About Z.ai: GLM 4.6 (exacto)

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks.
Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code态Cline态Roo Code and Kilo Code, including improvements in generating visually polished front-end pages.
Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability.
More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks.
Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.

Capabilities

Tool Use Function Calling

How does Z.ai: GLM 4.6 (exacto) stack up?

Compare side-by-side with other legacy models.

Compare Models

Model Information

OpenRouter ID z-ai/glm-4.6:exacto
Providerz-ai
Release Date September 30, 2025
Context Length204,800 tokens
Max Completion131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.44 $0.000440
Output $1.76 $0.001760

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

88.3%
Avg Uptime
659ms
Best Latency (TTFT)
34 tok/s
Best Throughput
4/5
Active Endpoints
Available via: DeepInfra, Novita, Z.AI, AtlasCloud, Venice

Frequently asked questions about Z.ai: GLM 4.6 (exacto)

How much does Z.ai: GLM 4.6 (exacto) cost?

Z.ai: GLM 4.6 (exacto) costs $0.44 per million input tokens and $1.76 per million output tokens.

What is the context window of Z.ai: GLM 4.6 (exacto)?

Z.ai: GLM 4.6 (exacto) has a context window of 204,800 tokens (205K).

What can Z.ai: GLM 4.6 (exacto) do?

Z.ai: GLM 4.6 (exacto) supports tool use and function calling.

Who created Z.ai: GLM 4.6 (exacto)?

Z.ai: GLM 4.6 (exacto) is developed by Z.ai and was released on September 30, 2025.