Z.ai: GLM 4.6 (exacto)

z-ai · Released Sep 30, 2025
Our Score: 28

Compared with GLM-4.5, this generation brings several key improvements:

- Longer context window: expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks.
- Superior coding performance: higher scores on code benchmarks and better real-world performance in applications such as Claude Code, Cline, Roo Code, and Kilo Code, including improvements in generating visually polished front-end pages.
- Advanced reasoning: a clear improvement in reasoning performance, with support for tool use during inference, leading to stronger overall capability.
- More capable agents: stronger performance in tool use and search-based agents, and more effective integration within agent frameworks.
- Refined writing: better alignment with human preferences in style and readability, and more natural performance in role-playing scenarios.

Input Price: $0.44 / 1M tokens
Output Price: $1.76 / 1M tokens
Context Window: 204,800 tokens
Max Output: 131,072 tokens
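A quick budgeting sketch using the figures above, assuming (as is typical for these models) that completion tokens share the same context window as the prompt; the `fits` helper is hypothetical, not part of any API:

```python
CONTEXT_WINDOW = 204_800   # total context window in tokens (from this page)
MAX_OUTPUT = 131_072       # max completion tokens (from this page)

def fits(prompt_tokens: int, requested_output: int) -> bool:
    """Return True if the prompt plus the requested completion fit the window."""
    if requested_output > MAX_OUTPUT:
        return False
    return prompt_tokens + requested_output <= CONTEXT_WINDOW

print(fits(180_000, 20_000))  # True:  200,000 <= 204,800
print(fits(180_000, 30_000))  # False: 210,000 >  204,800
```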

Capabilities

Tool Use, Function Calling
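Since the model supports function calling, here is a minimal sketch of a tools request built for OpenRouter's OpenAI-compatible chat completions endpoint, using the model ID listed on this page; the `get_weather` tool is a hypothetical example, and actually sending it would require an API key and an HTTP POST to the OpenRouter API:

```python
# Construct (but do not send) a function-calling request payload for this model.
payload = {
    "model": "z-ai/glm-4.6:exacto",
    "messages": [
        {"role": "user", "content": "What's the weather in Berlin?"}
    ],
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_weather",  # hypothetical example tool
                "description": "Get the current weather for a city.",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "city": {"type": "string"}
                    },
                    "required": ["city"],
                },
            },
        }
    ],
}

print(payload["model"])  # z-ai/glm-4.6:exacto
```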

Architecture

Modality: Text → Text
Tokenizer: Other

Model Information

OpenRouter ID: z-ai/glm-4.6:exacto
Provider: z-ai
Release Date: September 30, 2025
Context Length: 204,800 tokens
Max Completion: 131,072 tokens
Status: Active

Pricing

Token Type   Cost per 1M tokens   Cost per 1K tokens
Input        $0.44                $0.000440
Output       $1.76                $0.001760
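The per-request cost follows directly from the per-1M rates in the table above; a minimal sketch (the `estimate_cost` helper is hypothetical):

```python
INPUT_PER_M = 0.44    # USD per 1M input tokens (from the table above)
OUTPUT_PER_M = 1.76   # USD per 1M output tokens (from the table above)

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at this model's listed rates."""
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / 1_000_000

# e.g. a 10K-token prompt with a 2K-token completion:
print(round(estimate_cost(10_000, 2_000), 6))  # 0.00792
```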

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

Avg Uptime: 99.8%
Best Latency (TTFT): 355 ms
Best Throughput: 151 tok/s
Active Endpoints: 6/6
Available via: SiliconFlow, DeepInfra, AtlasCloud, Novita, BaseTen, Z.AI