Z.ai: GLM 4.6 (exacto)
Analysis Summary
GLM 4.6 (exacto) is a variant of Z.ai's GLM 4.6 model, listed with tool use and function calling support and a 204K context window. No independent benchmark data is available for this specific variant, so its performance relative to the base GLM 4.6 cannot be confirmed.
Without benchmark evidence, it must be treated as an unverified model regardless of the base model's scores. Businesses should use the benchmarked GLM 4.6 if they require verified performance, or wait for this variant's own evaluation data before adopting it in production.
Pricing is nearly identical to GLM 4.6 at $0.44 input and $1.76 output. The regional penalty also applies. There is no clear reason to prefer this variant over the benchmarked base model until independent data is available.
Assessed June 6, 2026
Editorial notes
Z.ai GLM 4.6 (exacto) is a variant of GLM 4.6 with no independent benchmark data; tool use and function calling are supported but performance cannot be verified separately.
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
How Z.ai: GLM 4.6 (exacto) compares
Its 205K-token context window is larger than 67% of the models we list. At $0.44 per million input tokens it is cheaper than 36% of comparable models.
About Z.ai: GLM 4.6 (exacto)
Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks.
Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude CodećClinećRoo Code and Kilo Code, including improvements in generating visually polished front-end pages.
Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability.
More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks.
Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.
Capabilities
How does Z.ai: GLM 4.6 (exacto) stack up?
Compare side-by-side with other legacy models.
Model Information
| OpenRouter ID |
z-ai/glm-4.6:exacto
|
| Provider | z-ai |
| Release Date | September 30, 2025 |
| Context Length | 204,800 tokens |
| Max Completion | 131,072 tokens |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.44 | $0.000440 |
| Output | $1.76 | $0.001760 |
Live Performance
Live endpoint metrics, refreshed every 30 minutes.
External Resources
Explore Related Models
Frequently asked questions about Z.ai: GLM 4.6 (exacto)
How much does Z.ai: GLM 4.6 (exacto) cost?
Z.ai: GLM 4.6 (exacto) costs $0.44 per million input tokens and $1.76 per million output tokens.
What is the context window of Z.ai: GLM 4.6 (exacto)?
Z.ai: GLM 4.6 (exacto) has a context window of 204,800 tokens (205K).
What can Z.ai: GLM 4.6 (exacto) do?
Z.ai: GLM 4.6 (exacto) supports tool use and function calling.
Who created Z.ai: GLM 4.6 (exacto)?
Z.ai: GLM 4.6 (exacto) is developed by Z.ai and was released on September 30, 2025.
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: June 17, 2026 9:41 am