Z.ai: GLM 4.6V

Z.ai: GLM 4.6V

z-ai · Released Dec 8, 2025 Specialist
Intelligence #135 / 557
52.2 Our Score
Speed #213 / 257
47.5 tokens / sec
Input #318 / 557
$0.300 per 1M tokens
Output #304 / 557
$0.900 per 1M tokens
Context #220 / 557
131,072 tokens

Analysis Summary

Z.ai: GLM 4.6V sits in the Specialist tier on our leaderboard, ranked #135 of 557 published models on overall intelligence. At $0.300 input and $0.900 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports tool use, function calling, and vision.

Editorial notes

GLM 4.6V from Z.ai supports vision, video, tool use, and function calling with a strong math index, but its intelligence and coding scores are low, and the -4 regional penalty applies for limited enterprise availability.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence4.7Technical3.3Value7.8Content4.5
Intelligence 4.7/10
Technical 3.3/10
Content 4.5/10
Value 7.8/10

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

23.4 Intelligence Index
19.7 Coding Index
23 Agentic Index
85.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 71.9% Graduate-level scientific reasoning
HLE 8.9% Humanity's Last Exam
MMLU Pro 79.9% Multi-task language understanding
AIME 2025 85.3% Competition mathematics (2025)
SciCode 30.4% Scientific computing

Technical

LiveCodeBench 16% Live coding evaluation
TerminalBench Hard 14.4% Agentic terminal tasks
τ²-Bench 31.6% Conversational agent benchmark

Content

IFBench 30.1% Instruction following
LCR 40.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Z.ai: GLM 4.6V stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID z-ai/glm-4.6v
Providerz-ai
Release Date December 8, 2025
Context Length131,072 tokens
Max Completion24,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.30 $0.000300
Output $0.90 $0.000900

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

6,422ms
Best Latency (TTFT)
33 tok/s
Best Throughput
0/2
Active Endpoints
Available via: Z.AI, Novita

Leaderboard Categories