Z.ai: GLM 4.6V

Z.ai: GLM 4.6V

z-ai · Released Dec 8, 2025 Efficient
42.8
Our Score

Performance Profile

Intelligence3.5Technical2.4Value7.8Content2.5
Intelligence 3.5/10
Technical 2.4/10
Content 2.5/10
Value 7.8/10

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts..

$0.30 / 1M
Input Price
$0.90 / 1M
Output Price
131,072 tokens
Context Window
131,072 tokens
Max Output

Capabilities

Tool Use Function Calling Vision

Architecture

ModalityText + Image + Video → Text
TokenizerOther

Performance Indices

Source: Artificial Analysis

23.4 Intelligence Index
19.7 Coding Index
23 Agentic Index
85.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 71.9% Graduate-level scientific reasoning
HLE 8.9% Humanity's Last Exam
MMLU Pro 79.9% Multi-task language understanding
AIME 2025 85.3% Competition mathematics (2025)
SciCode 30.4% Scientific computing

Technical

LiveCodeBench 16% Live coding evaluation
TerminalBench Hard 14.4% Agentic terminal tasks
τ²-Bench 31.6% Conversational agent benchmark

Content

IFBench 30.1% Instruction following
LCR 40.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Z.ai: GLM 4.6V stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID z-ai/glm-4.6v
Providerz-ai
Release Date December 8, 2025
Context Length131,072 tokens
Max Completion131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.30 $0.000300
Output $0.90 $0.000900

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

98.4%
Avg Uptime
572ms
Best Latency (TTFT)
67 tok/s
Best Throughput
3/4
Active Endpoints
Available via: SiliconFlow, DeepInfra, Z.AI, Novita

Leaderboard Categories