Google: Gemma 4 26B A4B

Google: Gemma 4 26B A4B

google · Released Apr 3, 2026 Specialist
Intelligence #86 / 557
61.9 Our Score
Speed #162 / 259
66.4 tokens / sec
Input #160 / 560
$0.060 per 1M tokens
Output #214 / 560
$0.330 per 1M tokens
Context #99 / 560
262,144 tokens

Analysis Summary

Google: Gemma 4 26B A4B sits in the Specialist tier on our leaderboard, ranked #86 of 557 published models on overall intelligence. At $0.060 input and $0.330 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use, function calling, and vision.

Editorial notes

Gemma 4 26B A4B from Google offers vision, tool use, and strong instruction following at very low cost, though coding and agentic benchmarks are limited.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence5.7Technical3.9Value8Content6
Intelligence 5.7/10
Technical 3.9/10
Content 6/10
Value 8/10

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at..

26B Parameters

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

31.2 Intelligence Index
22.4 Coding Index
28.6 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 79.2% Graduate-level scientific reasoning
HLE 18.3% Humanity's Last Exam
SciCode 40% Scientific computing

Technical

TerminalBench Hard 13.6% Agentic terminal tasks
τ²-Bench 43.6% Conversational agent benchmark

Content

IFBench 72.4% Instruction following
LCR 55.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Google: Gemma 4 26B A4B stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID google/gemma-4-26b-a4b-it
Providergoogle
Release Date April 3, 2026
Context Length262,144 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.06 $0.000060
Output $0.33 $0.000330

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

98.6%
Avg Uptime
287ms
Best Latency (TTFT)
37 tok/s
Best Throughput
10/10
Active Endpoints
Available via: DekaLLM, DeepInfra, Cloudflare, SiliconFlow, Parasail, Novita, NextBit, Io Net +2 more

Leaderboard Categories