Google: Gemma 4 31B

Google: Gemma 4 31B

google · Released Apr 2, 2026 Professional
Intelligence #39 / 557
75.9 Our Score
Speed #253 / 259
28.6 tokens / sec
Input #227 / 560
$0.120 per 1M tokens
Output #217 / 560
$0.370 per 1M tokens
Context #99 / 560
262,144 tokens

Analysis Summary

Google: Gemma 4 31B sits in the Professional tier on our leaderboard, ranked #39 of 557 published models on overall intelligence. At $0.120 input and $0.370 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use, function calling, and vision.

Editorial notes

Gemma 4 31B from Google combines strong reasoning, coding, vision, and tool use with a 262K context at very competitive pricing, making it a capable mid-tier option.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence7Technical6.8Value8Content7
Intelligence 7/10
Technical 6.8/10
Content 7/10
Value 8/10

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function..

31B Parameters

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

32.3 Intelligence Index
33.9 Coding Index
47.9 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 76.3% Graduate-level scientific reasoning
HLE 11.5% Humanity's Last Exam
SciCode 41.1% Scientific computing

Technical

TerminalBench Hard 30.3% Agentic terminal tasks
τ²-Bench 65.5% Conversational agent benchmark

Content

IFBench 53.5% Instruction following
LCR 36% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Google: Gemma 4 31B stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID google/gemma-4-31b-it
Providergoogle
Release Date April 2, 2026
Context Length262,144 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.12 $0.000120
Output $0.37 $0.000370

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

88.8%
Avg Uptime
653ms
Best Latency (TTFT)
44 tok/s
Best Throughput
9/9
Active Endpoints
Available via: DeepInfra, Chutes, Ambient, SiliconFlow, Novita, Parasail, Venice, Together

Leaderboard Categories