Google: Gemma 4 31B

Google: Gemma 4 31B

google · Released Apr 2, 2026 Professional New
Intelligence #38 / 525
75.9 Our Score
Speed #220 / 244
35.5 tokens / sec
Input #208 / 525
$0.130 per 1M tokens
Output #203 / 525
$0.380 per 1M tokens
Context #80 / 525
262,144 tokens

Analysis Summary

Google: Gemma 4 31B sits in the Professional tier on our leaderboard, ranked #38 of 525 published models on overall intelligence. At $0.130 input and $0.380 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use, function calling, and vision.

Editorial notes

Google's Gemma 4 31B is a strong open-weight model with solid reasoning, coding, and multimodal capability at very low cost, making it an excellent choice for businesses seeking capable performance without high API spend.

Assessed April 23, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence7Technical6.8Value8Content7
Intelligence 7/10
Technical 6.8/10
Content 7/10
Value 8/10

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function..

31B Parameters

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

39.2 Intelligence Index
38.7 Coding Index
48.2 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 85.7% Graduate-level scientific reasoning
HLE 22.7% Humanity's Last Exam
SciCode 43.4% Scientific computing

Technical

TerminalBench Hard 36.4% Agentic terminal tasks
τ²-Bench 59.9% Conversational agent benchmark

Content

IFBench 75.6% Instruction following
LCR 62% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Google: Gemma 4 31B stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID google/gemma-4-31b-it
Providergoogle
Release Date April 2, 2026
Context Length262,144 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.13 $0.000130
Output $0.38 $0.000380

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

92.3%
Avg Uptime
424ms
Best Latency (TTFT)
58 tok/s
Best Throughput
7/7
Active Endpoints
Available via: DeepInfra, Chutes, Novita, Parasail, AkashML, Venice, Together

Leaderboard Categories