Home > AI Models > Google: Gemma 4 31B

Google: Gemma 4 31B

Name: Google: Gemma 4 31B Review
Item: Google: Gemma 4 31B
Author: Design for Online Editorial

NEWKimi K3in at #9 NEWKAT-Coder-Air V2.5in at #560 NEWKAT-Coder-Pro V2.5in at #568 NEWMuse Spark 1.1in at #392 NEWUncensoredin at #487 NEWGPT-5.6 Terrain at #11 NEWGPT-5.6 Sol Proin at #416 NEWGPT-5.6 Solin at #2

Google: Gemma 4 31B

google · Released Apr 2, 2026

Intelligence #9 / 612

82.0 our score

Speed #264 / 287

35.9 tok/s

Input Price #234 / 612

$0.120 per 1M tokens

Output Price #228 / 612

$0.370 per 1M tokens

Context #125 / 612

262,144 tokens

Gemma 4 31B is Google's dense 31-billion-parameter model, with an intelligence index of 29.4 and a coding index of 43.4, both meaningfully above the sparse 26B A4B sibling. The agentic index of 48.2 is more capable for multi-step tasks, and the model supports vision, video, tool use, and function calling across a 262K context window.

For businesses, Gemma 4 31B suits structured content generation, SEO workflows, coding assistance for lighter tasks, and tool-calling pipelines where multimodal input is needed. The instruction following score of 0.756 is strong for its tier, making it reliable for templated and structured output tasks.

At $0.12 input and $0.35 output per million tokens, it offers excellent price-performance for a benchmarked multimodal model. Teams needing a step up from the 26B A4B without moving to premium pricing will find it a practical choice.

Assessed July 10, 2026

Editorial notes

Gemma 4 31B from Google pairs vision, video, and tool use with a 262K context at $0.12 input per million tokens, offering a meaningful step up in reasoning and coding over the 26B A4B variant.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

DFO Verdict

Gemma 4 31B from Google pairs vision, video, and tool use with a 262K context at $0.12 input per million tokens, offering a meaningful step up in reasoning and coding over the 26B A4B variant.

#9 of 612 overall

Benchmark scores

GPQA Diamond 85.7%

HLE 22.7%

SciCode 43.4%

TerminalBench Hard 36.4%

τ²-Bench 59.9%

IFBench 75.6%

LCR 62%

Magenta = intelligence · Ink = technical/agentic · Cyan = content & long-context · Grey = community benchmarks. Data: Artificial Analysis, Hugging Face.

29.4 Intelligence Index·43.4 Coding Index·48.2 Agentic Index

How Google: Gemma 4 31B compares

Google: Gemma 4 31B ranks #96 of 393 AI models we track for overall intelligence, #51 of 157 for coding, #83 of 300 for agentic tasks. Its 262K-token context window is larger than 80% of the models we list. At $0.12 per million input tokens it is cheaper than 62% of comparable models.

Position in the field

Intelligence: smarter than 99% of models #9

Speed: faster than 8% of models #264

Price: cheaper than 62% of models #234

Context: larger than 80% of models #125

worst in fieldmedianbest in field

Price vs frontier peers · $ per 1M tokens

Google: Gemma 4 31B $0.12 in $0.37 out

Anthropic: Claude Fable 5 $10.00 in $50.00 out

Anthropic: Claude Opus 4.8 $5.00 in $25.00 out

Google: Gemini 3.1 Pro Preview $2.00 in $12.00 out

Dark bar = input · light bar = output, scaled to the priciest peer.

Context window vs peers · tokens

Google: Gemini 3.1 Pro Preview 1M

Anthropic: Claude Fable 5 1M

Anthropic: Claude Opus 4.8 1M

Google: Gemma 4 31B 262K

1M tokens ≈ 8 full-length novels or ~2,500 pages of business documents in a single request.

Performance profile

Strongest on value. The pulled-in intelligence corner is the trade-off, and if the shape matters more than the price, this is your model.

Compare shapes side-by-side →

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.12	$0.000120
Output	$0.37	$0.000370

What would Google: Gemma 4 31B cost your business?

Pick the job that looks most like yours, then fine-tune with the sliders. Estimates update live.

A website chatbot handling around 100 customer conversations a day, a few short messages each.

Requests per month 3,000

One request is one message, email, draft or automation call.

Size of each request 1,200 tokens

$0/mo Google: Gemma 4 31B

$0/mo Anthropic: Claude Fable 5

$0/mo Z.ai: GLM 5.2 · best value

Full calculator with 612 models → Price Calculator

DFO AI AUTOMATION

These numbers get smaller with the right architecture.

We route routine calls to cheap models and save Google: Gemma 4 31B for the hard ones. Most clients cut their estimate by 60-80%.

Talk to our team

About Google: Gemma 4 31B

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function..

Embed this ranking

Writing about this model? Add the badge to your site. It always shows the current rank and score, and links back to this page.

<a href="https://designforonline.com/ai-models/google-gemma-4-31b/"><img src="https://designforonline.com/?aiml_badge=google-gemma-4-31b&theme=dark" alt="Google: Gemma 4 31B, ranked #9 on the Design for Online AI Leaderboard" width="400" height="76"></a>

<a href="https://designforonline.com/ai-models/google-gemma-4-31b/"><img src="https://designforonline.com/?aiml_badge=google-gemma-4-31b&theme=light" alt="Google: Gemma 4 31B, ranked #9 on the Design for Online AI Leaderboard" width="400" height="76"></a>

Frequently asked questions about Google: Gemma 4 31B

How much does Google: Gemma 4 31B cost?

Google: Gemma 4 31B costs $0.12 per million input tokens and $0.37 per million output tokens.

What is the context window of Google: Gemma 4 31B?

Google: Gemma 4 31B has a context window of 262,144 tokens (262K).

Is Google: Gemma 4 31B good for coding?

On our coding benchmark index, Google: Gemma 4 31B ranks #51 of 157 models, placing it in the broader range of the field for code generation and debugging.

What can Google: Gemma 4 31B do?

Google: Gemma 4 31B supports image/vision input, tool use, and function calling.

Who created Google: Gemma 4 31B?

Google: Gemma 4 31B is developed by Google and was released on April 2, 2026.

Performance profile

Intelligence 4.8

Technical 5.6

Content 7.8

Value 8

Reasoning: Yes
Input
Output
Context: 262,144 tokens
Max output: 16,384 tokens
Tokenizer: Gemma
Released: Apr 2, 2026

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Model information

Provider google

OpenRouter ID google/gemma-4-31b-it

Status Active

Capabilities

Tool Use Function Calling Vision

Ranked in

Coding Content Writing SEO Tool Use

Live performance · 30 min refresh

96% Avg uptime

294ms Best latency

68 tok/s Best throughput

14/16 Active endpoints

External resources View on OpenRouter API access, playground & provider details API Quickstart Sample code and integration guide

Data sourced from the OpenRouter API, Artificial Analysis, the Hugging Face Open LLM Leaderboard and our own internal testing. Scores are editorially curated by our team.

Last updated: July 19, 2026 8:38 pm

Issues with our rankings? Contact us

Google: Gemma 4 31B

DFO Verdict

Benchmark scores

How Google: Gemma 4 31B compares

Pricing

What would Google: Gemma 4 31B cost your business?

About Google: Gemma 4 31B

Explore Related Models

Embed this ranking

Frequently asked questions about Google: Gemma 4 31B

How much does Google: Gemma 4 31B cost?

What is the context window of Google: Gemma 4 31B?

Is Google: Gemma 4 31B good for coding?

What can Google: Gemma 4 31B do?

Who created Google: Gemma 4 31B?