Google: Gemma 3 4B

Google: Gemma 3 4B

google · Released Mar 13, 2025 Efficient
Intelligence #263 / 551
32.2 Our Score
Speed #251 / 257
29.8 tokens / sec
Input #140 / 552
$0.040 per 1M tokens
Output #128 / 552
$0.080 per 1M tokens
Context #206 / 552
131,072 tokens

Analysis Summary

Google: Gemma 3 4B sits in the Efficient tier on our leaderboard, ranked #263 of 551 published models on overall intelligence. At $0.040 input and $0.080 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports vision.

Editorial notes

Gemma 3 4B is a compact vision-capable model from Google with very low pricing, but benchmarks confirm limited reasoning and coding capability suited only to the simplest tasks.

Assessed May 5, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence1.6Technical0.5Value8Content2.5
Intelligence 1.6/10
Technical 0.5/10
Content 2.5/10
Value 8/10

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,..

4B Parameters

Capabilities

Vision

Architecture Detail

Instruct Typegemma

Performance Indices

Source: Artificial Analysis

6.3 Intelligence Index
2.9 Coding Index
2.9 Agentic Index
12.7 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 29.1% Graduate-level scientific reasoning
HLE 5.2% Humanity's Last Exam
MMLU Pro 41.7% Multi-task language understanding
MATH 500 76.6% Mathematical problem-solving
AIME 6.3% Competition mathematics
AIME 2025 12.7% Competition mathematics (2025)
SciCode 7.3% Scientific computing

Technical

LiveCodeBench 11.2% Live coding evaluation
TerminalBench Hard 0.8% Agentic terminal tasks
τ²-Bench 5% Conversational agent benchmark

Content

IFBench 28.3% Instruction following
LCR 5.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Google: Gemma 3 4B stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID google/gemma-3-4b-it
Providergoogle
Release Date March 13, 2025
Context Length131,072 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.04 $0.000040
Output $0.08 $0.000080

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
373ms
Best Latency (TTFT)
31 tok/s
Best Throughput
1/1
Active Endpoints
Available via: DeepInfra