Google: Gemma 3 4B

Google: Gemma 3 4B

google · Released Mar 13, 2025 Efficient
32.2
Our Score

Performance Profile

Intelligence1.6Technical0.5Value8Content2.5
Intelligence 1.6/10
Technical 0.5/10
Content 2.5/10
Value 8/10

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,..

$0.04 / 1M
Input Price
$0.08 / 1M
Output Price
131,072 tokens
Context Window
4B Parameters

Capabilities

Vision

Architecture

ModalityText + Image → Text
TokenizerGemini
Instruct Typegemma
Parameters4B

Performance Indices

Source: Artificial Analysis

6.3 Intelligence Index
2.9 Coding Index
2.9 Agentic Index
12.7 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 29.1% Graduate-level scientific reasoning
HLE 5.2% Humanity's Last Exam
MMLU Pro 41.7% Multi-task language understanding
MATH 500 76.6% Mathematical problem-solving
AIME 6.3% Competition mathematics
AIME 2025 12.7% Competition mathematics (2025)
SciCode 7.3% Scientific computing

Technical

LiveCodeBench 11.2% Live coding evaluation
TerminalBench Hard 0.8% Agentic terminal tasks
τ²-Bench 5% Conversational agent benchmark

Content

IFBench 28.3% Instruction following
LCR 5.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Google: Gemma 3 4B stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID google/gemma-3-4b-it
Providergoogle
Release Date March 13, 2025
Context Length131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.04 $0.000040
Output $0.08 $0.000080

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
347ms
Best Latency (TTFT)
22 tok/s
Best Throughput
1/1
Active Endpoints
Available via: DeepInfra