Google: Gemma 3 12B

Google: Gemma 3 12B

google · Released Mar 13, 2025 Professional
Intelligence #10 / 576
82.0 Our Score
Speed #263 / 268
26.6 tokens / sec
Input #154 / 576
$0.050 per 1M tokens
Output #161 / 576
$0.150 per 1M tokens
Context #234 / 576
131,072 tokens

Analysis Summary

Google's Gemma 3 12B is a mid-small open-weight model released in March 2025, supporting text and image input with tool use and function calling. Its intelligence index of 8.8 and coding index of 6.3 place it in the limited-to-moderate tier, though its math index of 18.3 and MMLU-Pro of 0.595 show reasonable general knowledge breadth. The 131K context window is adequate for most document tasks.

For businesses, the combination of vision, tool use, and function calling at $0.05 input and $0.15 output per million tokens makes it an attractive option for cost-sensitive automation pipelines. It can handle structured extraction, basic tool-calling workflows, and image-aware tasks without significant cost overhead. Instruction-following scores are moderate, so prompt engineering matters.

It is best positioned as a budget-friendly component in multi-model architectures, handling routine structured tasks while more capable models handle complex reasoning. Teams building high-volume, low-complexity pipelines with tool integration will find it a practical choice.

Assessed June 9, 2026

Editorial notes

Gemma 3 12B from Google offers vision, tool use, and function calling at very low cost, with moderate reasoning capability suitable for lightweight business automation.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence1.9Technical1.1Value8Content2.4
Intelligence 1.9/10
Technical 1.1/10
Content 2.4/10
Value 8/10

How Google: Gemma 3 12B compares

Google: Gemma 3 12B ranks #333 of 377 AI models we track for overall intelligence, #269 of 314 for coding, #279 of 289 for agentic tasks. Its 131K-token context window is larger than 59% of the models we list. At $0.05 per million input tokens it is cheaper than 73% of comparable models.

About Google: Gemma 3 12B

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,..

12B Parameters

Capabilities

Tool Use Function Calling Vision

Architecture Detail

Instruct Typegemma

Performance Indices

Source: Artificial Analysis

8.8 Intelligence Index
6.3 Coding Index
5.8 Agentic Index
18.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 34.9% Graduate-level scientific reasoning
HLE 4.8% Humanity's Last Exam
MMLU Pro 59.5% Multi-task language understanding
MATH 500 85.3% Mathematical problem-solving
AIME 22% Competition mathematics
AIME 2025 18.3% Competition mathematics (2025)
SciCode 17.4% Scientific computing

Technical

LiveCodeBench 13.7% Live coding evaluation
TerminalBench Hard 0.8% Agentic terminal tasks
τ²-Bench 10.8% Conversational agent benchmark

Content

IFBench 36.7% Instruction following
LCR 6.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Google: Gemma 3 12B stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID google/gemma-3-12b-it
Providergoogle
Release Date March 13, 2025
Context Length131,072 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.05 $0.000050
Output $0.15 $0.000150

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

99.3%
Avg Uptime
738ms
Best Latency (TTFT)
74 tok/s
Best Throughput
2/2
Active Endpoints
Available via: DeepInfra, SambaNova

Leaderboard Categories

Frequently asked questions about Google: Gemma 3 12B

How much does Google: Gemma 3 12B cost?

Google: Gemma 3 12B costs $0.05 per million input tokens and $0.15 per million output tokens.

What is the context window of Google: Gemma 3 12B?

Google: Gemma 3 12B has a context window of 131,072 tokens (131K).

Is Google: Gemma 3 12B good for coding?

On our coding benchmark index, Google: Gemma 3 12B ranks #269 of 314 models, placing it in the broader range of the field for code generation and debugging.

What can Google: Gemma 3 12B do?

Google: Gemma 3 12B supports image/vision input, tool use, and function calling.

Who created Google: Gemma 3 12B?

Google: Gemma 3 12B is developed by Google and was released on March 13, 2025.