Google: Gemma 3 12B

Google: Gemma 3 12B

google · Released Mar 13, 2025 Professional
Intelligence #10 / 565
82.0 Our Score
Speed #257 / 262
26.6 tokens / sec
Input #143 / 566
$0.040 per 1M tokens
Output #150 / 566
$0.130 per 1M tokens
Context #228 / 566
131,072 tokens

Analysis Summary

Google: Gemma 3 12B sits in the Professional tier on our leaderboard, ranked #10 of 565 published models on overall intelligence. At $0.040 input and $0.130 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports tool use, function calling, and vision.

Editorial notes

Gemma 3 12B from Google supports vision, tool use, and function calling at very low cost with a 131K context window, though reasoning and coding scores are modest for professional workflows.

Assessed May 31, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence1.9Technical1.1Value8Content2.4
Intelligence 1.9/10
Technical 1.1/10
Content 2.4/10
Value 8/10

How Google: Gemma 3 12B compares

Google: Gemma 3 12B ranks #327 of 371 AI models we track for overall intelligence, #263 of 308 for coding, #273 of 283 for agentic tasks. Its 131K-token context window is larger than 60% of the models we list. At $0.04 per million input tokens it is cheaper than 75% of comparable models.

About Google: Gemma 3 12B

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,..

12B Parameters

Capabilities

Tool Use Function Calling Vision

Architecture Detail

Instruct Typegemma

Performance Indices

Source: Artificial Analysis

8.8 Intelligence Index
6.3 Coding Index
5.8 Agentic Index
18.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 34.9% Graduate-level scientific reasoning
HLE 4.8% Humanity's Last Exam
MMLU Pro 59.5% Multi-task language understanding
MATH 500 85.3% Mathematical problem-solving
AIME 22% Competition mathematics
AIME 2025 18.3% Competition mathematics (2025)
SciCode 17.4% Scientific computing

Technical

LiveCodeBench 13.7% Live coding evaluation
TerminalBench Hard 0.8% Agentic terminal tasks
τ²-Bench 10.8% Conversational agent benchmark

Content

IFBench 36.7% Instruction following
LCR 6.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Google: Gemma 3 12B stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID google/gemma-3-12b-it
Providergoogle
Release Date March 13, 2025
Context Length131,072 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.04 $0.000040
Output $0.13 $0.000130

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

99.6%
Avg Uptime
399ms
Best Latency (TTFT)
43 tok/s
Best Throughput
2/2
Active Endpoints
Available via: DeepInfra, SambaNova

Leaderboard Categories

Frequently asked questions about Google: Gemma 3 12B

How much does Google: Gemma 3 12B cost?

Google: Gemma 3 12B costs $0.04 per million input tokens and $0.13 per million output tokens.

What is the context window of Google: Gemma 3 12B?

Google: Gemma 3 12B has a context window of 131,072 tokens (131K).

Is Google: Gemma 3 12B good for coding?

On our coding benchmark index, Google: Gemma 3 12B ranks #263 of 308 models, placing it in the broader range of the field for code generation and debugging.

What can Google: Gemma 3 12B do?

Google: Gemma 3 12B supports image/vision input, tool use, and function calling.

Who created Google: Gemma 3 12B?

Google: Gemma 3 12B is developed by Google and was released on March 13, 2025.