Google: Gemma 3 4B
Analysis Summary
Google: Gemma 3 4B sits in the Efficient tier on our leaderboard, ranked #263 of 551 published models on overall intelligence. At $0.040 input and $0.080 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports vision.
Editorial notes
Gemma 3 4B is a compact vision-capable model from Google with very low pricing, but benchmarks confirm limited reasoning and coding capability suited only to the simplest tasks.
Assessed May 5, 2026
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,..
Capabilities
Architecture Detail
| Instruct Type | gemma |
Performance Indices
Source: Artificial Analysis
Benchmark Scores
Intelligence
Technical
Content
Benchmark data from Artificial Analysis and Hugging Face
How does Google: Gemma 3 4B stack up?
Compare side-by-side with other efficient models.
Model Information
| OpenRouter ID |
google/gemma-3-4b-it
|
| Provider | |
| Release Date | March 13, 2025 |
| Context Length | 131,072 tokens |
| Max Completion | 16,384 tokens |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.04 | $0.000040 |
| Output | $0.08 | $0.000080 |
Live Performance
Live endpoint metrics — refreshed every 30 minutes.
External Resources
Explore Related Models
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: May 11, 2026 8:38 pm