Google: Gemma 3 4B

Google: Gemma 3 4B

google · Released Mar 13, 2025
28
Our Score

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities, including structured outputs and function calling.

$0.04 / 1M Input Price
$0.08 / 1M Output Price
131,072 tokens Context Window
4B Parameters

Capabilities

Function Calling Vision

Architecture

ModalityText + Image → Text
TokenizerGemini
Instruct Typegemma
Parameters4B

Model Information

OpenRouter ID google/gemma-3-4b-it
Providergoogle
Release Date March 13, 2025
Context Length131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.04 $0.000040
Output $0.08 $0.000080

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
419ms
Best Latency (TTFT)
36 tok/s
Best Throughput
1/1
Active Endpoints
Available via: DeepInfra