Google: Gemini 3 Flash Preview

Google: Gemini 3 Flash Preview

google · Released Dec 17, 2025
88
Our Score

Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance. It delivers near Pro level reasoning and tool use performance with substantially lower latency than larger Gemini variants, making it well suited for interactive development, long running agent loops, and collaborative coding tasks. Compared to Gemini 2.5 Flash, it provides broad quality improvements across reasoning, multimodal understanding, and reliability. The model supports a 1M token context window and multimodal inputs including text, images, audio, video, and PDFs, with text output. It includes configurable reasoning via thinking levels (minimal, low, medium, high), structured output, tool use, and automatic context caching. Gemini 3 Flash Preview is optimized for users who want strong reasoning and agentic behavior without the cost or latency of full scale frontier models.

$0.50 / 1M Input Price
$3.00 / 1M Output Price
1M tokens Context Window
65,536 tokens Max Output

Capabilities

Tool Use Function Calling Vision

Architecture

ModalityText + Image + File + Audio + Video → Text
TokenizerGemini

Performance Indices

Source: Artificial Analysis

46.4 Intelligence Index
42.6 Coding Index
59.5 Agentic Index
97 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 89.8%
Graduate-level scientific reasoning
HLE 34.7%
Humanity's Last Exam
MMLU Pro 89%
Multi-task language understanding
LiveCodeBench 90.8%
Live coding evaluation
SciCode 50.6%
Scientific computing
AIME 2025 97%
Competition mathematics (2025)
IFBench 78%
Instruction following
LCR 66.3%
Long-context reasoning
TerminalBench Hard 38.6%
Agentic terminal tasks
τ²-Bench 80.4%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID google/gemini-3-flash-preview
Providergoogle
Model FamilyGemini 3
Release Date December 17, 2025
Context Length1,048,576 tokens
Max Completion65,536 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.50 $0.000500
Output $3.00 $0.003000

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

98.8%
Avg Uptime
1,072ms
Best Latency (TTFT)
68 tok/s
Best Throughput
2/2
Active Endpoints
Available via: Google AI Studio, Google