Google: Gemini 2.5 Flash

Google: Gemini 2.5 Flash

google · Released Jun 17, 2025 Specialist
54.9
Our Score

Performance Profile

Intelligence4.2Technical3.1Value7.8Content5.5
Intelligence 4.2/10
Technical 3.1/10
Content 5.5/10
Value 7.8/10

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning).

$0.30 / 1M
Input Price
$2.50 / 1M
Output Price
1M tokens
Context Window
65,535 tokens
Max Output

Capabilities

Tool Use Function Calling Vision

Architecture

ModalityText + Image + File + Audio + Video → Text
TokenizerGemini

Performance Indices

Source: Artificial Analysis

20.6 Intelligence Index
17.8 Coding Index
13.5 Agentic Index
60.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 68.3% Graduate-level scientific reasoning
HLE 5.1% Humanity's Last Exam
MMLU Pro 80.9% Multi-task language understanding
MATH 500 93.2% Mathematical problem-solving
AIME 50% Competition mathematics
AIME 2025 60.3% Competition mathematics (2025)
SciCode 29.1% Scientific computing

Technical

LiveCodeBench 49.5% Live coding evaluation
TerminalBench Hard 12.1% Agentic terminal tasks
τ²-Bench 14.9% Conversational agent benchmark

Content

IFBench 39% Instruction following
LCR 45.9% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID google/gemini-2.5-flash
Providergoogle
Model FamilyGemini 2
Release Date June 17, 2025
Context Length1,048,576 tokens
Max Completion65,535 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.30 $0.000300
Output $2.50 $0.002500

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

97.7%
Avg Uptime
587ms
Best Latency (TTFT)
108 tok/s
Best Throughput
3/3
Active Endpoints
Available via: Google, Google AI Studio

Leaderboard Categories