Google: Gemini 2.5 Flash Lite

Google: Gemini 2.5 Flash Lite

google · Released Jul 22, 2025 Professional
Intelligence #10 / 576
82.0 Our Score
Speed #19 / 271
241.9 tokens / sec
Input #196 / 577
$0.100 per 1M tokens
Output #226 / 577
$0.400 per 1M tokens
Context #17 / 577
1M tokens

Analysis Summary

Google's Gemini 2.5 Flash Lite, released in July 2025, is the lightweight tier of the Gemini 2.5 Flash family, supporting text, image, file, audio, and video input with tool use and function calling. Its intelligence index of 12.7 and coding index of 7.4 place it in the limited-to-moderate tier, though its math index of 35.3 and AIME score of 0.353 show reasonable quantitative ability for the price. The 1M token context window is a standout feature at this cost level.

For businesses, Flash Lite is best positioned as a high-volume, cost-sensitive component in larger pipelines. Its full multimodal input support, including audio and video, is unusual at this price point and opens up lightweight media processing use cases. Instruction-following and long-context reasoning are moderate, so it suits structured, well-defined tasks rather than open-ended reasoning.

At $0.10 input and $0.40 output per million tokens, it is one of the most cost-efficient multimodal options available. Teams running high-volume pipelines that need vision or audio capability without paying for frontier reasoning will find it a practical and well-priced choice.

Assessed June 9, 2026

Editorial notes

Gemini 2.5 Flash Lite from Google offers full multimodal support, tool use, and a 1M token context at very low cost, with moderate reasoning capability.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence2.5Technical1.7Value8.3Content3.6
Intelligence 2.5/10
Technical 1.7/10
Content 3.6/10
Value 8.3/10

How Google: Gemini 2.5 Flash Lite compares

Google: Gemini 2.5 Flash Lite ranks #272 of 378 AI models we track for overall intelligence, #260 of 315 for coding, #253 of 289 for agentic tasks. Its 1M-token context window is larger than 97% of the models we list. At $0.10 per million input tokens it is cheaper than 66% of comparable models.

About Google: Gemini 2.5 Flash Lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

12.7 Intelligence Index
7.4 Coding Index
10.6 Agentic Index
35.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 47.4% Graduate-level scientific reasoning
HLE 3.7% Humanity's Last Exam
MMLU Pro 72.4% Multi-task language understanding
MATH 500 92.6% Mathematical problem-solving
AIME 50% Competition mathematics
AIME 2025 35.3% Competition mathematics (2025)
SciCode 17.7% Scientific computing

Technical

LiveCodeBench 40% Live coding evaluation
TerminalBench Hard 2.3% Agentic terminal tasks
τ²-Bench 19% Conversational agent benchmark

Content

IFBench 31.5% Instruction following
LCR 31.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Google: Gemini 2.5 Flash Lite stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID google/gemini-2.5-flash-lite
Providergoogle
Model FamilyGemini 2
Release Date July 22, 2025
Context Length1,048,576 tokens
Max Completion65,535 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.10 $0.000100
Output $0.40 $0.000400

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

99.9%
Avg Uptime
380ms
Best Latency (TTFT)
108 tok/s
Best Throughput
3/3
Active Endpoints
Available via: Google, Google AI Studio

Leaderboard Categories

Frequently asked questions about Google: Gemini 2.5 Flash Lite

How much does Google: Gemini 2.5 Flash Lite cost?

Google: Gemini 2.5 Flash Lite costs $0.10 per million input tokens and $0.40 per million output tokens.

What is the context window of Google: Gemini 2.5 Flash Lite?

Google: Gemini 2.5 Flash Lite has a context window of 1,048,576 tokens (1M).

Is Google: Gemini 2.5 Flash Lite good for coding?

On our coding benchmark index, Google: Gemini 2.5 Flash Lite ranks #260 of 315 models, placing it in the broader range of the field for code generation and debugging.

What can Google: Gemini 2.5 Flash Lite do?

Google: Gemini 2.5 Flash Lite supports image/vision input, tool use, and function calling.

Who created Google: Gemini 2.5 Flash Lite?

Google: Gemini 2.5 Flash Lite is developed by Google and was released on July 22, 2025.