Google: Gemini 2.5 Flash Lite

Google: Gemini 2.5 Flash Lite

google · Released Jul 22, 2025 Professional
Intelligence #14 / 590
82.0 Our Score
Speed #34 / 279
204.1 tokens / sec
Input #195 / 592
$0.100 per 1M tokens
Output #229 / 592
$0.400 per 1M tokens
Context #17 / 592
1M tokens

Analysis Summary

Gemini 2.5 Flash Lite is Google's ultra-efficient multimodal model, supporting text, image, file, audio, and video inputs with a 1M token context window. At $0.10 input and $0.40 output, it is one of the most cost-effective multimodal options available. Intelligence index at 6.9 and agentic index at 10.6 place it in the capable lightweight tier, with livecodebench at 0.400 and MMLU Pro at 0.724 showing solid general knowledge.

For businesses, the combination of multimodal input, tool use, function calling, and a massive context window makes it a strong fit for high-volume content processing, document summarisation, SEO content pipelines, and media analysis tasks. It is not suited to complex reasoning chains or frontier-level coding work.

The pricing and context window make it an excellent workhorse for cost-sensitive, high-throughput workflows. Teams running large-scale content operations or needing to process diverse media types will find it a practical and affordable choice.

Assessed June 30, 2026

Editorial notes

Gemini 2.5 Flash Lite from Google offers multimodal input including audio and video, tool use, a 1M token context window, and very low pricing, with moderate reasoning capability.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence1.9Technical1.8Value8.3Content3.6
Intelligence 1.9/10
Technical 1.8/10
Content 3.6/10
Value 8.3/10

How Google: Gemini 2.5 Flash Lite compares

Google: Gemini 2.5 Flash Lite ranks #285 of 385 AI models we track for overall intelligence, #257 of 293 for agentic tasks. Its 1M-token context window is larger than 97% of the models we list. At $0.10 per million input tokens it is cheaper than 67% of comparable models.

About Google: Gemini 2.5 Flash Lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

6.9 Intelligence Index
10.6 Agentic Index
35.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 47.4% Graduate-level scientific reasoning
HLE 3.7% Humanity's Last Exam
MMLU Pro 72.4% Multi-task language understanding
MATH 500 92.6% Mathematical problem-solving
AIME 50% Competition mathematics
AIME 2025 35.3% Competition mathematics (2025)
SciCode 17.7% Scientific computing

Technical

LiveCodeBench 40% Live coding evaluation
TerminalBench Hard 2.3% Agentic terminal tasks
τ²-Bench 19% Conversational agent benchmark

Content

IFBench 31.5% Instruction following
LCR 31.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Google: Gemini 2.5 Flash Lite stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID google/gemini-2.5-flash-lite
Providergoogle
Model FamilyGemini 2
Release Date July 22, 2025
Context Length1,048,576 tokens
Max Completion65,535 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.10 $0.000100
Output $0.40 $0.000400

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

98.5%
Avg Uptime
383ms
Best Latency (TTFT)
104 tok/s
Best Throughput
3/3
Active Endpoints
Available via: Google, Google AI Studio

Leaderboard Categories

Frequently asked questions about Google: Gemini 2.5 Flash Lite

How much does Google: Gemini 2.5 Flash Lite cost?

Google: Gemini 2.5 Flash Lite costs $0.10 per million input tokens and $0.40 per million output tokens.

What is the context window of Google: Gemini 2.5 Flash Lite?

Google: Gemini 2.5 Flash Lite has a context window of 1,048,576 tokens (1M).

What can Google: Gemini 2.5 Flash Lite do?

Google: Gemini 2.5 Flash Lite supports image/vision input, tool use, and function calling.

Who created Google: Gemini 2.5 Flash Lite?

Google: Gemini 2.5 Flash Lite is developed by Google and was released on July 22, 2025.