Google: Gemini 2.5 Flash Lite

Google: Gemini 2.5 Flash Lite

google · Released Jul 22, 2025 Efficient
Intelligence #167 / 525
44.4 Our Score
Speed #7 / 244
324.0 tokens / sec
Input #175 / 525
$0.100 per 1M tokens
Output #204 / 525
$0.400 per 1M tokens
Context #12 / 525
1M tokens

Analysis Summary

Google: Gemini 2.5 Flash Lite sits in the Efficient tier on our leaderboard, ranked #167 of 525 published models on overall intelligence. At $0.100 input and $0.400 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use, function calling, vision, and reasoning.

Editorial notes

Gemini 2.5 Flash Lite from Google supports rich multimodal input with tool use and a 1M context at $0.10 input; intelligence and coding benchmarks are limited, positioning it for lightweight, high-volume tasks.

Assessed April 26, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence2.9Technical1.8Value8.3Content3
Intelligence 2.9/10
Technical 1.8/10
Content 3/10
Value 8.3/10

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

12.7 Intelligence Index
7.4 Coding Index
10.7 Agentic Index
35.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 47.4% Graduate-level scientific reasoning
HLE 3.7% Humanity's Last Exam
MMLU Pro 72.4% Multi-task language understanding
MATH 500 92.6% Mathematical problem-solving
AIME 50% Competition mathematics
AIME 2025 35.3% Competition mathematics (2025)
SciCode 17.7% Scientific computing

Technical

LiveCodeBench 40% Live coding evaluation
TerminalBench Hard 2.3% Agentic terminal tasks
τ²-Bench 19% Conversational agent benchmark

Content

IFBench 31.5% Instruction following
LCR 31.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Google: Gemini 2.5 Flash Lite stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID google/gemini-2.5-flash-lite
Providergoogle
Model FamilyGemini 2
Release Date July 22, 2025
Context Length1,048,576 tokens
Max Completion65,535 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.10 $0.000100
Output $0.40 $0.000400

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.7%
Avg Uptime
508ms
Best Latency (TTFT)
109 tok/s
Best Throughput
2/2
Active Endpoints
Available via: Google, Google AI Studio

Leaderboard Categories