Google: Gemini 2.5 Flash Lite

Google: Gemini 2.5 Flash Lite

google · Released Jul 22, 2025 Specialist
Intelligence #139 / 557
50.7 Our Score
Speed #25 / 259
202.5 tokens / sec
Input #192 / 560
$0.100 per 1M tokens
Output #219 / 560
$0.400 per 1M tokens
Context #17 / 560
1M tokens

Analysis Summary

Google: Gemini 2.5 Flash Lite sits in the Specialist tier on our leaderboard, ranked #139 of 557 published models on overall intelligence. At $0.100 input and $0.400 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use, function calling, vision, and reasoning.

Editorial notes

Gemini 2.5 Flash Lite from Google offers an exceptional 1M token context, multimodal input including audio and video, and very low pricing, but benchmark scores across reasoning and coding are limited.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence3.7Technical2.3Value8.3Content4.5
Intelligence 3.7/10
Technical 2.3/10
Content 4.5/10
Value 8.3/10

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

12.7 Intelligence Index
7.4 Coding Index
10.6 Agentic Index
35.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 47.4% Graduate-level scientific reasoning
HLE 3.7% Humanity's Last Exam
MMLU Pro 72.4% Multi-task language understanding
MATH 500 92.6% Mathematical problem-solving
AIME 50% Competition mathematics
AIME 2025 35.3% Competition mathematics (2025)
SciCode 17.7% Scientific computing

Technical

LiveCodeBench 40% Live coding evaluation
TerminalBench Hard 2.3% Agentic terminal tasks
τ²-Bench 19% Conversational agent benchmark

Content

IFBench 31.5% Instruction following
LCR 31.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Google: Gemini 2.5 Flash Lite stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID google/gemini-2.5-flash-lite
Providergoogle
Model FamilyGemini 2
Release Date July 22, 2025
Context Length1,048,576 tokens
Max Completion65,535 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.10 $0.000100
Output $0.40 $0.000400

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.9%
Avg Uptime
390ms
Best Latency (TTFT)
164 tok/s
Best Throughput
3/3
Active Endpoints
Available via: Google, Google AI Studio

Leaderboard Categories

SEO