Google: Gemini 2.5 Flash Lite
Analysis Summary
Google's Gemini 2.5 Flash Lite, released in July 2025, is the lightweight tier of the Gemini 2.5 Flash family, supporting text, image, file, audio, and video input with tool use and function calling. Its intelligence index of 12.7 and coding index of 7.4 place it in the limited-to-moderate tier, though its math index of 35.3 and AIME score of 0.353 show reasonable quantitative ability for the price. The 1M token context window is a standout feature at this cost level.
For businesses, Flash Lite is best positioned as a high-volume, cost-sensitive component in larger pipelines. Its full multimodal input support, including audio and video, is unusual at this price point and opens up lightweight media processing use cases. Instruction-following and long-context reasoning are moderate, so it suits structured, well-defined tasks rather than open-ended reasoning.
At $0.10 input and $0.40 output per million tokens, it is one of the most cost-efficient multimodal options available. Teams running high-volume pipelines that need vision or audio capability without paying for frontier reasoning will find it a practical and well-priced choice.
Assessed June 9, 2026
Editorial notes
Gemini 2.5 Flash Lite from Google offers full multimodal support, tool use, and a 1M token context at very low cost, with moderate reasoning capability.
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
How Google: Gemini 2.5 Flash Lite compares
Google: Gemini 2.5 Flash Lite ranks #272 of 378 AI models we track for overall intelligence, #260 of 315 for coding, #253 of 289 for agentic tasks. Its 1M-token context window is larger than 97% of the models we list. At $0.10 per million input tokens it is cheaper than 66% of comparable models.
About Google: Gemini 2.5 Flash Lite
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance..
Capabilities
Performance Indices
Source: Artificial Analysis
Benchmark Scores
Intelligence
Technical
Content
Benchmark data from Artificial Analysis and Hugging Face
How does Google: Gemini 2.5 Flash Lite stack up?
Compare side-by-side with other professional models.
Model Information
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.10 | $0.000100 |
| Output | $0.40 | $0.000400 |
Live Performance
Live endpoint metrics, refreshed every 30 minutes.
Leaderboard Categories
External Resources
Explore Related Models
Frequently asked questions about Google: Gemini 2.5 Flash Lite
How much does Google: Gemini 2.5 Flash Lite cost?
Google: Gemini 2.5 Flash Lite costs $0.10 per million input tokens and $0.40 per million output tokens.
What is the context window of Google: Gemini 2.5 Flash Lite?
Google: Gemini 2.5 Flash Lite has a context window of 1,048,576 tokens (1M).
Is Google: Gemini 2.5 Flash Lite good for coding?
On our coding benchmark index, Google: Gemini 2.5 Flash Lite ranks #260 of 315 models, placing it in the broader range of the field for code generation and debugging.
What can Google: Gemini 2.5 Flash Lite do?
Google: Gemini 2.5 Flash Lite supports image/vision input, tool use, and function calling.
Who created Google: Gemini 2.5 Flash Lite?
Google: Gemini 2.5 Flash Lite is developed by Google and was released on July 22, 2025.
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: June 11, 2026 8:38 pm