Google: Gemini 2.5 Flash Lite
Analysis Summary
Gemini 2.5 Flash Lite is Google's ultra-efficient multimodal model, supporting text, image, file, audio, and video inputs with a 1M token context window. At $0.10 per million input tokens and $0.40 per million output tokens, it is one of the most cost-effective multimodal options available. An intelligence index of 6.9 and an agentic index of 10.6 place it in the capable lightweight tier, with LiveCodeBench at 0.400 and MMLU Pro at 0.724 showing solid general knowledge.
For businesses, the combination of multimodal input, tool use, function calling, and a massive context window makes it a strong fit for high-volume content processing, document summarisation, SEO content pipelines, and media analysis tasks. It is not suited to complex reasoning chains or frontier-level coding work.
The pricing and context window make it an excellent workhorse for cost-sensitive, high-throughput workflows. Teams running large-scale content operations or needing to process diverse media types will find it a practical and affordable choice.
Assessed June 30, 2026
Editorial notes
Gemini 2.5 Flash Lite from Google offers multimodal input including audio and video, tool use, a 1M token context window, and very low pricing, with moderate reasoning capability.
Rankings consider pricing, capabilities, benchmarks, and real-world applicability, and are refreshed as new models launch.
Performance Profile
How Google: Gemini 2.5 Flash Lite compares
Google: Gemini 2.5 Flash Lite ranks #285 of 385 AI models we track for overall intelligence and #257 of 293 for agentic tasks. Its 1M-token context window is larger than 97% of the models we list. At $0.10 per million input tokens, it is cheaper than 67% of comparable models.
About Google: Gemini 2.5 Flash Lite
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance..
Capabilities
Performance Indices
Source: Artificial Analysis
Benchmark Scores
Benchmark data from Artificial Analysis and Hugging Face
Model Information
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.10 | $0.000100 |
| Output | $0.40 | $0.000400 |
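To see what these rates mean for a real workload, the arithmetic can be sketched as below. This is a minimal illustration, not an official calculator: the helper name `estimate_cost_usd` is hypothetical, the prices are copied from the table above, and it assumes flat per-token billing with no caching discounts, media surcharges, or tiered rates.

```python
from decimal import Decimal

# Prices per one million tokens, copied from the pricing table above.
INPUT_USD_PER_MILLION = Decimal("0.10")
OUTPUT_USD_PER_MILLION = Decimal("0.40")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes flat per-token billing at the listed rates.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Example workload: 10,000 requests of 2,000 input and 500 output tokens each.
    per_request = estimate_cost_usd(2_000, 500)
    print(f"per request: ${per_request}")
    print(f"10,000 requests: ${per_request * 10_000}")
```

Under these assumptions, a request with 2,000 input tokens and 500 output tokens costs $0.0004, so 10,000 such requests come to about $4. `Decimal` is used instead of floats so that very small per-request amounts do not pick up rounding noise when summed.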
Frequently asked questions about Google: Gemini 2.5 Flash Lite
How much does Google: Gemini 2.5 Flash Lite cost?
Google: Gemini 2.5 Flash Lite costs $0.10 per million input tokens and $0.40 per million output tokens.
What is the context window of Google: Gemini 2.5 Flash Lite?
Google: Gemini 2.5 Flash Lite has a context window of 1,048,576 tokens (1M).
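As a rough planning aid, the following sketch checks whether a job fits in that window and how many chunks it would need if it does not. The helper names `fits_in_context` and `chunks_needed` are hypothetical, token counts are assumed to come from whatever tokenizer or counting endpoint you already use, and treating reserved output tokens as sharing the same window is an assumption rather than something stated on this page.

```python
import math

# Context window quoted above: 1,048,576 tokens, which equals 2**20.
CONTEXT_WINDOW_TOKENS = 1_048_576


def fits_in_context(prompt_tokens: int, reserved_output_tokens: int = 0) -> bool:
    """Return True if the prompt plus reserved output fits in the window.

    Assumption: prompt and output tokens share one window.
    """
    if prompt_tokens < 0 or reserved_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + reserved_output_tokens <= CONTEXT_WINDOW_TOKENS


def chunks_needed(total_tokens: int, reserved_output_tokens: int = 0) -> int:
    """Return how many requests are needed to cover total_tokens of input."""
    if total_tokens < 0:
        raise ValueError("total_tokens must be non-negative")
    usable = CONTEXT_WINDOW_TOKENS - reserved_output_tokens
    if usable <= 0:
        raise ValueError("reserved output leaves no room for input")
    return math.ceil(total_tokens / usable)


if __name__ == "__main__":
    print(fits_in_context(1_000_000, reserved_output_tokens=8_192))
    print(chunks_needed(3_000_000))
```

This simple split ignores overlap between chunks and any per-request instructions repeated in each chunk, both of which reduce the usable space in practice.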
What can Google: Gemini 2.5 Flash Lite do?
Google: Gemini 2.5 Flash Lite supports text, image, file, audio, and video input, along with tool use and function calling.
Who created Google: Gemini 2.5 Flash Lite?
Google: Gemini 2.5 Flash Lite is developed by Google and was released on July 22, 2025.
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: June 30, 2026 9:37 pm