Google: Gemini 3.1 Flash Lite

google · Released May 7, 2026 · Emerging
Intelligence: 32.6 (Our Score) · #382 / 576
Speed: Not reported
Input: $0.25 per 1M tokens · #303 / 577
Output: $1.50 per 1M tokens · #358 / 577
Context: 1M tokens · #17 / 577

Analysis Summary

Google Gemini 3.1 Flash Lite is a lightweight multimodal model from Google supporting text, image, file, audio, and video inputs across a 1,048,576 token context window. Pricing at $0.25 input and $1.50 output per million tokens is competitive for the modality breadth on offer.

However, the complete absence of benchmark data means reasoning, coding, and instruction-following quality cannot be assessed. Without this evidence, it is not possible to position it accurately against the broader model landscape, even accounting for Google's strong track record with the Gemini family.

For businesses, the multimodal capability and large context make it worth monitoring as benchmarks emerge. Until then, teams requiring verified performance should use the benchmarked Gemini 3.5 Flash or Gemini 3.1 Pro variants instead.

Assessed June 6, 2026

Editorial notes

Google Gemini 3.1 Flash Lite supports a full multimodal stack including audio and video with a 1M token context at low cost, but has no published benchmark data.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability, and are refreshed as new models launch.

Performance Profile

Intelligence 0/10
Technical 0/10
Content 3/10
Value 7.8/10

How Google: Gemini 3.1 Flash Lite compares

Its 1M-token context window is larger than 97% of the models we list. At $0.25 per million input tokens it is cheaper than 47% of comparable models.
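The percentages above appear to follow from the rank figures in the header card (#17 / 577 for context, #303 / 577 for input price). A minimal sketch of that arithmetic, assuming the share is simply the fraction of listed models ranked below this one; the function name `share_beaten` is illustrative and the exact formula the site uses is not stated:

```python
def share_beaten(rank: int, total: int) -> float:
    """Percentage of listed models that rank below the given 1-based rank.

    Assumption: rank 1 is best, and every model ranked after this one
    counts as "beaten". The site's actual method is not documented here.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return 100.0 * (total - rank) / total


# Rank figures taken from the listing.
context_pct = share_beaten(17, 577)    # context-window rank
input_pct = share_beaten(303, 577)     # input-price rank

print(f"Context window larger than {context_pct:.0f}% of listed models")
print(f"Input price cheaper than {input_pct:.0f}% of listed models")
```

Under this assumption the results round to 97% and 47%, which matches the figures quoted in the text.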

About Google: Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic..

Capabilities

Tool Use, Function Calling, Vision

Model Information

OpenRouter ID: google/gemini-3.1-flash-lite
Provider: google
Model Family: Gemini 3
Release Date: May 7, 2026
Context Length: 1,048,576 tokens
Max Completion: 65,536 tokens
Status: Active
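The two token limits above interact when sizing a request. The sketch below checks a planned request against both, assuming that prompt and completion tokens share the same context window (a common convention, but not confirmed by this listing). The function name `fits_limits` is illustrative, and token counts would have to come from a tokenizer, which is out of scope here:

```python
CONTEXT_LENGTH = 1_048_576   # tokens, from the listing
MAX_COMPLETION = 65_536      # tokens, from the listing


def fits_limits(prompt_tokens: int, completion_tokens: int) -> bool:
    """Return True if a request stays within both listed limits.

    Assumption: the prompt and the completion together must fit inside
    the context window, and the completion alone must not exceed the
    maximum completion size.
    """
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    within_completion = completion_tokens <= MAX_COMPLETION
    within_context = prompt_tokens + completion_tokens <= CONTEXT_LENGTH
    return within_completion and within_context


print(fits_limits(1_000_000, 4_096))    # large prompt, short answer
print(fits_limits(1_040_000, 10_000))   # prompt plus answer overflow the window
print(fits_limits(1_000, 70_000))       # answer exceeds the completion cap
```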

Pricing

Token Type    Cost per 1M tokens    Cost per 1K tokens
Input         $0.25                 $0.000250
Output        $1.50                 $0.001500
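To turn these rates into a per-request figure, multiply each token count by its rate. A minimal sketch, using only the two listed prices; the function name `request_cost` and the example token counts are illustrative, and any provider-side surcharges or caching discounts are not modeled:

```python
INPUT_USD_PER_M = 0.25    # USD per 1M input tokens, from the listing
OUTPUT_USD_PER_M = 1.50   # USD per 1M output tokens, from the listing


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated cost in USD for one request at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_M
    return input_cost + output_cost


# Hypothetical workload: a 200,000-token document summarized in 2,000 tokens.
print(f"${request_cost(200_000, 2_000):.4f}")

# Filling the whole 1,048,576-token context with input and requesting no output.
print(f"${request_cost(1_048_576, 0):.4f}")
```

The first example comes to about $0.053 ($0.05 of input plus $0.003 of output). Output tokens are six times as expensive as input tokens, so generation-heavy workloads cost more than the input rate alone suggests.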

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

Avg Uptime: 99.1%
Best Latency (TTFT): 530 ms
Best Throughput: 87 tok/s
Active Endpoints: 2/2
Available via: Google, Google AI Studio
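The latency and throughput figures can be combined into a rough lower bound on response time: time to first token plus generation time at the listed rate. This is only a sketch. Both numbers are the best values observed across endpoints, so real requests will usually be slower, and the function name `best_case_seconds` is illustrative:

```python
BEST_TTFT_SECONDS = 0.530      # best time to first token, from the listing
BEST_TOKENS_PER_SECOND = 87.0  # best throughput, from the listing


def best_case_seconds(output_tokens: int) -> float:
    """Optimistic wall-clock estimate for generating output_tokens.

    Assumption: throughput holds steady at the best listed rate for the
    whole response, which is unlikely under real load.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return BEST_TTFT_SECONDS + output_tokens / BEST_TOKENS_PER_SECOND


for tokens in (100, 1_000, 65_536):
    print(f"{tokens:>6} tokens: {best_case_seconds(tokens):.1f} s")
```

Even in this best case, a response that uses the full 65,536-token completion limit would take more than twelve minutes.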

Frequently asked questions about Google: Gemini 3.1 Flash Lite

How much does Google: Gemini 3.1 Flash Lite cost?

Google: Gemini 3.1 Flash Lite costs $0.25 per million input tokens and $1.50 per million output tokens.

What is the context window of Google: Gemini 3.1 Flash Lite?

Google: Gemini 3.1 Flash Lite has a context window of 1,048,576 tokens (1M).

What can Google: Gemini 3.1 Flash Lite do?

Google: Gemini 3.1 Flash Lite accepts text, image, audio, video, and PDF inputs, and supports tool use and function calling.

Who created Google: Gemini 3.1 Flash Lite?

Google: Gemini 3.1 Flash Lite is developed by Google and was released on May 7, 2026.