Google: Gemini 3.1 Flash Lite Preview

Google: Gemini 3.1 Flash Lite Preview

google · Released Mar 3, 2026 Specialist
Intelligence #66 / 525
67.6 Our Score
Speed #6 / 244
337.9 tokens / sec
Input #275 / 525
$0.250 per 1M tokens
Output #327 / 525
$1.50 per 1M tokens
Context #12 / 525
1M tokens

Analysis Summary

Google: Gemini 3.1 Flash Lite Preview sits in the Specialist tier on our leaderboard, ranked #66 of 525 published models on overall intelligence. At $0.250 input and $1.50 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use, function calling, vision, and reasoning.

Editorial notes

Google's Gemini 3.1 Flash Lite Preview offers a massive 1M token context window with full multimodal support (image, audio, video, file) and strong instruction-following at moderate pricing — a versatile and cost-effective option from a trusted provider for content-heavy business workflows.

Assessed April 23, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence6Technical4.8Value7.8Content7
Intelligence 6/10
Technical 4.8/10
Content 7/10
Value 7.8/10

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

33.5 Intelligence Index
30.1 Coding Index
27.8 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 82.2% Graduate-level scientific reasoning
HLE 16.2% Humanity's Last Exam
SciCode 41.9% Scientific computing

Technical

TerminalBench Hard 24.2% Agentic terminal tasks
τ²-Bench 31.3% Conversational agent benchmark

Content

IFBench 77.2% Instruction following
LCR 65.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Google: Gemini 3.1 Flash Lite Preview stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID google/gemini-3.1-flash-lite-preview
Providergoogle
Model FamilyGemini 3
Release Date March 3, 2026
Context Length1,048,576 tokens
Max Completion65,536 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.25 $0.000250
Output $1.50 $0.001500

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

96.4%
Avg Uptime
793ms
Best Latency (TTFT)
95 tok/s
Best Throughput
2/2
Active Endpoints
Available via: Google AI Studio, Google

Leaderboard Categories