Google: Gemini 3.5 Flash

Google: Gemini 3.5 Flash

google · Released May 19, 2026 Professional New
Intelligence #20 / 561
82.0 Our Score
Speed #29 / 260
211.7 tokens / sec
Input #458 / 561
$1.50 per 1M tokens
Output #479 / 561
$9.00 per 1M tokens
Context #17 / 561
1M tokens

Analysis Summary

Google: Gemini 3.5 Flash sits in the Professional tier on our leaderboard, ranked #20 of 561 published models on overall intelligence. At $1.50 input and $9.00 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use, function calling, vision, and reasoning.

Editorial notes

Gemini 3.5 Flash from Google pairs strong coding capability with full multimodal support across text, image, audio, and video, a 1M token context, and aggressive $1.5/$9 pricing for broad business use.

Assessed May 24, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence9.7Technical8.3Value6.8Content8.3
Intelligence 9.7/10
Technical 8.3/10
Content 8.3/10
Value 6.8/10

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

55.3 Intelligence Index
45 Coding Index
68.1 Agentic Index

This model was released recently. Independent benchmark evaluations are typically completed within days of release — these figures are preliminary and are likely to be updated as testing is finalised.

Benchmark Scores

Intelligence

GPQA Diamond 92.2% Graduate-level scientific reasoning
HLE 41% Humanity's Last Exam
SciCode 53.1% Scientific computing

Technical

TerminalBench Hard 40.9% Agentic terminal tasks
τ²-Bench 95.3% Conversational agent benchmark

Content

IFBench 76.3% Instruction following
LCR 69.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Google: Gemini 3.5 Flash stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID google/gemini-3.5-flash
Providergoogle
Model FamilyGemini 3
Release Date May 19, 2026
Context Length1,048,576 tokens
Max Completion65,536 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $1.50 $0.001500
Output $9.00 $0.009000

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.4%
Avg Uptime
2,317ms
Best Latency (TTFT)
137.5 tok/s
Best Throughput
2/2
Active Endpoints
Available via: Google AI Studio, Google

Leaderboard Categories