StepFun: Step 3.5 Flash

StepFun: Step 3.5 Flash

stepfun · Released Jan 29, 2026 Professional
Intelligence #61 / 557
70.5 Our Score
Speed #35 / 257
186.8 tokens / sec
Input #189 / 557
$0.100 per 1M tokens
Output #200 / 557
$0.300 per 1M tokens
Context #98 / 557
262,144 tokens

Analysis Summary

StepFun: Step 3.5 Flash sits in the Professional tier on our leaderboard, ranked #61 of 557 published models on overall intelligence. At $0.100 input and $0.300 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use and function calling.

Editorial notes

Step 3.5 Flash delivers strong reasoning for its price point, with a 262K context window, tool use, and competitive agentic performance at just $0.10 input per million tokens.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence6.8Technical6.8Value8Content6.5
Intelligence 6.8/10
Technical 6.8/10
Content 6.5/10
Value 8/10

Step 3.5 Flash is StepFun's most capable open-source foundation model. Built on a sparse Mixture of Experts (MoE) architecture, it selectively activates only 11B of its 196B parameters per token..

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

38.5 Intelligence Index
34.6 Coding Index
60 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 82.6% Graduate-level scientific reasoning
HLE 22.6% Humanity's Last Exam
SciCode 38.5% Scientific computing

Technical

TerminalBench Hard 32.6% Agentic terminal tasks
τ²-Bench 87.4% Conversational agent benchmark

Content

IFBench 66.5% Instruction following
LCR 54.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does StepFun: Step 3.5 Flash stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID stepfun/step-3.5-flash
Providerstepfun
Release Date January 29, 2026
Context Length262,144 tokens
Max Completion65,536 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.10 $0.000100
Output $0.30 $0.000300

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
282ms
Best Latency (TTFT)
98 tok/s
Best Throughput
3/3
Active Endpoints
Available via: SiliconFlow, DeepInfra, StepFun

Leaderboard Categories