Home > AI Models > StepFun: Step 3.7 Flash

StepFun: Step 3.7 Flash

Name: StepFun: Step 3.7 Flash Review
Item: StepFun: Step 3.7 Flash
Author: Design for Online Editorial

NEWKimi K3in at #9 NEWKAT-Coder-Air V2.5in at #560 NEWKAT-Coder-Pro V2.5in at #568 NEWMuse Spark 1.1in at #392 NEWUncensoredin at #487 NEWGPT-5.6 Terrain at #11 NEWGPT-5.6 Sol Proin at #416 NEWGPT-5.6 Solin at #2

StepFun: Step 3.7 Flash

stepfun · Released May 28, 2026

Intelligence #73 / 612

65.3 our score

Speed #4 / 287

387.2 tok/s

Input Price #275 / 612

$0.200 per 1M tokens

Output Price #345 / 612

$1.15 per 1M tokens

Context #194 / 612

256,000 tokens

Step 3.7 Flash stands out for agentic task completion, scoring well above its coding and general reasoning levels, and comes with tool use, function calling, and vision support across a 256K context window. Its instruction-following and long-context handling are both competent.

This makes it a fit for automation-heavy workflows and multi-step task execution where reliability matters more than raw reasoning depth. It is less suited to complex analytical or creative writing tasks where deeper reasoning is required.

At $0.2 input and $1.15 output per million tokens, it's a cost-efficient pick for teams building agent pipelines rather than premium client-facing content.

Assessed July 19, 2026

Editorial notes

Step 3.7 Flash from StepFun pairs strong agentic reliability with solid coding and tool use at low pricing, though general reasoning trails higher-tier models.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

DFO Verdict

Step 3.7 Flash from StepFun pairs strong agentic reliability with solid coding and tool use at low pricing, though general reasoning trails higher-tier models.

#73 of 612 overall

Benchmark scores

GPQA Diamond 80.9%

HLE 19.9%

SciCode 40%

TerminalBench Hard 35.6%

τ²-Bench 98.5%

IFBench 67.3%

LCR 63.7%

Magenta = intelligence · Ink = technical/agentic · Cyan = content & long-context · Grey = community benchmarks. Data: Artificial Analysis, Hugging Face.

30.3 Intelligence Index·39.6 Coding Index·67.1 Agentic Index

How StepFun: Step 3.7 Flash compares

StepFun: Step 3.7 Flash ranks #91 of 393 AI models we track for overall intelligence, #58 of 157 for coding, #26 of 300 for agentic tasks. Its 256K-token context window is larger than 68% of the models we list. At $0.20 per million input tokens it is cheaper than 55% of comparable models.

Position in the field

Intelligence: smarter than 88% of models #73

Speed: faster than 99% of models #4

Price: cheaper than 55% of models #275

Context: larger than 68% of models #194

worst in fieldmedianbest in field

Price vs frontier peers · $ per 1M tokens

StepFun: Step 3.7 Flash $0.20 in $1.15 out

Anthropic: Claude Fable 5 $10.00 in $50.00 out

Anthropic: Claude Opus 4.8 $5.00 in $25.00 out

Google: Gemini 3.1 Pro Preview $2.00 in $12.00 out

Dark bar = input · light bar = output, scaled to the priciest peer.

Context window vs peers · tokens

Google: Gemini 3.1 Pro Preview 1M

Anthropic: Claude Fable 5 1M

Anthropic: Claude Opus 4.8 1M

StepFun: Step 3.7 Flash 256K

1M tokens ≈ 8 full-length novels or ~2,500 pages of business documents in a single request.

Performance profile

Strongest on value. The pulled-in intelligence corner is the trade-off, and if the shape matters more than the price, this is your model.

Compare shapes side-by-side →

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.20	$0.000200
Output	$1.15	$0.001150

What would StepFun: Step 3.7 Flash cost your business?

Pick the job that looks most like yours, then fine-tune with the sliders. Estimates update live.

A website chatbot handling around 100 customer conversations a day, a few short messages each.

Requests per month 3,000

One request is one message, email, draft or automation call.

Size of each request 1,200 tokens

$0/mo StepFun: Step 3.7 Flash

$0/mo Anthropic: Claude Fable 5

$0/mo MoonshotAI: Kimi K3 · best value

Full calculator with 612 models → Price Calculator

DFO AI AUTOMATION

These numbers get smaller with the right architecture.

We route routine calls to cheap models and save StepFun: Step 3.7 Flash for the hard ones. Most clients cut their estimate by 60-80%.

Talk to our team

About StepFun: Step 3.7 Flash

Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It pairs a 196B-parameter language backbone with a vision encoder for native image and video understanding, activating roughly 11B parameters..

Frequently asked questions about StepFun: Step 3.7 Flash

How much does StepFun: Step 3.7 Flash cost?

StepFun: Step 3.7 Flash costs $0.20 per million input tokens and $1.15 per million output tokens.

What is the context window of StepFun: Step 3.7 Flash?

StepFun: Step 3.7 Flash has a context window of 256,000 tokens (256K).

Is StepFun: Step 3.7 Flash good for coding?

On our coding benchmark index, StepFun: Step 3.7 Flash ranks #58 of 157 models, placing it in the broader range of the field for code generation and debugging.

What can StepFun: Step 3.7 Flash do?

StepFun: Step 3.7 Flash supports image/vision input, tool use, and function calling.

Who created StepFun: Step 3.7 Flash?

StepFun: Step 3.7 Flash is developed by StepFun and was released on May 28, 2026.

Performance profile

Intelligence 4.8

Technical 6

Content 7

Value 8

Reasoning: Yes
Input
Output
Context: 256,000 tokens
Max output: 256,000 tokens
Tokenizer: Other
Released: May 28, 2026

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Model information

Provider stepfun

OpenRouter ID stepfun/step-3.7-flash

Status Active

Capabilities

Tool Use Function Calling Vision

Ranked in

AI Agents Coding Tool Use

Live performance · 30 min refresh

97.5% Avg uptime

4,533ms Best latency

52 tok/s Best throughput

1/3 Active endpoints

External resources View on OpenRouter API access, playground & provider details API Quickstart Sample code and integration guide

Data sourced from the OpenRouter API, Artificial Analysis, the Hugging Face Open LLM Leaderboard and our own internal testing. Scores are editorially curated by our team.

Last updated: July 19, 2026 10:00 am

Issues with our rankings? Contact us

StepFun: Step 3.7 Flash

DFO Verdict

Benchmark scores

How StepFun: Step 3.7 Flash compares

Pricing

What would StepFun: Step 3.7 Flash cost your business?

About StepFun: Step 3.7 Flash

Explore Related Models

Frequently asked questions about StepFun: Step 3.7 Flash

How much does StepFun: Step 3.7 Flash cost?

What is the context window of StepFun: Step 3.7 Flash?

Is StepFun: Step 3.7 Flash good for coding?

What can StepFun: Step 3.7 Flash do?

Who created StepFun: Step 3.7 Flash?