OpenAI: o4 Mini High

OpenAI: o4 Mini High

openai · Released Apr 16, 2025
38
Our Score

OpenAI o4-mini-high is the same model as o4-mini with reasoning_effort set to high. OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning and coding performance across benchmarks like AIME (99.5% with Python) and SWE-bench, outperforming its predecessor o3-mini and even approaching o3 in some domains. Despite its smaller size, o4-mini exhibits high accuracy in STEM tasks, visual problem solving (e.g., MathVista, MMMU), and code editing. It is especially well-suited for high-throughput scenarios where latency or cost is critical. Thanks to its efficient architecture and refined reinforcement learning training, o4-mini can chain tools, generate structured outputs, and solve multi-step tasks with minimal delay—often in under a minute.

$1.10 / 1M Input Price
$4.40 / 1M Output Price
200,000 tokens Context Window
100,000 tokens Max Output

Capabilities

Tool Use Function Calling Vision

Architecture

ModalityText + Image + File → Text
TokenizerGPT

Model Information

OpenRouter ID openai/o4-mini-high
Provideropenai
Release Date April 16, 2025
Context Length200,000 tokens
Max Completion100,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $1.10 $0.001100
Output $4.40 $0.004400

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
6,404ms
Best Latency (TTFT)
125 tok/s
Best Throughput
1/1
Active Endpoints
Available via: OpenAI

Leaderboard Categories