OpenAI: o3 Mini High

OpenAI: o3 Mini High

openai · Released Feb 12, 2025
62
Our Score

OpenAI o3-mini-high is the same model as o3-mini with reasoning_effort set to high. o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and coding. The model features three adjustable reasoning effort levels and supports key developer capabilities including function calling, structured outputs, and streaming, though it does not include vision processing capabilities. The model demonstrates significant improvements over its predecessor, with expert testers preferring its responses 56% of the time and noting a 39% reduction in major errors on complex questions. With medium reasoning effort settings, o3-mini matches the performance of the larger o1 model on challenging reasoning evaluations like AIME and GPQA, while maintaining lower latency and cost.

$1.10 / 1M Input Price
$4.40 / 1M Output Price
200,000 tokens Context Window
100,000 tokens Max Output

Capabilities

Tool Use Function Calling Vision

Architecture

ModalityText + File → Text
TokenizerGPT

Performance Indices

Source: Artificial Analysis

25.2 Intelligence Index
17.3 Coding Index
18.7 Agentic Index

Benchmark Scores

Evaluations

GPQA Diamond 77.3%
Graduate-level scientific reasoning
HLE 12.3%
Humanity's Last Exam
MMLU Pro 80.2%
Multi-task language understanding
LiveCodeBench 73.4%
Live coding evaluation
SciCode 39.8%
Scientific computing
MATH 500 98.5%
Mathematical problem-solving
AIME 86%
Competition mathematics
IFBench 67.1%
Instruction following
LCR 39.3%
Long-context reasoning
TerminalBench Hard 6.1%
Agentic terminal tasks
τ²-Bench 31.3%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID openai/o3-mini-high
Provideropenai
Model Familyo3
Release Date February 12, 2025
Context Length200,000 tokens
Max Completion100,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $1.10 $0.001100
Output $4.40 $0.004400

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

7,655ms
Best Latency (TTFT)
240.5 tok/s
Best Throughput
0/1
Active Endpoints
Available via: OpenAI

Leaderboard Categories