OpenAI: o3

OpenAI: o3

openai · Released Apr 16, 2025 Specialist
Intelligence #52 / 583
68.0 Our Score
Speed #74 / 278
146.9 tokens / sec
Input #490 / 586
$2.00 per 1M tokens
Output #491 / 586
$8.00 per 1M tokens
Context #206 / 586
200,000 tokens

Analysis Summary

OpenAI o3 is a reasoning-focused model from OpenAI with strong performance across coding, mathematics, and multi-step agentic tasks. It supports vision, tool use, and function calling, and its agentic index of 58.9 places it well above most mid-tier models. Instruction following and long-context reasoning are both competitive.

For businesses, o3 is a strong fit for software engineering workflows, complex document analysis, and autonomous agent pipelines where reasoning depth is critical. Its 200K context window is adequate for most professional tasks, though it falls short of the 1M+ windows available on some competitors. The main trade-off is cost: at $2/1M input and $8/1M output, it is expensive for high-volume use.

Teams that need reliable, deep reasoning on hard tasks and can absorb the cost will find o3 a capable workhorse. For budget-conscious deployments or tasks requiring very large context, newer or cheaper alternatives may offer better overall value.

Assessed June 17, 2026

Editorial notes

OpenAI o3 delivers strong reasoning, excellent coding performance, and reliable agentic capability with vision and tool use, though its 200K context and premium pricing limit it relative to newer frontier models.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence5.1Technical6.9Value6.5Content7.5
Intelligence 5.1/10
Technical 6.9/10
Content 7.5/10
Value 6.5/10

How OpenAI: o3 compares

OpenAI: o3 ranks #80 of 382 AI models we track for overall intelligence, #67 of 293 for agentic tasks. Its 200K-token context window is larger than 65% of the models we list. At $2.00 per million input tokens it is cheaper than 16% of comparable models.

About OpenAI: o3

o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

30.4 Intelligence Index
58.9 Agentic Index
88.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 82.7% Graduate-level scientific reasoning
HLE 20% Humanity's Last Exam
MMLU Pro 85.3% Multi-task language understanding
MATH 500 99.2% Mathematical problem-solving
AIME 90.3% Competition mathematics
AIME 2025 88.3% Competition mathematics (2025)
SciCode 41% Scientific computing

Technical

LiveCodeBench 80.8% Live coding evaluation
TerminalBench Hard 37.1% Agentic terminal tasks
τ²-Bench 80.7% Conversational agent benchmark

Content

IFBench 71.4% Instruction following
LCR 69.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does OpenAI: o3 stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID openai/o3
Provideropenai
Model Familyo3
Release Date April 16, 2025
Context Length200,000 tokens
Max Completion100,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $2.00 $0.002000
Output $8.00 $0.008000

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

100%
Avg Uptime
3,907ms
Best Latency (TTFT)
90.5 tok/s
Best Throughput
1/1
Active Endpoints
Available via: OpenAI

Leaderboard Categories

Frequently asked questions about OpenAI: o3

How much does OpenAI: o3 cost?

OpenAI: o3 costs $2.00 per million input tokens and $8.00 per million output tokens.

What is the context window of OpenAI: o3?

OpenAI: o3 has a context window of 200,000 tokens (200K).

What can OpenAI: o3 do?

OpenAI: o3 supports image/vision input, tool use, and function calling.

Who created OpenAI: o3?

OpenAI: o3 is developed by OpenAI and was released on April 16, 2025.