OpenAI: GPT-4.1

openai · Released Apr 14, 2025
Our Score: 55

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and GPT-4.5 across coding (54.6% SWE-bench Verified), instruction compliance (87.4% IFEval), and multimodal understanding benchmarks. It is tuned for precise code diffs, agent reliability, and high recall in large document contexts, making it ideal for agents, IDE tooling, and enterprise knowledge retrieval.

Input Price: $2.00 / 1M tokens
Output Price: $8.00 / 1M tokens
Context Window: 1M tokens
Max Output: 32,768 tokens

Capabilities

Tool Use, Function Calling, Vision

Architecture

Modality: Text + Image + File → Text
Tokenizer: GPT

Performance Indices

Source: Artificial Analysis

Intelligence Index: 26.3
Coding Index: 21.8
Agentic Index: 30.3
Math Index: 34.7

Benchmark Scores

Evaluations

GPQA Diamond (graduate-level scientific reasoning): 66.6%
HLE (Humanity's Last Exam): 4.6%
MMLU Pro (multi-task language understanding): 80.6%
LiveCodeBench (live coding evaluation): 45.7%
SciCode (scientific computing): 38.1%
MATH 500 (mathematical problem-solving): 91.3%
AIME (competition mathematics): 43.7%
AIME 2025 (competition mathematics, 2025): 34.7%
IFBench (instruction following): 43%
LCR (long-context reasoning): 61%
TerminalBench Hard (agentic terminal tasks): 13.6%
τ²-Bench (conversational agent benchmark): 47.1%

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID: openai/gpt-4.1
Provider: openai
Model Family: GPT-4
Release Date: April 14, 2025
Context Length: 1,047,576 tokens
Max Completion: 32,768 tokens
Status: Active
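
The two limits above can be combined into a simple token-budget check. The sketch below assumes that prompt and completion tokens share the 1,047,576-token context length, which is how context windows usually work but is not stated on this page. The function names are hypothetical.

```python
# Limits taken from the Model Information listing.
CONTEXT_LENGTH = 1_047_576   # total tokens per request (assumed: prompt + completion)
MAX_COMPLETION = 32_768      # maximum completion tokens


def max_prompt_tokens(requested_output: int = MAX_COMPLETION) -> int:
    """Largest prompt that still leaves room for `requested_output` tokens."""
    if not 0 <= requested_output <= MAX_COMPLETION:
        raise ValueError("requested_output must be between 0 and MAX_COMPLETION")
    return CONTEXT_LENGTH - requested_output


def fits_context(prompt_tokens: int, requested_output: int) -> bool:
    """True if the request respects both the completion and the context limit."""
    if prompt_tokens < 0 or not 0 <= requested_output <= MAX_COMPLETION:
        return False
    return prompt_tokens + requested_output <= CONTEXT_LENGTH


if __name__ == "__main__":
    print(max_prompt_tokens())               # prompt budget with a full-length completion
    print(fits_context(1_000_000, 32_768))   # large prompt plus full-length completion
```

Under that assumption, reserving the full 32,768-token completion leaves 1,014,808 tokens for the prompt.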

Pricing

Token Type    Cost per 1M tokens    Cost per 1K tokens
Input         $2.00                 $0.002000
Output        $8.00                 $0.008000
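
As a rough illustration of how these rates turn into a per-request cost, the sketch below multiplies token counts by the listed per-1M prices. The function name is hypothetical, and the calculation assumes flat pricing with no caching or batch discounts, since the page lists none.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, taken from the pricing table.
INPUT_PRICE_PER_M = Decimal("2.00")
OUTPUT_PRICE_PER_M = Decimal("8.00")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimated USD cost of one request at the listed flat rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # 100,000 prompt tokens and 2,000 completion tokens.
    print(estimate_cost(100_000, 2_000))
```

Decimal is used instead of float so that small per-token amounts do not pick up binary rounding noise when many requests are summed.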

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

Avg Uptime: 100%
Best Latency (TTFT): 628ms
Best Throughput: 70 tok/s
Active Endpoints: 2/2
Available via: OpenAI, Azure
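
The latency and throughput figures can be combined into a back-of-the-envelope response time: time to first token plus output tokens divided by throughput. The sketch below is only a lower bound, because it uses the best-case values listed above and assumes constant throughput. The function name is hypothetical.

```python
# Best-case figures from the live metrics; real requests will usually be slower.
BEST_TTFT_SECONDS = 0.628          # time to first token
BEST_THROUGHPUT_TOK_PER_S = 70.0   # output tokens per second


def estimate_response_seconds(output_tokens: int) -> float:
    """Optimistic wall-clock time to stream `output_tokens` tokens."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return BEST_TTFT_SECONDS + output_tokens / BEST_THROUGHPUT_TOK_PER_S


if __name__ == "__main__":
    print(f"{estimate_response_seconds(700):.3f} s")
```

At these rates a full 32,768-token completion would take on the order of eight minutes, which is worth keeping in mind when setting client timeouts.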

Leaderboard Categories