OpenAI: GPT-4.1

OpenAI: GPT-4.1

openai · Released Apr 14, 2025 Specialist
Intelligence #92 / 525
59.1 Our Score
Speed #93 / 244
126.9 tokens / sec
Input #439 / 525
$2.00 per 1M tokens
Output #444 / 525
$8.00 per 1M tokens
Context #34 / 525
1M tokens

Analysis Summary

OpenAI: GPT-4.1 sits in the Specialist tier on our leaderboard, ranked #92 of 525 published models on overall intelligence. At $2.00 input and $8.00 output per 1M tokens, it is among the most expensive on the market. It offers an exceptionally large context window suited to long-document workflows and supports tool use, function calling, and vision.

Editorial notes

OpenAI's GPT-4.1 is a well-rounded flagship model with strong coding performance, a near-1M token context window, vision and file input, and reliable tool use — a solid all-round choice for business teams already in the OpenAI ecosystem.

Assessed April 23, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence4.9Technical4.1Value6.8Content6.5
Intelligence 4.9/10
Technical 4.1/10
Content 6.5/10
Value 6.8/10

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and..

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

26.3 Intelligence Index
21.8 Coding Index
30.3 Agentic Index
34.7 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 66.6% Graduate-level scientific reasoning
HLE 4.6% Humanity's Last Exam
MMLU Pro 80.6% Multi-task language understanding
MATH 500 91.3% Mathematical problem-solving
AIME 43.7% Competition mathematics
AIME 2025 34.7% Competition mathematics (2025)
SciCode 38.1% Scientific computing

Technical

LiveCodeBench 45.7% Live coding evaluation
TerminalBench Hard 13.6% Agentic terminal tasks
τ²-Bench 47.1% Conversational agent benchmark

Content

IFBench 43% Instruction following
LCR 61% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does OpenAI: GPT-4.1 stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID openai/gpt-4.1
Provideropenai
Model FamilyGPT-4
Release Date April 14, 2025
Context Length1,047,576 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $2.00 $0.002000
Output $8.00 $0.008000

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
719ms
Best Latency (TTFT)
43 tok/s
Best Throughput
2/2
Active Endpoints
Available via: Azure, OpenAI

Leaderboard Categories