OpenAI: gpt-oss-120b

OpenAI: gpt-oss-120b

openai · Released Aug 5, 2025 Specialist
Intelligence #125 / 556
53.9 Our Score
Speed #12 / 257
282.1 tokens / sec
Input #140 / 557
$0.039 per 1M tokens
Output #165 / 557
$0.180 per 1M tokens
Context #220 / 557
131,072 tokens

Analysis Summary

OpenAI: gpt-oss-120b sits in the Specialist tier on our leaderboard, ranked #125 of 556 published models on overall intelligence. At $0.039 input and $0.180 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports tool use, function calling, and reasoning.

Editorial notes

OpenAI gpt-oss-120b is a compact open-weights model with strong math performance and tool use support, but limited coding and agentic capability keep it suited to lighter general tasks at its very competitive $0.04/1M input price.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence4.6Technical3.5Value8Content5
Intelligence 4.6/10
Technical 3.5/10
Content 5/10
Value 8/10

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized..

120B Parameters

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

24.5 Intelligence Index
15.5 Coding Index
25.2 Agentic Index
66.7 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 67.2% Graduate-level scientific reasoning
HLE 5.2% Humanity's Last Exam
MMLU Pro 77.5% Multi-task language understanding
AIME 2025 66.7% Competition mathematics (2025)
SciCode 36% Scientific computing

Technical

LiveCodeBench 70.7% Live coding evaluation
TerminalBench Hard 5.3% Agentic terminal tasks
τ²-Bench 45% Conversational agent benchmark

Content

IFBench 58.3% Instruction following
LCR 43.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does OpenAI: gpt-oss-120b stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID openai/gpt-oss-120b
Provideropenai
Release Date August 5, 2025
Context Length131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.04 $0.000039
Output $0.18 $0.000180

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

98.1%
Avg Uptime
179ms
Best Latency (TTFT)
886 tok/s
Best Throughput
17/18
Active Endpoints
Available via: DeepInfra, Novita, SiliconFlow, BaseTen, Google, Phala, Parasail, SambaNova +7 more

Leaderboard Categories