OpenAI: gpt-oss-120b

OpenAI: gpt-oss-120b

openai · Released Aug 5, 2025 Specialist
Intelligence #83 / 583
61.3 Our Score
Speed #13 / 278
289.4 tokens / sec
Input #139 / 586
$0.030 per 1M tokens
Output #162 / 586
$0.150 per 1M tokens
Context #242 / 586
131,072 tokens

Analysis Summary

OpenAI's gpt-oss-120b is a 120-billion-parameter open-weight-style model priced at just $0.039 input and $0.18 output per million tokens. Its coding index of 28.6 and livecodebench score of 0.878 are strong, and its agentic index of 44.6 makes it one of the more capable models for multi-step tool use at this price point. Tool use and function calling are supported, and instruction following at 0.690 is above average.

For business use, gpt-oss-120b is a strong fit for coding assistance, automated pipelines, and agentic workflows where cost efficiency is a priority. Its intelligence index of 23.8 limits deep analytical reasoning, but for structured tasks, code generation, and tool-augmented automation, it punches well above its price. The 131K context window is adequate for most document-level tasks.

At under $0.04 input, this is one of the best value coding and agentic models available. Teams running high-volume coding pipelines or cost-sensitive agent workflows will find it a compelling option within the OpenAI ecosystem.

Assessed June 17, 2026

Editorial notes

OpenAI's gpt-oss-120b delivers strong coding and agentic performance at ultra-low cost, with a livecodebench score near the top of the field and solid instruction following for its price tier.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence4.3Technical5.4Value8Content6.5
Intelligence 4.3/10
Technical 5.4/10
Content 6.5/10
Value 8/10

How OpenAI: gpt-oss-120b compares

OpenAI: gpt-oss-120b ranks #115 of 382 AI models we track for overall intelligence, #57 of 111 for coding, #113 of 293 for agentic tasks. Its 131K-token context window is larger than 59% of the models we list. At $0.03 per million input tokens it is cheaper than 76% of comparable models.

About OpenAI: gpt-oss-120b

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized..

120B Parameters

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

23.8 Intelligence Index
30.4 Coding Index
44.6 Agentic Index
93.4 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 78.2% Graduate-level scientific reasoning
HLE 18.5% Humanity's Last Exam
MMLU Pro 80.8% Multi-task language understanding
AIME 2025 93.4% Competition mathematics (2025)
SciCode 38.9% Scientific computing

Technical

LiveCodeBench 87.8% Live coding evaluation
TerminalBench Hard 23.5% Agentic terminal tasks
τ²-Bench 65.8% Conversational agent benchmark

Content

IFBench 69% Instruction following
LCR 50.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does OpenAI: gpt-oss-120b stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID openai/gpt-oss-120b
Provideropenai
Release Date August 5, 2025
Context Length131,072 tokens
Max Completion131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.03 $0.000030
Output $0.15 $0.000150

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

94.5%
Avg Uptime
163ms
Best Latency (TTFT)
703.5 tok/s
Best Throughput
19/20
Active Endpoints
Available via: Mara, DekaLLM, DeepInfra, WandB, Novita, SiliconFlow, DigitalOcean, Google +10 more

Leaderboard Categories

Frequently asked questions about OpenAI: gpt-oss-120b

How much does OpenAI: gpt-oss-120b cost?

OpenAI: gpt-oss-120b costs $0.03 per million input tokens and $0.15 per million output tokens.

What is the context window of OpenAI: gpt-oss-120b?

OpenAI: gpt-oss-120b has a context window of 131,072 tokens (131K).

Is OpenAI: gpt-oss-120b good for coding?

On our coding benchmark index, OpenAI: gpt-oss-120b ranks #57 of 111 models, placing it in the broader range of the field for code generation and debugging.

What can OpenAI: gpt-oss-120b do?

OpenAI: gpt-oss-120b supports tool use and function calling.

Who created OpenAI: gpt-oss-120b?

OpenAI: gpt-oss-120b is developed by OpenAI and was released on August 5, 2025.