Home > AI Models > OpenAI: gpt-oss-120b

OpenAI: gpt-oss-120b

Name: OpenAI: gpt-oss-120b Review
Item: OpenAI: gpt-oss-120b
Author: Design for Online Editorial

OpenAI: gpt-oss-120b

openai · Released Aug 5, 2025 Specialist

Intelligence #83 / 583

61.3 Our Score

Speed #13 / 278

289.4 tokens / sec

Input #139 / 586

$0.030 per 1M tokens

Output #162 / 586

$0.150 per 1M tokens

Context #242 / 586

131,072 tokens

OpenAI's gpt-oss-120b is a 120-billion-parameter open-weight-style model priced at just $0.039 input and $0.18 output per million tokens. Its coding index of 28.6 and livecodebench score of 0.878 are strong, and its agentic index of 44.6 makes it one of the more capable models for multi-step tool use at this price point. Tool use and function calling are supported, and instruction following at 0.690 is above average.

For business use, gpt-oss-120b is a strong fit for coding assistance, automated pipelines, and agentic workflows where cost efficiency is a priority. Its intelligence index of 23.8 limits deep analytical reasoning, but for structured tasks, code generation, and tool-augmented automation, it punches well above its price. The 131K context window is adequate for most document-level tasks.

At under $0.04 input, this is one of the best value coding and agentic models available. Teams running high-volume coding pipelines or cost-sensitive agent workflows will find it a compelling option within the OpenAI ecosystem.

Assessed June 17, 2026

Editorial notes

OpenAI's gpt-oss-120b delivers strong coding and agentic performance at ultra-low cost, with a livecodebench score near the top of the field and solid instruction following for its price tier.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: Yes
Input
Output
Context: 131,072 tokens
Max output: 131,072 tokens
Tokenizer: GPT
Released: Aug 5, 2025

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

How OpenAI: gpt-oss-120b compares

OpenAI: gpt-oss-120b ranks #115 of 382 AI models we track for overall intelligence, #57 of 111 for coding, #113 of 293 for agentic tasks. Its 131K-token context window is larger than 59% of the models we list. At $0.03 per million input tokens it is cheaper than 76% of comparable models.

About OpenAI: gpt-oss-120b

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized..

120B Parameters

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

23.8 Intelligence Index

30.4 Coding Index

44.6 Agentic Index

93.4 Math Index

Benchmark Scores

GPQA Diamond 78.2% Graduate-level scientific reasoning

HLE 18.5% Humanity's Last Exam

MMLU Pro 80.8% Multi-task language understanding

AIME 2025 93.4% Competition mathematics (2025)

SciCode 38.9% Scientific computing

LiveCodeBench 87.8% Live coding evaluation

TerminalBench Hard 23.5% Agentic terminal tasks

τ²-Bench 65.8% Conversational agent benchmark

IFBench 69% Instruction following

LCR 50.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does OpenAI: gpt-oss-120b stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID	`openai/gpt-oss-120b`
Provider	openai
Release Date	August 5, 2025
Context Length	131,072 tokens
Max Completion	131,072 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.03	$0.000030
Output	$0.15	$0.000150

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

94.5%

Avg Uptime

163ms

Best Latency (TTFT)

703.5 tok/s

Best Throughput

19/20

Active Endpoints

Available via: Mara, DekaLLM, DeepInfra, WandB, Novita, SiliconFlow, DigitalOcean, Google +10 more

Leaderboard Categories

Coding SEO Tool Use

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

Frequently asked questions about OpenAI: gpt-oss-120b

How much does OpenAI: gpt-oss-120b cost?

OpenAI: gpt-oss-120b costs $0.03 per million input tokens and $0.15 per million output tokens.

What is the context window of OpenAI: gpt-oss-120b?

OpenAI: gpt-oss-120b has a context window of 131,072 tokens (131K).

Is OpenAI: gpt-oss-120b good for coding?

On our coding benchmark index, OpenAI: gpt-oss-120b ranks #57 of 111 models, placing it in the broader range of the field for code generation and debugging.

What can OpenAI: gpt-oss-120b do?

OpenAI: gpt-oss-120b supports tool use and function calling.

Who created OpenAI: gpt-oss-120b?

OpenAI: gpt-oss-120b is developed by OpenAI and was released on August 5, 2025.

OpenAI: gpt-oss-120b

OpenAI: gpt-oss-120b

Analysis Summary

Performance Profile

How OpenAI: gpt-oss-120b compares

About OpenAI: gpt-oss-120b

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Frequently asked questions about OpenAI: gpt-oss-120b

How much does OpenAI: gpt-oss-120b cost?

What is the context window of OpenAI: gpt-oss-120b?

Is OpenAI: gpt-oss-120b good for coding?

What can OpenAI: gpt-oss-120b do?

Who created OpenAI: gpt-oss-120b?

OpenAI: gpt-oss-120b

Performance Profile

How OpenAI: gpt-oss-120b compares

About OpenAI: gpt-oss-120b

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Explore Related Models

Frequently asked questions about OpenAI: gpt-oss-120b

How much does OpenAI: gpt-oss-120b cost?

What is the context window of OpenAI: gpt-oss-120b?

Is OpenAI: gpt-oss-120b good for coding?

What can OpenAI: gpt-oss-120b do?

Who created OpenAI: gpt-oss-120b?