OpenAI: gpt-oss-120b

OpenAI: gpt-oss-120b

openai · Released Aug 5, 2025
67
Our Score

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized to run on a single H100 GPU with native MXFP4 quantization. The model supports configurable reasoning depth, full chain-of-thought access, and native tool use, including function calling, browsing, and structured output generation.

$0.04 / 1M Input Price
$0.19 / 1M Output Price
131,072 tokens Context Window
120B Parameters

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerGPT
Parameters120B

Performance Indices

Source: Artificial Analysis

33.3 Intelligence Index
28.6 Coding Index
44.7 Agentic Index
93.4 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 78.2%
Graduate-level scientific reasoning
HLE 18.5%
Humanity's Last Exam
MMLU Pro 80.8%
Multi-task language understanding
LiveCodeBench 87.8%
Live coding evaluation
SciCode 38.9%
Scientific computing
AIME 2025 93.4%
Competition mathematics (2025)
IFBench 69%
Instruction following
LCR 50.7%
Long-context reasoning
TerminalBench Hard 23.5%
Agentic terminal tasks
τ²-Bench 65.8%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID openai/gpt-oss-120b
Provideropenai
Release Date August 5, 2025
Context Length131,072 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.04 $0.000039
Output $0.19 $0.000190

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.7%
Avg Uptime
120ms
Best Latency (TTFT)
365 tok/s
Best Throughput
18/19
Active Endpoints
Available via: DeepInfra, Novita, Chutes, SiliconFlow, Clarifai, Google, AtlasCloud, Phala +10 more

Leaderboard Categories