OpenAI: gpt-oss-120b (free)

OpenAI: gpt-oss-120b (free)

openai · Released Aug 5, 2025 Specialist
Intelligence #134 / 525
50.0 Our Score
Speed #8 / 244
317.7 tokens / sec
Input
Not priced
Output
Not priced
Context #185 / 525
131,072 tokens

Analysis Summary

OpenAI: gpt-oss-120b (free) sits in the Specialist tier on our leaderboard, ranked #134 of 525 published models on overall intelligence. At $0.000 input and $0.000 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports tool use, function calling, and reasoning.

Editorial notes

OpenAI's gpt-oss-120b (free) offers impressive coding and math benchmarks at no cost, with strong reasoning scores that make it one of the best free-tier options currently available from a major provider.

Assessed April 23, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence6.2Technical6.2Value0Content0
Intelligence 6.2/10
Technical 6.2/10
Content 0/10
Value 0/10

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized..

120B Parameters

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

33.3 Intelligence Index
28.6 Coding Index
93.4 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 78.2% Graduate-level scientific reasoning
HLE 18.5% Humanity's Last Exam
MMLU Pro 80.8% Multi-task language understanding
SciCode 38.9% Scientific computing

Technical

LiveCodeBench 87.8% Live coding evaluation

Benchmark data from Artificial Analysis and Hugging Face

How does OpenAI: gpt-oss-120b (free) stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID openai/gpt-oss-120b:free
Provideropenai
Release Date August 5, 2025
Context Length131,072 tokens
Max Completion131,072 tokens
Status Active

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
780ms
Best Latency (TTFT)
22 tok/s
Best Throughput
1/1
Active Endpoints
Available via: OpenInference

Leaderboard Categories