OpenAI: gpt-oss-20b (free)

OpenAI: gpt-oss-20b (free)

openai · Released Aug 5, 2025
44
Our Score

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for lower-latency inference and deployability on consumer or single-GPU hardware. The model is trained in OpenAI’s Harmony response format and supports reasoning level configuration, fine-tuning, and agentic capabilities including function calling, tool use, and structured outputs.

131,072 tokens Context Window
131,072 tokens Max Output
20B Parameters

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerGPT
Parameters20B

Performance Indices

Source: Artificial Analysis

24.5 Intelligence Index
18.5 Coding Index
89.3 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 68.8%
Graduate-level scientific reasoning
HLE 9.8%
Humanity's Last Exam
MMLU Pro 74.8%
Multi-task language understanding
LiveCodeBench 77.7%
Live coding evaluation
SciCode 34.4%
Scientific computing

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID openai/gpt-oss-20b:free
Provideropenai
Release Date August 5, 2025
Context Length131,072 tokens
Max Completion131,072 tokens
Status Active

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

544ms
Best Latency (TTFT)
82 tok/s
Best Throughput
0/1
Active Endpoints
Available via: OpenInference