AllenAI: Olmo 3.1 32B Think

AllenAI: Olmo 3.1 32B Think

allenai · Released Dec 16, 2025
35
Our Score

Olmo 3.1 32B Think is a large-scale, 32-billion-parameter model designed for deep reasoning, complex multi-step logic, and advanced instruction following. Building on the Olmo 3 series, version 3.1 delivers refined reasoning behavior and stronger performance across demanding evaluations and nuanced conversational tasks. Developed by Ai2 under the Apache 2.0 license, Olmo 3.1 32B Think continues the Olmo initiative’s commitment to openness, providing full transparency across model weights, code, and training methodology.

$0.15 / 1M Input Price
$0.50 / 1M Output Price
65,536 tokens Context Window
65,536 tokens Max Output
32B Parameters

Architecture

ModalityText → Text
TokenizerOther
Parameters32B

Performance Indices

Source: Artificial Analysis

13.9 Intelligence Index
9.8 Coding Index
77.3 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 59.1%
Graduate-level scientific reasoning
HLE 6%
Humanity's Last Exam
MMLU Pro 76.3%
Multi-task language understanding
LiveCodeBench 69.5%
Live coding evaluation
SciCode 29.3%
Scientific computing
AIME 2025 77.3%
Competition mathematics (2025)
IFBench 66%
Instruction following

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID allenai/olmo-3.1-32b-think
Providerallenai
Release Date December 16, 2025
Context Length65,536 tokens
Max Completion65,536 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.15 $0.000150
Output $0.50 $0.000500

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

364ms
Best Latency (TTFT)
99.5 tok/s
Best Throughput
0/1
Active Endpoints
Available via: Parasail