AllenAI: Olmo 3.1 32B Think

AllenAI: Olmo 3.1 32B Think

allenai · Released Dec 16, 2025 Efficient
Intelligence #193 / 544
42.1 Our Score
Speed #122 / 252
105.2 tokens / sec
Input #231 / 544
$0.150 per 1M tokens
Output #239 / 544
$0.500 per 1M tokens
Context #334 / 544
65,536 tokens

Analysis Summary

AllenAI: Olmo 3.1 32B Think sits in the Efficient tier on our leaderboard, ranked #193 of 544 published models on overall intelligence. At $0.150 input and $0.500 output per 1M tokens, it is among the most expensive on the market. It offers a mid-sized context window.

Editorial notes

Olmo 3.1 32B Think from AllenAI shows strong instruction following and math scores but a low intelligence index and no agentic data, limiting its use to narrow reasoning or academic tasks.

Assessed May 5, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence3.2Technical2.8Value7.5Content3.5
Intelligence 3.2/10
Technical 2.8/10
Content 3.5/10
Value 7.5/10

Olmo 3.1 32B Think is a large-scale, 32-billion-parameter model designed for deep reasoning, complex multi-step logic, and advanced instruction following. Building on the Olmo 3 series, version 3.1 delivers refined reasoning behavior and stronger performance across demanding evaluations and nuanced conversational tasks. Developed by Ai2 under the Apache 2.0 license, Olmo 3.1 32B Think continues the Olmo initiative’s commitment to openness, providing full transparency across model weights, code, and training methodology.

32B Parameters

Performance Indices

Source: Artificial Analysis

13.9 Intelligence Index
9.8 Coding Index
77.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 59.1% Graduate-level scientific reasoning
HLE 6% Humanity's Last Exam
MMLU Pro 76.3% Multi-task language understanding
AIME 2025 77.3% Competition mathematics (2025)
SciCode 29.3% Scientific computing

Technical

LiveCodeBench 69.5% Live coding evaluation

Content

IFBench 66% Instruction following

Benchmark data from Artificial Analysis and Hugging Face

How does AllenAI: Olmo 3.1 32B Think stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID allenai/olmo-3.1-32b-think
Providerallenai
Release Date December 16, 2025
Context Length65,536 tokens
Max Completion65,536 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.15 $0.000150
Output $0.50 $0.000500