Granite 4.0 H Small

Granite 4.0 H Small

IBM · Released Sep 22, 2025 Efficient
Intelligence #252 / 525
32.1 Our Score
Speed #2 / 244
484.3 tokens / sec
Input #145 / 525
$0.060 per 1M tokens
Output #178 / 525
$0.250 per 1M tokens
Context
Not reported

Analysis Summary

Granite 4.0 H Small sits in the Efficient tier on our leaderboard, ranked #252 of 525 published models on overall intelligence. At $0.060 input and $0.250 output per 1M tokens, it is among the most expensive on the market.

Editorial notes

Granite 4.0 H Small from IBM is a small, low-cost model with limited reasoning and coding capability; its $0.06 input pricing offers value but performance is too constrained for most business tasks.

Assessed April 26, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence2.5Technical1.4Value7Content2
Intelligence 2.5/10
Technical 1.4/10
Content 2/10
Value 7/10

Performance Indices

Source: Artificial Analysis

10.8 Intelligence Index
8.5 Coding Index
9.8 Agentic Index
13.7 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 41.6% Graduate-level scientific reasoning
HLE 3.7% Humanity's Last Exam
MMLU Pro 62.4% Multi-task language understanding
AIME 2025 13.7% Competition mathematics (2025)
SciCode 20.9% Scientific computing

Technical

LiveCodeBench 25.1% Live coding evaluation
TerminalBench Hard 2.3% Agentic terminal tasks
τ²-Bench 17.3% Conversational agent benchmark

Content

IFBench 31.5% Instruction following
LCR 9% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Granite 4.0 H Small stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

ProviderIBM
Release Date September 22, 2025
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.06 $0.000060
Output $0.25 $0.000250