IBM: Granite 4.0 Micro

IBM: Granite 4.0 Micro

ibm-granite · Released Oct 20, 2025 Efficient
Intelligence #262 / 557
32.3 Our Score
AA Index #344 / 368
7.7 Artificial Analysis
Input #126 / 560
$0.017 per 1M tokens
Output #147 / 560
$0.112 per 1M tokens
Context #313 / 560
131,000 tokens

Analysis Summary

IBM: Granite 4.0 Micro sits in the Efficient tier on our leaderboard, ranked #262 of 557 published models on overall intelligence. At $0.017 input and $0.112 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window.

Editorial notes

IBM Granite 4.0 Micro is a very small model with low scores across all benchmarks; ultra-low pricing suits edge or on-device use cases, but capability is insufficient for most business workflows.

Assessed May 17, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence1.9Technical0.9Value8Content2.5
Intelligence 1.9/10
Technical 0.9/10
Content 2.5/10
Value 8/10

Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the latest in a series of models released by IBM. They are fine-tuned for long..

Performance Indices

Source: Artificial Analysis

7.7 Intelligence Index
5 Coding Index
7 Agentic Index
6 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 33.6% Graduate-level scientific reasoning
HLE 5.1% Humanity's Last Exam
MMLU Pro 44.7% Multi-task language understanding
AIME 2025 6% Competition mathematics (2025)
SciCode 11.9% Scientific computing

Technical

LiveCodeBench 18% Live coding evaluation
TerminalBench Hard 1.5% Agentic terminal tasks
τ²-Bench 12.6% Conversational agent benchmark

Content

IFBench 24.8% Instruction following
LCR 4% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does IBM: Granite 4.0 Micro stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID ibm-granite/granite-4.0-h-micro
Provideribm-granite
Release Date October 20, 2025
Context Length131,000 tokens
Max Completion131,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.02 $0.000017
Output $0.11 $0.000112

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
484ms
Best Latency (TTFT)
30 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Cloudflare