IBM: Granite 4.0 Micro

IBM: Granite 4.0 Micro

ibm-granite · Released Oct 20, 2025 Professional
Intelligence #14 / 590
82.0 Our Score
AA Index #362 / 385
2.4 Artificial Analysis
Input #134 / 592
$0.017 per 1M tokens
Output #154 / 592
$0.112 per 1M tokens
Context #336 / 592
131,000 tokens

Analysis Summary

IBM Granite 4.0 Micro is a compact model from IBM's Granite family, priced at $0.017/1M input and $0.112/1M output, making it one of the most affordable options in the market. Its intelligence index of 2.4 and GPQA of 0.336 place it at the lower end of the capability spectrum, with limited reasoning depth. The math index of 6 and livecodebench of 0.18 confirm it is not suited to analytical or coding-heavy tasks.

For businesses, it is best reserved for high-volume, low-complexity workloads: simple text classification, keyword extraction, basic summarisation, or routing tasks where cost per call is the dominant concern. The 131K context window is adequate for document-level passes. The absence of tool use or function calling limits its utility in structured or agentic pipelines.

At its price point, Granite 4.0 Micro is a viable option for bulk processing tasks that do not require reasoning depth. Teams should benchmark it carefully against their specific task requirements before committing to production use.

Assessed June 30, 2026

Editorial notes

IBM's Granite 4.0 Micro is an ultra-low-cost model at $0.017/1M input with a 131K context window, suited to simple classification or extraction tasks where cost is the overriding priority.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence1.1Technical0.9Value8Content1.6
Intelligence 1.1/10
Technical 0.9/10
Content 1.6/10
Value 8/10

How IBM: Granite 4.0 Micro compares

IBM: Granite 4.0 Micro ranks #362 of 385 AI models we track for overall intelligence, #280 of 293 for agentic tasks. Its 131K-token context window is larger than 43% of the models we list. At $0.02 per million input tokens it is cheaper than 77% of comparable models.

About IBM: Granite 4.0 Micro

Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the latest in a series of models released by IBM. They are fine-tuned for long..

Performance Indices

Source: Artificial Analysis

2.4 Intelligence Index
7 Agentic Index
6 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 33.6% Graduate-level scientific reasoning
HLE 5.1% Humanity's Last Exam
MMLU Pro 44.7% Multi-task language understanding
AIME 2025 6% Competition mathematics (2025)
SciCode 11.9% Scientific computing

Technical

LiveCodeBench 18% Live coding evaluation
TerminalBench Hard 1.5% Agentic terminal tasks
τ²-Bench 12.6% Conversational agent benchmark

Content

IFBench 24.8% Instruction following
LCR 4% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does IBM: Granite 4.0 Micro stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID ibm-granite/granite-4.0-h-micro
Provideribm-granite
Release Date October 20, 2025
Context Length131,000 tokens
Max Completion131,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.02 $0.000017
Output $0.11 $0.000112

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

100%
Avg Uptime
598ms
Best Latency (TTFT)
27 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Cloudflare

Frequently asked questions about IBM: Granite 4.0 Micro

How much does IBM: Granite 4.0 Micro cost?

IBM: Granite 4.0 Micro costs $0.02 per million input tokens and $0.11 per million output tokens.

What is the context window of IBM: Granite 4.0 Micro?

IBM: Granite 4.0 Micro has a context window of 131,000 tokens (131K).

Who created IBM: Granite 4.0 Micro?

IBM: Granite 4.0 Micro is developed by IBM and was released on October 20, 2025.