IBM: Granite 4.0 Micro
Analysis Summary
IBM: Granite 4.0 Micro sits in the Efficient tier on our leaderboard, ranked #262 of 557 published models on overall intelligence. At $0.017 input and $0.112 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window.
Editorial notes
IBM Granite 4.0 Micro is a very small model with low scores across all benchmarks; ultra-low pricing suits edge or on-device use cases, but capability is insufficient for most business workflows.
Assessed May 17, 2026
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the latest in a series of models released by IBM. They are fine-tuned for long..
Performance Indices
Source: Artificial Analysis
Benchmark Scores
Intelligence
Technical
Content
Benchmark data from Artificial Analysis and Hugging Face
How does IBM: Granite 4.0 Micro stack up?
Compare side-by-side with other efficient models.
Model Information
| OpenRouter ID |
ibm-granite/granite-4.0-h-micro
|
| Provider | ibm-granite |
| Release Date | October 20, 2025 |
| Context Length | 131,000 tokens |
| Max Completion | 131,000 tokens |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.02 | $0.000017 |
| Output | $0.11 | $0.000112 |
Live Performance
Live endpoint metrics — refreshed every 30 minutes.
External Resources
Explore Related Models
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: May 20, 2026 8:38 pm