IBM: Granite 4.0 Micro
Analysis Summary
IBM Granite 4.0 Micro is a compact model from IBM's Granite family, priced at $0.017/1M input and $0.112/1M output, making it one of the most affordable options in the market. Its intelligence index of 2.4 and GPQA of 0.336 place it at the lower end of the capability spectrum, with limited reasoning depth. The math index of 6 and livecodebench of 0.18 confirm it is not suited to analytical or coding-heavy tasks.
For businesses, it is best reserved for high-volume, low-complexity workloads: simple text classification, keyword extraction, basic summarisation, or routing tasks where cost per call is the dominant concern. The 131K context window is adequate for document-level passes. The absence of tool use or function calling limits its utility in structured or agentic pipelines.
At its price point, Granite 4.0 Micro is a viable option for bulk processing tasks that do not require reasoning depth. Teams should benchmark it carefully against their specific task requirements before committing to production use.
Assessed June 30, 2026
Editorial notes
IBM's Granite 4.0 Micro is an ultra-low-cost model at $0.017/1M input with a 131K context window, suited to simple classification or extraction tasks where cost is the overriding priority.
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
How IBM: Granite 4.0 Micro compares
IBM: Granite 4.0 Micro ranks #362 of 385 AI models we track for overall intelligence, #280 of 293 for agentic tasks. Its 131K-token context window is larger than 43% of the models we list. At $0.02 per million input tokens it is cheaper than 77% of comparable models.
About IBM: Granite 4.0 Micro
Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the latest in a series of models released by IBM. They are fine-tuned for long..
Benchmark Scores
Intelligence
Technical
Content
Benchmark data from Artificial Analysis and Hugging Face
How does IBM: Granite 4.0 Micro stack up?
Compare side-by-side with other professional models.
Model Information
| OpenRouter ID |
ibm-granite/granite-4.0-h-micro
|
| Provider | ibm-granite |
| Release Date | October 20, 2025 |
| Context Length | 131,000 tokens |
| Max Completion | 131,000 tokens |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.02 | $0.000017 |
| Output | $0.11 | $0.000112 |
Live Performance
Live endpoint metrics, refreshed every 30 minutes.
External Resources
Explore Related Models
Frequently asked questions about IBM: Granite 4.0 Micro
How much does IBM: Granite 4.0 Micro cost?
IBM: Granite 4.0 Micro costs $0.02 per million input tokens and $0.11 per million output tokens.
What is the context window of IBM: Granite 4.0 Micro?
IBM: Granite 4.0 Micro has a context window of 131,000 tokens (131K).
Who created IBM: Granite 4.0 Micro?
IBM: Granite 4.0 Micro is developed by IBM and was released on October 20, 2025.
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: July 2, 2026 8:38 pm