IBM: Granite 4.1 8B
Analysis Summary
IBM: Granite 4.1 8B sits in the Efficient tier on our leaderboard, ranked #222 of 557 published models on overall intelligence. At $0.050 input and $0.100 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports tool use and function calling.
Editorial notes
IBM Granite 4.1 8B is a low-cost compact model with tool use, but benchmark scores are very limited across reasoning and coding, restricting it to simple automation tasks.
Assessed May 14, 2026
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks..
Capabilities
Performance Indices
Source: Artificial Analysis
Benchmark Scores
Intelligence
Technical
Content
Benchmark data from Artificial Analysis and Hugging Face
How does IBM: Granite 4.1 8B stack up?
Compare side-by-side with other efficient models.
Model Information
| OpenRouter ID |
ibm-granite/granite-4.1-8b
|
| Provider | ibm-granite |
| Release Date | April 30, 2026 |
| Context Length | 131,072 tokens |
| Max Completion | 131,072 tokens |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.05 | $0.000050 |
| Output | $0.10 | $0.000100 |
Live Performance
Live endpoint metrics — refreshed every 30 minutes.
Leaderboard Categories
External Resources
Explore Related Models
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: May 20, 2026 8:38 pm