Granite 4.1 8B Pricing
Fast · IBM · 131K tokens context
Granite 4.1 8B from IBM costs $0.050 per 1 million input tokens and $0.100 per 1 million output tokens as of July 2026 (live OpenRouter data). The model supports a 131,072-token context window (approximately 98,304 words) with a maximum output of 131,072 tokens. At the input rate, a 1,000-token request costs $0.00005 and a 10,000-token request costs $0.0005.
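The per-request arithmetic can be sketched in a few lines of Python. This is a minimal illustration, not an official calculator: the function name `request_cost` and the constants are hypothetical, and `Decimal` is used only so that sub-cent amounts are not rounded away to $0.0000.

```python
from decimal import Decimal

# Prices quoted above, in USD per 1 million tokens.
INPUT_PRICE_PER_M = Decimal("0.050")
OUTPUT_PRICE_PER_M = Decimal("0.100")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int = 0) -> Decimal:
    """Return the USD cost of one request at the listed per-token rates.

    Hypothetical helper for illustration; not part of any provider SDK.
    """
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M / TOKENS_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M / TOKENS_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Input-only cost of the two request sizes mentioned in the text.
    print(request_cost(1_000))
    print(request_cost(10_000))
    # A request with both input and output tokens.
    print(request_cost(100_000, 30_000))
```

Passing both token counts gives the total for a request; leaving `output_tokens` at its default isolates the input charge.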
| Specification | Value |
|---|---|
| Input price | $0.050 / 1M tokens |
| Output price | $0.100 / 1M tokens |
| Output / input ratio | 2.0× |
| Context window | 131,072 tokens (~98,304 words) |
| Maximum output | 131,072 tokens |
| Cost per 1K tokens (input) | $0.00005 |
| Tier | Fast |
| Last verified |
Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family.
Cost Examples
| Request Type | Input Tokens | Input Cost | Output Cost |
|---|---|---|---|
| 1,000 word article | 1,333 | $0.0000667 | $0.00004 |
| 10-page document (2,500 words) | 3,333 | $0.000167 | $0.0001 |
| 1,000 lines of code | 5,000 | $0.00025 | $0.00015 |
| 100K token document | 100,000 | $0.005 | $0.003 |
Output costs assume an output length equal to 30% of the input token count. Use the calculator for exact figures.
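The rows in the table above can be approximated with the sketch below, which applies the 30% output assumption to each input token count. The names `example_costs` and `OUTPUT_RATIO` are hypothetical, and the results are unrounded, so they may differ from the table in the last displayed digit.

```python
from decimal import Decimal

INPUT_PRICE_PER_M = Decimal("0.050")   # USD per 1M input tokens
OUTPUT_PRICE_PER_M = Decimal("0.100")  # USD per 1M output tokens
OUTPUT_RATIO = Decimal("0.30")         # assumed output tokens per input token


def example_costs(input_tokens: int) -> tuple[Decimal, Decimal]:
    """Return (input_cost, output_cost) in USD for one example row.

    Output length is assumed to be 30% of the input token count,
    matching the note under the table. Hypothetical helper.
    """
    output_tokens = Decimal(input_tokens) * OUTPUT_RATIO
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M / 1_000_000
    output_cost = output_tokens * OUTPUT_PRICE_PER_M / 1_000_000
    return input_cost, output_cost


if __name__ == "__main__":
    rows = {
        "1,000 word article": 1_333,
        "10-page document (2,500 words)": 3_333,
        "1,000 lines of code": 5_000,
        "100K token document": 100_000,
    }
    for label, tokens in rows.items():
        input_cost, output_cost = example_costs(tokens)
        print(f"{label}: input ${input_cost}, output ${output_cost}")
```

Changing `OUTPUT_RATIO` is the simplest way to model workloads whose responses are longer or shorter than the 30% assumption.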
Strengths
- ✓ Extremely cheap at $0.050/1M input tokens
- ✓ Low latency for high-throughput workloads
- ✓ Cost-effective at scale
Limitations
- – Less capable than flagship models on complex reasoning
- – Quality and availability can vary by hosting provider
Best Use Cases
Calculate Granite 4.1 8B Costs
Use the TokenRate calculator to convert any budget, token count, or text into exact Granite 4.1 8B costs — and compare across all models.