Llama 3 70B Instruct Pricing
FlagshipMeta · 8K tokens context
Llama 3 70B Instruct from Meta costs $0.510 per 1 million input tokens and $0.740 per 1 million output tokens as of June 2026 (live OpenRouter data). The model supports a 8,192-token context window (approximately 6,144 words) with a 8K-token maximum output. A typical 1,000-token request costs $0.0005 in input charges; a 10,000-token request costs $0.0051.
| Input price | $0.510 / 1M tokens |
|---|---|
| Output price | $0.740 / 1M tokens |
| Output / input ratio | 1.5× |
| Context window | 8,192 tokens (~6,144 words) |
| Maximum output | 8,000 tokens |
| Cost per 1K tokens (input) | $0.0005 |
| Tier | Flagship |
| Last verified |
Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 70B instruct-tuned version was optimized for high quality dialogue usecases.
Input Price
$0.510
per 1 million tokens
Output Price
$0.740
per 1 million tokens
Context Window
8K tokens
max 8K output
Cost Examples
| Request Type | Tokens | Input Cost | Output Cost |
|---|---|---|---|
| 1,000 word article | 1,333 | $0.00068 | $0.000296 |
| 10-page document (2,500 words) | 3,333 | $0.0017 | $0.00074 |
| 1,000 lines of code | 5,000 | $0.00255 | $0.00111 |
| 100K token document | 100,000 | $0.051 | $0.0222 |
Output cost estimated at 30% of input token count. Use the calculator for exact figures.
Strengths
- ✓Affordable at $0.51/1M input tokens
- ✓Frontier-class quality on complex tasks
- ✓Strong general-purpose performance
Limitations
- –Smaller 8K-token context limits long-document use
- –Quality and availability can vary by hosting provider
Best Use Cases
Calculate Llama 3 70B Instruct Costs
Use the TokenRate calculator to convert any budget, token count, or text into exact Llama 3 70B Instruct costs — and compare across all models.
Open Calculator →Llama 3 70B Instruct — FAQ
Related Models
Related Guides