TokenRate

Llama 3 8B Instruct Pricing

Fast

Meta · 8K tokens context

Llama 3 8B Instruct from Meta costs $0.140 per 1 million input tokens and $0.140 per 1 million output tokens as of June 2026 (live OpenRouter data). The model supports a 8,192-token context window (approximately 6,144 words) with a 8K-token maximum output. A typical 1,000-token request costs $0.0001 in input charges; a 10,000-token request costs $0.0014.

Llama 3 8B Instruct pricing and capability summary
Input price$0.140 / 1M tokens
Output price$0.140 / 1M tokens
Output / input ratio1.0×
Context window8,192 tokens (~6,144 words)
Maximum output8,192 tokens
Cost per 1K tokens (input)$0.0001
TierFast
Last verified

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high quality dialogue usecases.

Live pricing from OpenRouter

Input Price

$0.140

per 1 million tokens

Output Price

$0.140

per 1 million tokens

Context Window

8K tokens

max 8K output

Cost Examples

Request TypeTokensInput CostOutput Cost
1,000 word article1,333$0.000187$0.000056
10-page document (2,500 words)3,333$0.000467$0.00014
1,000 lines of code5,000$0.0007$0.00021
100K token document100,000$0.014$0.0042

Output cost estimated at 30% of input token count. Use the calculator for exact figures.

Strengths

  • Extremely cheap at $0.140/1M input tokens
  • Low latency for high-throughput workloads
  • Cost-effective at scale

Limitations

  • Less capable than flagship models on complex reasoning
  • Smaller 8K-token context limits long-document use

Best Use Cases

High-volume classification
Text extraction and summarization
Simple chat and Q&A
Cost-sensitive pipelines

Calculate Llama 3 8B Instruct Costs

Use the TokenRate calculator to convert any budget, token count, or text into exact Llama 3 8B Instruct costs — and compare across all models.

Open Calculator →

Llama 3 8B Instruct — FAQ

Related Models

Related Guides