TokenRate

Llama 4 Scout Pricing

Fast

Meta · 10M tokens context

Llama 4 Scout from Meta costs $0.100 per 1 million input tokens and $0.300 per 1 million output tokens as of June 2026 (live OpenRouter data). The model supports a 10,000,000-token context window (approximately 7,500,000 words) with a 16K-token maximum output. A typical 1,000-token request costs $0.0001 in input charges; a 10,000-token request costs $0.0010.

Llama 4 Scout pricing and capability summary
Input price$0.100 / 1M tokens
Output price$0.300 / 1M tokens
Output / input ratio3.0×
Context window10,000,000 tokens (~7,500,000 words)
Maximum output16,384 tokens
Cost per 1K tokens (input)$0.0001
TierFast
Last verified

Llama 4 Scout is Meta's latest MoE model with an industry-leading 10M token context window at an affordable price. Remarkable context-to-cost ratio — suitable for entire-codebase and very long document tasks.

Live pricing from OpenRouter

Input Price

$0.100

per 1 million tokens

Output Price

$0.300

per 1 million tokens

Context Window

10M tokens

max 16K output

Cost Examples

Request TypeTokensInput CostOutput Cost
1,000 word article1,333$0.000133$0.00012
10-page document (2,500 words)3,333$0.000333$0.0003
1,000 lines of code5,000$0.0005$0.00045
100K token document100,000$0.01$0.009

Output cost estimated at 30% of input token count. Use the calculator for exact figures.

Strengths

  • 10M token context window — largest available
  • Very cheap at $0.17/1M symmetric
  • MoE architecture for efficient inference
  • Open weights

Limitations

  • Very long contexts require specialist infrastructure
  • MoE quality varies by host

Best Use Cases

Entire-codebase analysis
Ultra-long document processing
Long-running agent tasks

Calculate Llama 4 Scout Costs

Use the TokenRate calculator to convert any budget, token count, or text into exact Llama 4 Scout costs — and compare across all models.

Open Calculator →

Llama 4 Scout — FAQ

Related Models

Related Guides