TokenRate

Llama 3.2 11B Vision Pricing

Fast

Meta · 131K tokens context

Llama 3.2 11B Vision from Meta costs $0.345 per 1 million input tokens and $0.345 per 1 million output tokens as of June 2026 (live OpenRouter data). The model supports a 131,072-token context window (approximately 98,304 words) with a 4,096-token maximum output. A 1,000-token request costs $0.000345 in input charges; a 10,000-token request costs $0.00345.
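The per-request figures above follow from a simple linear formula: tokens multiplied by the per-million rate, divided by one million. A minimal sketch in Python, assuming the $0.345 per 1M token rates quoted on this page (the constant and function names are illustrative and not part of any TokenRate or OpenRouter API):

```python
from decimal import Decimal

# Rates quoted on this page (USD per 1 million tokens); live prices may differ.
INPUT_PRICE_PER_M = Decimal("0.345")
OUTPUT_PRICE_PER_M = Decimal("0.345")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int = 0) -> Decimal:
    """Return the USD cost of one request at the rates above (hypothetical helper)."""
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M / TOKENS_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M / TOKENS_PER_M
    return input_cost + output_cost


print(f"${request_cost(1_000):.6f}")   # $0.000345
print(f"${request_cost(10_000):.6f}")  # $0.003450
```

Decimal is used here only to avoid floating-point noise at these small amounts; plain floats would give the same figures to the precision shown.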

Llama 3.2 11B Vision pricing and capability summary
Input price: $0.345 / 1M tokens
Output price: $0.345 / 1M tokens
Output / input ratio: 1.0×
Context window: 131,072 tokens (~98,304 words)
Maximum output: 4,096 tokens
Cost per 1K tokens (input): $0.000345
Tier: Fast
Last verified

Llama 3.2 11B Vision is Meta's small open-weight multimodal model, capable of understanding images at a fraction of GPT-4o's cost. It is the go-to choice for budget image + text pipelines.

Live pricing from OpenRouter

Input Price: $0.345 per 1 million tokens
Output Price: $0.345 per 1 million tokens
Context Window: 131K tokens (max 4K output)
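These limits can be checked before a request is sent. A rough sketch, assuming the prompt and the generated output count against the same 131,072-token window (common for chat models, but not stated on this page); `plan_request` is a hypothetical helper name:

```python
CONTEXT_WINDOW = 131_072  # tokens, as listed on this page
MAX_OUTPUT = 4_096        # tokens, as listed on this page


def plan_request(input_tokens: int, wanted_output_tokens: int) -> dict:
    """Clamp the output budget and report whether the request fits."""
    # A single response cannot exceed MAX_OUTPUT tokens.
    output_budget = min(wanted_output_tokens, MAX_OUTPUT)
    # Assumption: input and output tokens share one context window.
    fits = input_tokens + output_budget <= CONTEXT_WINDOW
    return {"output_budget": output_budget, "fits": fits}


print(plan_request(100_000, 30_000))  # {'output_budget': 4096, 'fits': True}
print(plan_request(130_000, 4_096))   # {'output_budget': 4096, 'fits': False}
```

The first call shows that a long document still fits, but any output request above 4,096 tokens is clamped, so longer outputs would have to be split across several requests.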

Cost Examples

Request Type                      Tokens    Input Cost   Output Cost
1,000 word article                1,333     $0.00046     $0.000138
10-page document (2,500 words)    3,333     $0.00115     $0.000345
1,000 lines of code               5,000     $0.00172     $0.000517
100K token document               100,000   $0.0345      $0.0104

Output cost estimated at 30% of input token count. Use the calculator for exact figures.
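The estimates in the table can be reproduced with a few lines of Python. This is a sketch under stated assumptions: the 30% output ratio comes from the note above, the 0.75 words-per-token factor is inferred from the table (1,000 words listed as 1,333 tokens) rather than stated by the page, and all names are illustrative.

```python
PRICE_PER_M = 0.345      # USD per 1M tokens, input and output (rate quoted on this page)
OUTPUT_RATIO = 0.30      # page's assumption: output tokens = 30% of input tokens
WORDS_PER_TOKEN = 0.75   # assumption inferred from the table, not an exact tokenizer count


def tokens_from_words(words: int) -> int:
    """Rough word-to-token conversion (hypothetical helper)."""
    return round(words / WORDS_PER_TOKEN)


def estimate_costs(input_tokens: int) -> tuple[float, float]:
    """Return (input_cost, estimated_output_cost) in USD."""
    output_tokens = input_tokens * OUTPUT_RATIO
    input_cost = input_tokens * PRICE_PER_M / 1_000_000
    output_cost = output_tokens * PRICE_PER_M / 1_000_000
    return input_cost, output_cost


rows = [
    ("1,000 word article", tokens_from_words(1_000)),
    ("10-page document", tokens_from_words(2_500)),
    ("1,000 lines of code", 5_000),
    ("100K token document", 100_000),
]
for label, tokens in rows:
    inp, out = estimate_costs(tokens)
    print(f"{label:<22}{tokens:>8,}  ${inp:.6f}  ${out:.6f}")
```

Real token counts depend on the tokenizer and on any image inputs, so these numbers are planning estimates rather than billing figures.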

Strengths

  • Multimodal (image + text) at $0.345/1M
  • Open weights — self-hostable
  • Fast

Limitations

  • Below GPT-4o Vision on complex visual reasoning
  • Small output limit (4,096 tokens)

Best Use Cases

  • Image classification
  • Visual Q&A on a budget
  • Document OCR pipelines

Calculate Llama 3.2 11B Vision Costs

Use the TokenRate calculator to convert any budget, token count, or text into exact Llama 3.2 11B Vision costs — and compare across all models.
