Llama 3.2 11B Vision Pricing
Fast · Meta · 131K tokens context
Llama 3.2 11B Vision from Meta costs $0.345 per 1 million input tokens and $0.345 per 1 million output tokens as of June 2026 (live OpenRouter data). The model supports a 131,072-token context window (approximately 98,304 words, at about 0.75 words per token) with a 4,096-token maximum output. A typical 1,000-token request costs $0.000345 in input charges; a 10,000-token request costs $0.00345.
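The per-request figures above follow from a simple proportional formula: tokens divided by one million, times the per-million price. A minimal sketch, assuming the rates quoted in this section; the function name `request_cost` and the constants are illustrative and not part of any provider API:

```python
# Prices quoted in this section, in USD per 1 million tokens.
INPUT_PRICE_PER_M = 0.345
OUTPUT_PRICE_PER_M = 0.345


def request_cost(input_tokens: int, output_tokens: int = 0) -> float:
    """Return the USD cost of one request at the quoted per-million rates."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Input-only cost for the two request sizes mentioned in the text.
    for tokens in (1_000, 10_000):
        print(f"{tokens:>6} input tokens: ${request_cost(tokens):.6f}")
```

Because the input and output rates are identical here, a request's cost depends only on its total token count.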
| Metric | Value |
|---|---|
| Input price | $0.345 / 1M tokens |
| Output price | $0.345 / 1M tokens |
| Output / input ratio | 1.0× |
| Context window | 131,072 tokens (~98,304 words) |
| Maximum output | 4,096 tokens |
| Cost per 1K tokens (input) | $0.000345 |
| Tier | Fast |
| Last verified | June 2026 |
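The word estimate in the table implies a ratio of about 0.75 words per token (98,304 / 131,072). The sketch below converts between the two and checks whether a request fits the limits. The helper names are hypothetical, the ratio is only a rule of thumb for English text, and counting output tokens against the context window is an assumption that may not hold for every provider:

```python
CONTEXT_WINDOW_TOKENS = 131_072
MAX_OUTPUT_TOKENS = 4_096
WORDS_PER_TOKEN = 0.75  # assumption implied by 131,072 tokens ~ 98,304 words


def tokens_to_words(tokens: int) -> int:
    """Approximate word count for a given token count."""
    return round(tokens * WORDS_PER_TOKEN)


def words_to_tokens(words: int) -> int:
    """Approximate token count for a given word count."""
    return round(words / WORDS_PER_TOKEN)


def fits_limits(prompt_tokens: int, output_tokens: int = MAX_OUTPUT_TOKENS) -> bool:
    """True if the output is within the cap and prompt + output fit the window.

    Assumes output tokens count against the context window.
    """
    if output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    print(tokens_to_words(CONTEXT_WINDOW_TOKENS))  # words in a full window
    print(words_to_tokens(2_500))                  # tokens in a 2,500-word document
    print(fits_limits(100_000))                    # 100K prompt plus a full-size reply
```

Real token counts depend on the tokenizer and the content, so these conversions are only suitable for rough budgeting.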
Llama 3.2 11B Vision is Meta's small open-weight multimodal model, capable of understanding images at a fraction of GPT-4o's cost. It is a go-to choice for budget image + text pipelines.
Cost Examples
| Request Type | Tokens | Input Cost | Output Cost |
|---|---|---|---|
| 1,000 word article | 1,333 | $0.00046 | $0.000138 |
| 10-page document (2,500 words) | 3,333 | $0.00115 | $0.000345 |
| 1,000 lines of code | 5,000 | $0.00172 | $0.000517 |
| 100K token document | 100,000 | $0.0345 | $0.0104 |
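Output cost is estimated at 30% of the input token count. For the 100K-token row this implies 30,000 output tokens, which exceeds the model's 4,096-token maximum output, so that figure is illustrative only. Use the calculator for exact figures.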
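The table rows can be reproduced with a few lines of arithmetic. This is a sketch under the stated assumptions (output tokens equal 30% of input tokens, $0.345 per 1M tokens in both directions); the `estimate` function is hypothetical, and it also flags rows whose estimated output would exceed the 4,096-token maximum:

```python
PRICE_PER_M = 0.345        # USD per 1M tokens, same for input and output
OUTPUT_RATIO = 0.30        # output estimated at 30% of input tokens
MAX_OUTPUT_TOKENS = 4_096  # maximum output quoted for the model

EXAMPLES = [
    ("1,000 word article", 1_333),
    ("10-page document (2,500 words)", 3_333),
    ("1,000 lines of code", 5_000),
    ("100K token document", 100_000),
]


def estimate(input_tokens: int) -> tuple[float, float, bool]:
    """Return (input cost, estimated output cost, output exceeds the cap)."""
    output_tokens = round(input_tokens * OUTPUT_RATIO)
    input_cost = input_tokens / 1_000_000 * PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * PRICE_PER_M
    return input_cost, output_cost, output_tokens > MAX_OUTPUT_TOKENS


if __name__ == "__main__":
    for label, tokens in EXAMPLES:
        input_cost, output_cost, over_cap = estimate(tokens)
        note = " (estimated output exceeds max output)" if over_cap else ""
        print(f"{label}: in ${input_cost:.6f}, out ${output_cost:.6f}{note}")
```

Only the 100K-token row trips the cap check, which is why its output cost should be read as an upper-bound illustration and not as a figure for a single request.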
Strengths
- ✓ Multimodal (image + text) at $0.345/1M tokens
- ✓ Open weights, self-hostable
- ✓ Fast
Limitations
- – Below GPT-4o Vision on complex visual reasoning
- – Small output limit (4,096 tokens)
Best Use Cases
Calculate Llama 3.2 11B Vision Costs
Use the TokenRate calculator to convert any budget, token count, or text into exact Llama 3.2 11B Vision costs — and compare across all models.
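Converting a budget into tokens is the inverse of the pricing formula. The sketch below is not the TokenRate calculator itself, only a hypothetical illustration of the arithmetic, assuming the $0.345 per 1M token rate quoted on this page:

```python
PRICE_PER_M = 0.345  # USD per 1M tokens, as quoted on this page


def tokens_for_budget(budget_usd: float, price_per_m: float = PRICE_PER_M) -> int:
    """Return how many whole tokens a budget buys at a per-million price."""
    if price_per_m <= 0:
        raise ValueError("price_per_m must be positive")
    return int(budget_usd / price_per_m * 1_000_000)


def requests_for_budget(budget_usd: float, tokens_per_request: int) -> int:
    """Return how many whole requests of a given total size a budget covers."""
    if tokens_per_request <= 0:
        raise ValueError("tokens_per_request must be positive")
    return tokens_for_budget(budget_usd) // tokens_per_request


if __name__ == "__main__":
    print(tokens_for_budget(1.00))           # tokens bought with one dollar
    print(requests_for_budget(1.00, 1_333))  # 1,333-token requests per dollar
```

Since input and output cost the same for this model, `tokens_per_request` can be taken as the sum of input and output tokens.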
Llama 3.2 11B Vision FAQ