Meta API Pricing
Meta publishes the Llama family of open-weight models. Llama 3.1 ranges from a tiny 8B variant to a 405B frontier model, and is hosted by every major inference provider.
Models
3 tracked
All tiers, latest pricing.
All Meta Models
| Model | Tier | Input / 1M | Output / 1M | Context |
|---|---|---|---|---|
| Llama 3.1 8B | fast | $0.050 | $0.080 | 128K |
| Llama 3.1 70B | balanced | $0.590 | $0.790 | 128K |
| Llama 3.1 405B | flagship | $2.70 | $2.70 | 128K |
Model Details
Llama 3.1 8B
$0.050 inLlama 3.1 8B is the smallest open-weight Llama 3.1 model — extremely cheap to host or call, and good enough for classification, extraction, and basic chat.
Llama 3.1 70B
$0.590 inLlama 3.1 70B is the mid-size open-weight model in the 3.1 family — a popular sweet spot for production workloads that need GPT-4o-mini-class quality at open-weight prices.
Llama 3.1 405B
$2.70 inLlama 3.1 405B is Meta's largest open-weight model — competitive with GPT-4-class models on many benchmarks and uniquely available for self-hosting. Symmetric input/output pricing is common across hosted providers.
Calculate Meta API Costs
Use the TokenRate calculator to estimate exactly what Meta models will cost for your workload.
Open Calculator →Other Providers