Question 1

How much does Llama 3.1 405B cost per token?

Accepted Answer

As of May 2026, Llama 3.1 405B costs $2.70 per 1 million input tokens and $2.70 per 1 million output tokens — an output-to-input ratio of 1.0×. A 1,000-token input request costs $0.0027; a 10,000-token input request costs $0.0270. These prices are sourced from live OpenRouter data and verified against the provider's published pricing page.

Question 2

How many tokens does Llama 3.1 405B support per request?

Accepted Answer

Llama 3.1 405B's context window and per-request output cap are listed in the pricing table above. The context window is the maximum input + output combined; the output limit is how many tokens the model can generate in a single response. Most 2026 flagship models support at least 128,000-token context windows, with Gemini 2.5 Pro reaching 1 million tokens.

Question 3

Is Llama 3.1 405B good for coding?

Accepted Answer

Llama 3.1 405B's coding performance depends on its tier (see the badge above). Reasoning-tier models (OpenAI o3, DeepSeek R1) lead on hard algorithmic problems; balanced models like Claude Sonnet 4 and GPT-4o lead on long-context codebase work. For specifics, check the strengths section above and the /compare/best-models-for-coding ranking.

Question 4

How does Llama 3.1 405B compare to other AI models?

Accepted Answer

See the related models grid below this section for direct comparisons. Llama 3.1 405B is best understood relative to: (1) cheaper models in the same family (when speed matters more than capability), (2) competitor flagships at the same tier (for head-to-head decisions), and (3) reasoning models (when accuracy on hard problems outweighs cost).

Input price	$2.70 / 1M tokens
Output price	$2.70 / 1M tokens
Output / input ratio	1.0×
Context window	128,000 tokens (~96,000 words)
Maximum output	4,096 tokens
Cost per 1K tokens (input)	$0.0027
Tier	Flagship
Last verified	2026-06-10

Request Type	Tokens	Input Cost	Output Cost
1,000 word article	1,333	$0.0036	$0.00108
10-page document (2,500 words)	3,333	$0.009	$0.0027
1,000 lines of code	5,000	$0.0135	$0.00405
100K token document	100,000	$0.27	$0.081

Llama 3.1 405B Pricing

Cost Examples

Strengths

Limitations

Best Use Cases

Calculate Llama 3.1 405B Costs

Llama 3.1 405B — FAQ