Llama 3.3 Nemotron Super 49B V1.5 Pricing
BalancedNVIDIA · 131K tokens context
Llama 3.3 Nemotron Super 49B V1.5 from NVIDIA costs $0.400 per 1 million input tokens and $0.400 per 1 million output tokens as of July 2026 (live OpenRouter data). The model supports a 131,072-token context window (approximately 98,304 words) with a 16K-token maximum output. A typical 1,000-token request costs $0.0004 in input charges; a 10,000-token request costs $0.0040.
| Input price | $0.400 / 1M tokens |
|---|---|
| Output price | $0.400 / 1M tokens |
| Output / input ratio | 1.0× |
| Context window | 131,072 tokens (~98,304 words) |
| Maximum output | 16,384 tokens |
| Cost per 1K tokens (input) | $0.0004 |
| Tier | Balanced |
| Last verified |
Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context.
Input Price
$0.400
per 1 million tokens
Output Price
$0.400
per 1 million tokens
Context Window
131K tokens
max 16K output
Cost Examples
| Request Type | Tokens | Input Cost | Output Cost |
|---|---|---|---|
| 1,000 word article | 1,333 | $0.000533 | $0.00016 |
| 10-page document (2,500 words) | 3,333 | $0.00133 | $0.0004 |
| 1,000 lines of code | 5,000 | $0.002 | $0.0006 |
| 100K token document | 100,000 | $0.04 | $0.012 |
Output cost estimated at 30% of input token count. Use the calculator for exact figures.
Strengths
- ✓Affordable at $0.40/1M input tokens
- ✓Strong general-purpose performance
- ✓Strong general-purpose performance
Limitations
- –Quality and availability can vary by hosting provider
- –Quality and availability can vary by hosting provider
Best Use Cases
Calculate Llama 3.3 Nemotron Super 49B V1.5 Costs
Use the TokenRate calculator to convert any budget, token count, or text into exact Llama 3.3 Nemotron Super 49B V1.5 costs — and compare across all models.
Open Calculator →Llama 3.3 Nemotron Super 49B V1.5 — FAQ
Related Models
Related Guides