Hermes 4 405B Pricing
FlagshipNous Research · 131K tokens context
Hermes 4 405B from Nous Research costs $1.00 per 1 million input tokens and $3.00 per 1 million output tokens as of July 2026 (live OpenRouter data). The model supports a 131,072-token context window (approximately 98,304 words) with a 32K-token maximum output. A typical 1,000-token request costs $0.0010 in input charges; a 10,000-token request costs $0.0100.
| Input price | $1.00 / 1M tokens |
|---|---|
| Output price | $3.00 / 1M tokens |
| Output / input ratio | 3.0× |
| Context window | 131,072 tokens (~98,304 words) |
| Maximum output | 32,000 tokens |
| Cost per 1K tokens (input) | $0.0010 |
| Tier | Flagship |
| Last verified |
Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research.
Input Price
$1.00
per 1 million tokens
Output Price
$3.00
per 1 million tokens
Context Window
131K tokens
max 32K output
Cost Examples
| Request Type | Tokens | Input Cost | Output Cost |
|---|---|---|---|
| 1,000 word article | 1,333 | $0.00133 | $0.0012 |
| 10-page document (2,500 words) | 3,333 | $0.00333 | $0.003 |
| 1,000 lines of code | 5,000 | $0.005 | $0.0045 |
| 100K token document | 100,000 | $0.10 | $0.09 |
Output cost estimated at 30% of input token count. Use the calculator for exact figures.
Strengths
- ✓Affordable at $1.00/1M input tokens
- ✓Frontier-class quality on complex tasks
- ✓Strong general-purpose performance
Limitations
- –Quality and availability can vary by hosting provider
- –Quality and availability can vary by hosting provider
Best Use Cases
Calculate Hermes 4 405B Costs
Use the TokenRate calculator to convert any budget, token count, or text into exact Hermes 4 405B costs — and compare across all models.
Open Calculator →Hermes 4 405B — FAQ
Related Models
Related Guides