DeepSeek V4 Flash Pricing
FastDeepSeek · 1M tokens context
DeepSeek V4 Flash from DeepSeek costs $0.098 per 1 million input tokens and $0.197 per 1 million output tokens as of June 2026 (live OpenRouter data). The model supports a 1,048,576-token context window (approximately 786,432 words) with a 131K-token maximum output. A typical 1,000-token request costs $0.0001 in input charges; a 10,000-token request costs $0.0010.
| Input price | $0.098 / 1M tokens |
|---|---|
| Output price | $0.197 / 1M tokens |
| Output / input ratio | 2.0× |
| Context window | 1,048,576 tokens (~786,432 words) |
| Maximum output | 131,072 tokens |
| Cost per 1K tokens (input) | $0.0001 |
| Tier | Fast |
| Last verified |
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...
Input Price
$0.098
per 1 million tokens
Output Price
$0.197
per 1 million tokens
Context Window
1M tokens
max 131K output
Cost Examples
| Request Type | Tokens | Input Cost | Output Cost |
|---|---|---|---|
| 1,000 word article | 1,333 | $0.000131 | $0.0000786 |
| 10-page document (2,500 words) | 3,333 | $0.000328 | $0.000197 |
| 1,000 lines of code | 5,000 | $0.000491 | $0.000295 |
| 100K token document | 100,000 | $0.00983 | $0.0059 |
Output cost estimated at 30% of input token count. Use the calculator for exact figures.
Strengths
- ✓Extremely cheap at $0.098/1M input tokens
- ✓Massive 1M-token context window
- ✓Low latency for high-throughput workloads
Limitations
- –Less capable than flagship models on complex reasoning
- –Quality and availability can vary by hosting provider
Best Use Cases
Calculate DeepSeek V4 Flash Costs
Use the TokenRate calculator to convert any budget, token count, or text into exact DeepSeek V4 Flash costs — and compare across all models.
Open Calculator →DeepSeek V4 Flash — FAQ
Related Models
Related Guides