Google API Pricing
Google's Gemini family pushes the long-context frontier with 1M+ token windows, native multimodality, and aggressive pricing on the Flash tier. Available via Google AI Studio and Vertex AI.
Models
5 tracked
All tiers, latest pricing.
All Google Models
| Model | Tier | Input / 1M | Output / 1M | Context |
|---|---|---|---|---|
| Gemini 1.5 Flash | fast | $0.075 | $0.300 | 1M |
| Gemini 2.0 Flash | fast | $0.100 | $0.400 | 1M |
| Gemini 2.5 Flash | balanced | $0.300 | $2.50 | 1M |
| Gemini 2.5 Pro | flagship | $1.25 | $10.00 | 1M |
| Gemini 1.5 Pro | flagship | $1.25 | $5.00 | 2M |
Model Details
Gemini 1.5 Flash
$0.075 inGemini 1.5 Flash is one of the cheapest capable models available. With a 1M context window and ultra-low pricing, it's ideal for bulk document processing and cost-sensitive pipelines.
Gemini 2.0 Flash
$0.100 inGemini 2.0 Flash is Google's speed-optimized model — extremely affordable with a 1M token context window. One of the best value options for high-throughput workloads.
Gemini 2.5 Flash
$0.300 inGemini 2.5 Flash is the mid-tier 2.5-generation model — markedly faster than Pro while keeping the full 1M context window. The default choice when you want long context cheaply.
Gemini 2.5 Pro
$1.25 inGemini 2.5 Pro is Google's most capable model, featuring a massive 1M token context window — the largest of any major model. It's particularly strong on reasoning, code, and tasks requiring long document understanding.
Gemini 1.5 Pro
$1.25 inGemini 1.5 Pro is the previous-generation Google flagship — famous for its 2M token context window. Still useful for extreme long-context jobs but outclassed by 2.5 Pro on most benchmarks.
Calculate Google API Costs
Use the TokenRate calculator to estimate exactly what Google models will cost for your workload.
Open Calculator →Other Providers