Google API Pricing

Google's Gemini family pushes the long-context frontier with 1M+ token windows, native multimodality, and aggressive pricing on the Flash tier. Available via Google AI Studio and Vertex AI.

Official site: ai.google →

Cheapest

Gemma 3 4B

$0.050/1M in

Flagship

Gemini 2.5 Pro

$1.25/1M in

Models

30 tracked

All tiers, latest pricing.

All Google Models

Model	Tier	Input / 1M	Output / 1M	Context
Gemma 3 4B	fast	$0.050	$0.100	131K
Gemma 3 12B	fast	$0.050	$0.150	131K
Gemma 3n 4B	fast	$0.060	$0.120	33K
Gemma 4 26B A4B	fast	$0.070	$0.340	262K
Gemini 1.5 Flash	fast	$0.075	$0.300	1M
Gemini 2.0 Flash Lite	fast	$0.075	$0.300	1M
Gemini 2.0 Flash	fast	$0.100	$0.400	1M
Gemini 2.5 Flash Lite	fast	$0.100	$0.400	1M
Gemma 3 27B	fast	$0.100	$0.300	262K
Gemma 4 31B	fast	$0.120	$0.370	262K
Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image)	fast	$0.250	$1.50	66K
Gemini 3.1 Flash Lite	fast	$0.250	$1.50	1M
Gemini 3.1 Flash Lite Preview	fast	$0.250	$1.50	1M
Gemini 2.5 Flash	balanced	$0.300	$2.50	1M
Gemini 3.5 Flash-Lite	fast	$0.300	$2.50	1M
Nano Banana (Gemini 2.5 Flash Image)	fast	$0.300	$2.50	33K
Nano Banana 2 (Gemini 3.1 Flash Image)	fast	$0.500	$3.00	131K
Nano Banana 2 (Gemini 3.1 Flash Image Preview)	fast	$0.500	$3.00	131K
Gemini 3 Flash Preview	fast	$0.500	$3.00	1M
Gemma 2 27B	fast	$0.650	$0.650	8K
Gemini 2.5 Pro	flagship	$1.25	$10.00	1M
Gemini 1.5 Pro	flagship	$1.25	$5.00	2M
Gemini 2.5 Pro Preview 06-05	fast	$1.25	$10.00	1M
Gemini 2.5 Pro Preview 05-06	fast	$1.25	$10.00	1M
Gemini 3.6 Flash	fast	$1.50	$7.50	1M
Gemini 3.5 Flash	fast	$1.50	$9.00	1M
Nano Banana Pro (Gemini 3 Pro Image)	fast	$2.00	$12.00	131K
Gemini 3.1 Pro Preview Custom Tools	fast	$2.00	$12.00	1M
Gemini 3.1 Pro Preview	fast	$2.00	$12.00	1M
Nano Banana Pro (Gemini 3 Pro Image Preview)	fast	$2.00	$12.00	66K

Model Details

Gemma 3 4B

$0.050 in

Gemma 3 introduces multimodality, supporting vision-language input and text outputs.

Gemma 3 12B

$0.050 in

Gemma 3 introduces multimodality, supporting vision-language input and text outputs.

Gemma 3n 4B

$0.060 in

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets.

Gemma 4 26B A4B

$0.070 in

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind.

Gemini 1.5 Flash

$0.075 in

Gemini 1.5 Flash is one of the cheapest capable models available. With a 1M context window and ultra-low pricing, it's ideal for bulk document processing and cost-sensitive pipelines.

Gemini 2.0 Flash Lite

$0.075 in

Gemini 2.0 Flash Lite is Google's cheapest capable model — matching Gemini 1.5 Flash pricing while running on the newer 2.0 architecture. The best entry point for cost-optimized 1M-context workloads.

Gemini 2.0 Flash

$0.100 in

Gemini 2.0 Flash is Google's speed-optimized model — extremely affordable with a 1M token context window. One of the best value options for high-throughput workloads.

Gemini 2.5 Flash Lite

$0.100 in

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency.

Gemma 3 27B

$0.100 in

Gemma 3 introduces multimodality, supporting vision-language input and text outputs.

Gemma 4 31B

$0.120 in

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output.

Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image)

$0.250 in

Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image) is Google's fastest, most cost-efficient Gemini image model, built for high-velocity developer pipelines and rapid-fire visual exploration.

Gemini 3.1 Flash Lite

$0.250 in

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads.

Gemini 3.1 Flash Lite Preview

$0.250 in

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases.

Gemini 2.5 Flash

$0.300 in

Gemini 2.5 Flash is the mid-tier 2.5-generation model — markedly faster than Pro while keeping the full 1M context window. The default choice when you want long context cheaply.

Gemini 3.5 Flash-Lite

$0.300 in

Gemini 3.5 Flash-Lite is a high-efficiency model from Google with upgraded agentic capabilities. It is suited for subagents that execute focused tasks within complex, multi-agent workflows.

Nano Banana (Gemini 2.5 Flash Image)

$0.300 in

Nano Banana (Gemini 2.5 Flash Image) is Google's a fast, low-cost model tuned for high-throughput tasks like classification, extraction, and simple chat. It costs $0.300 per million input tokens with a 33K-token context window and native image understanding.

Nano Banana 2 (Gemini 3.1 Flash Image)

$0.500 in

Nano Banana 2 (Gemini 3.1 Flash Image) is Google's a fast, low-cost model tuned for high-throughput tasks like classification, extraction, and simple chat. It costs $0.500 per million input tokens with a 131K-token context window and native image understanding.

Nano Banana 2 (Gemini 3.1 Flash Image Preview)

$0.500 in

Nano Banana 2 (Gemini 3.1 Flash Image Preview) is Google's a fast, low-cost model tuned for high-throughput tasks like classification, extraction, and simple chat. It costs $0.500 per million input tokens with a 131K-token context window and native image understanding.

Gemini 3 Flash Preview

$0.500 in

Gemini 3 Flash Preview is a high speed, high value thinking model designed for agentic workflows, multi turn chat, and coding assistance.

Gemma 2 27B

$0.650 in

Gemma 2 27B by Google is an open model built from the same research and technology used to create the Gemini models.

Gemini 2.5 Pro

$1.25 in

Gemini 2.5 Pro is Google's most capable model, featuring a massive 1M token context window — the largest of any major model. It's particularly strong on reasoning, code, and tasks requiring long document understanding.

Gemini 1.5 Pro

$1.25 in

Gemini 1.5 Pro is the previous-generation Google flagship — famous for its 2M token context window. Still useful for extreme long-context jobs but outclassed by 2.5 Pro on most benchmarks.

Gemini 2.5 Pro Preview 06-05

$1.25 in

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks.

Gemini 2.5 Pro Preview 05-06

$1.25 in

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks.

Gemini 3.6 Flash

$1.50 in

Gemini 3.6 Flash is a high-efficiency model from Google for coding, agentic workflows, and web and app development.

Gemini 3.5 Flash

$1.50 in

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed.

Nano Banana Pro (Gemini 3 Pro Image)

$2.00 in

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro.

Gemini 3.1 Pro Preview Custom Tools

$2.00 in

Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party.

Gemini 3.1 Pro Preview

$2.00 in

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows.

Nano Banana Pro (Gemini 3 Pro Image Preview)

$2.00 in

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro.

Calculate Google API Costs

Use the TokenRate calculator to estimate exactly what Google models will cost for your workload.

Open Calculator →

Other Providers