TokenRate

Gemma 3 4B Pricing

Fast

Google · 131K tokens context

Gemma 3 4B from Google costs $0.050 per 1 million input tokens and $0.100 per 1 million output tokens as of June 2026 (live OpenRouter data). The model supports a 131,072-token context window (approximately 98,304 words) with a 16K-token maximum output. A typical 1,000-token request costs $0.0000 in input charges; a 10,000-token request costs $0.0005.

Gemma 3 4B pricing and capability summary
Input price$0.050 / 1M tokens
Output price$0.100 / 1M tokens
Output / input ratio2.0×
Context window131,072 tokens (~98,304 words)
Maximum output16,384 tokens
Cost per 1K tokens (input)$0.0000
TierFast
Last verified

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Live pricing from OpenRouter

Input Price

$0.050

per 1 million tokens

Output Price

$0.100

per 1 million tokens

Context Window

131K tokens

max 16K output

Cost Examples

Request TypeTokensInput CostOutput Cost
1,000 word article1,333$0.0000666$0.00004
10-page document (2,500 words)3,333$0.000167$0.0001
1,000 lines of code5,000$0.00025$0.00015
100K token document100,000$0.005$0.003

Output cost estimated at 30% of input token count. Use the calculator for exact figures.

Strengths

  • Extremely cheap at $0.050/1M input tokens
  • Low latency for high-throughput workloads
  • Multimodal: understands images as well as text

Limitations

  • Less capable than flagship models on complex reasoning
  • Quality and availability can vary by hosting provider

Best Use Cases

High-volume classification
Text extraction and summarization
Simple chat and Q&A
Cost-sensitive pipelines

Calculate Gemma 3 4B Costs

Use the TokenRate calculator to convert any budget, token count, or text into exact Gemma 3 4B costs — and compare across all models.

Open Calculator →

Gemma 3 4B — FAQ

Related Models

Related Guides