Google Gemini API Pricing

Google Gemini API pricing covers low-cost Flash Lite models, fast Flash models, Pro models with long-context breakpoints, and multimodal image/audio variants. The standard rates below are tracked by ModelPricing.ai and sourced from the official Google Gemini pricing page. Compare with Anthropic Claude and OpenAI, or see all models in our LLM pricing comparison.

Google Gemini API Pricing at a Glance

Start here for the main Gemini API cost tradeoffs: Flash Lite for volume, Flash for speed, Pro for deeper reasoning, and image variants when output modality matters.

Model | Input / 1M | Output / 1M | Pricing Note
Gemini 2.0 Flash Lite | $0.075 | $0.30 | Lowest-cost Google text row
Gemini 2.5 Flash | $0.30 | $2.50 | Fast mid-tier inference
Gemini 2.5 Pro | $1.25 / $2.50 | $10.00 / $15.00 | Breakpoint at 200K input tokens
Gemini 3 Flash | $0.50 | $3.00 | Flat pricing for fast reasoning
Gemini 3.1 Pro Preview | $2.00 / $4.00 | $12.00 / $18.00 | Breakpoint at 200K input tokens

All Gemini Models

Model | Input $/1M tokens | Output $/1M tokens | Type | Notes
gemini-2.0-flash | $0.150 | $0.600 | Flat | -
gemini-2.0-flash-lite | $0.075 | $0.300 | Flat | -
gemini-2.5-computer-use | $1.25 / $2.50 | $10.00 / $15.00 | Breakpoint | Threshold: 200K tokens
gemini-2.5-flash | $0.300 | $2.50 | Flat | -
gemini-2.5-flash-image | $0.300 | $2.50 | Multimodal | text, image
gemini-2.5-flash-lite | $0.100 | $0.400 | Flat | -
gemini-2.5-flash-native-audio | $0.500 | $2.00 | Multimodal | text, audio
gemini-2.5-flash-preview-tts | $0.500 | $10.00 | Flat | -
gemini-2.5-pro | $1.25 / $2.50 | $10.00 / $15.00 | Breakpoint | Threshold: 200K tokens
gemini-2.5-pro-preview-tts | $1.00 | $20.00 | Flat | -
gemini-3-flash | $0.500 | $3.00 | Flat | -
gemini-3-pro-image-preview | $2.00 | $12.00 | Multimodal | text, image
gemini-3-pro-preview | $2.00 / $4.00 | $12.00 / $18.00 | Breakpoint | Threshold: 200K tokens
gemini-3.1-flash-image-preview | $0.500 | $3.00 | Multimodal | text, image
gemini-3.1-flash-lite-preview | $0.250 | $1.50 | Flat | -
gemini-3.1-pro-preview | $2.00 / $4.00 | $12.00 / $18.00 | Breakpoint | Threshold: 200K tokens
gemini-3.5-flash | $1.50 | $9.00 | Flat | -

Gemini API Cost Breakdown

Gemini API costs depend on which model you use and how many tokens you process. The cheapest option is Gemini 2.0 Flash Lite at just $0.075 per million input tokens — one of the lowest-cost LLM APIs on the market. Mid-tier models like Gemini 2.5 Flash offer stronger reasoning at $0.30/1M input, while Gemini 2.5 Pro starts at $1.25/1M input for complex tasks.

Across the text models listed above, output tokens cost roughly 4-8x as much as input tokens, so response length often matters more than prompt length. Gemini API pricing is competitive with both OpenAI and Anthropic, especially at the budget and mid-range tiers. Use our LLM cost calculator to estimate your specific Gemini API costs.
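As a rough illustration of how input and output rates combine, the sketch below prices a single request under flat pricing. The rates are copied from the tables above; the `FLAT_RATES` dictionary, the `request_cost` helper, and the token counts are hypothetical names and values chosen for illustration, not part of any Google SDK.

```python
# Rates in USD per 1M tokens (input, output), copied from the tables above.
FLAT_RATES = {
    "gemini-2.0-flash-lite": (0.075, 0.30),
    "gemini-2.5-flash": (0.30, 2.50),
    "gemini-3-flash": (0.50, 3.00),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request under flat per-token pricing."""
    input_rate, output_rate = FLAT_RATES[model]
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


# Hypothetical request: a 2,000-token prompt and a 500-token reply.
cost = request_cost("gemini-2.5-flash", input_tokens=2_000, output_tokens=500)
print(f"${cost:.6f}")  # prints $0.001850
```

In this example the 500 output tokens account for about two thirds of the cost, even though the prompt is four times longer.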

How Gemini Pricing Works

Gemini models use three pricing structures. Flat pricing charges a fixed rate per million tokens regardless of context length. This applies to Flash and Flash Lite models, making costs simple and predictable.

Breakpoint pricing applies to Pro-tier models like Gemini 2.5 Pro and Gemini 3 Pro. These models charge a lower rate for requests under 200K input tokens and a higher rate above that threshold, so typical short-context workloads pay the lower rate. Gemini 2.5 Pro, for example, charges $1.25/1M input under the threshold and $2.50/1M above it.
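A minimal sketch of breakpoint pricing, using the Gemini 2.5 Pro rates from the tables above. It assumes that once a request's input exceeds the threshold, the whole request (input and output) is billed at the higher tier, and that a request of exactly 200K tokens still gets the lower tier; this page does not spell out either detail, so check the official pricing page. `BreakpointRate` and `breakpoint_cost` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BreakpointRate:
    threshold: int       # input-token threshold
    input_low: float     # USD per 1M input tokens at or under the threshold
    input_high: float    # USD per 1M input tokens above the threshold
    output_low: float    # USD per 1M output tokens at or under the threshold
    output_high: float   # USD per 1M output tokens above the threshold


# Gemini 2.5 Pro rates from the tables above.
GEMINI_2_5_PRO = BreakpointRate(200_000, 1.25, 2.50, 10.00, 15.00)


def breakpoint_cost(rate: BreakpointRate, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request under breakpoint pricing.

    Assumption: the tier is chosen by input size and applies to the
    entire request, not only to the tokens past the threshold.
    """
    long_context = input_tokens > rate.threshold
    input_rate = rate.input_high if long_context else rate.input_low
    output_rate = rate.output_high if long_context else rate.output_low
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


print(f"{breakpoint_cost(GEMINI_2_5_PRO, 100_000, 2_000):.3f}")  # prints 0.145
print(f"{breakpoint_cost(GEMINI_2_5_PRO, 300_000, 2_000):.3f}")  # prints 0.780
```

Under these assumptions, tripling the input from 100K to 300K tokens raises the cost by more than five times, because both rates step up.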

Multimodal pricing is used by image and audio models. These have separate rates for each modality: for example, Gemini 3 Pro Image charges $2/1M for text input and $12/1M for text output, but $120/1M for image output tokens.
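To show how per-modality rates add up, here is a small sketch for Gemini 3 Pro Image. It takes the $2.00 input and $12.00 output rates from the gemini-3-pro-image-preview row in the model table and the $120/1M image output rate quoted above. The `RATES` keys, the `multimodal_cost` helper, and the token counts are hypothetical; how many tokens a generated image actually consumes is not stated on this page.

```python
# USD per 1M tokens for gemini-3-pro-image-preview, by token kind.
RATES = {
    "text_input": 2.00,
    "text_output": 12.00,
    "image_output": 120.00,
}


def multimodal_cost(tokens_by_kind: dict[str, int]) -> float:
    """Sum the USD cost over each kind of token in a request."""
    return sum(RATES[kind] * count for kind, count in tokens_by_kind.items()) / 1_000_000


# Hypothetical request: the image token count is an illustrative guess.
usage = {"text_input": 1_000, "text_output": 200, "image_output": 1_000}
total = multimodal_cost(usage)
image_share = RATES["image_output"] * usage["image_output"] / 1_000_000 / total
print(f"total ${total:.4f}, image share {image_share:.0%}")
# prints total $0.1244, image share 96%
```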

Gemini vs Claude vs GPT: Price Comparison

How does Gemini API pricing stack up against Claude and GPT? Here's a side-by-side comparison at each price tier. See our full LLM pricing comparison for all models.

Tier | Model | Input / 1M | Output / 1M
Budget | Google gemini-2.0-flash-lite | $0.075 | $0.30
Budget | Anthropic claude-haiku-3 | $0.25 | $1.25
Budget | OpenAI gpt-4.1-nano | $0.10 | $0.40
Mid-range | Google gemini-2.5-flash | $0.30 | $2.50
Mid-range | Anthropic claude-haiku-4-5 | $1.00 | $5.00
Mid-range | OpenAI gpt-5-mini | $0.25 | $2.00
Flagship | Google gemini-2.5-pro | $1.25 | $10.00
Flagship | Anthropic claude-sonnet-4-6 | $3.00 | $15.00
Flagship | OpenAI gpt-5 | $1.25 | $10.00
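The rows above can be turned into a per-tier ranking for a given workload. The sketch below is illustrative only: the rates are the ones listed on this page (with gemini-2.0-flash-lite at the $0.075 input rate used in the model tables, and gemini-2.5-pro at its under-200K rate), while `TIERS`, `workload_cost`, `rank_tier`, and the 10M-input / 2M-output workload are hypothetical.

```python
# (input, output) rates in USD per 1M tokens, from the comparison table.
TIERS = {
    "Budget": {
        "gemini-2.0-flash-lite": (0.075, 0.30),
        "claude-haiku-3": (0.25, 1.25),
        "gpt-4.1-nano": (0.10, 0.40),
    },
    "Mid-range": {
        "gemini-2.5-flash": (0.30, 2.50),
        "claude-haiku-4-5": (1.00, 5.00),
        "gpt-5-mini": (0.25, 2.00),
    },
    "Flagship": {
        "gemini-2.5-pro": (1.25, 10.00),  # under-200K rate
        "claude-sonnet-4-6": (3.00, 15.00),
        "gpt-5": (1.25, 10.00),
    },
}


def workload_cost(rates: tuple[float, float], input_millions: float, output_millions: float) -> float:
    """USD cost for a workload expressed in millions of tokens."""
    input_rate, output_rate = rates
    return input_rate * input_millions + output_rate * output_millions


def rank_tier(tier: str, input_millions: float, output_millions: float) -> list[tuple[str, float]]:
    """Return (model, cost) pairs for one tier, cheapest first."""
    costs = {
        model: workload_cost(rates, input_millions, output_millions)
        for model, rates in TIERS[tier].items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


# Hypothetical monthly workload: 10M input tokens, 2M output tokens.
for tier in TIERS:
    ranked = ", ".join(f"{model} ${cost:.2f}" for model, cost in rank_tier(tier, 10, 2))
    print(f"{tier}: {ranked}")
```

With this particular input-heavy mix, Gemini is cheapest at the budget tier, gpt-5-mini comes out slightly ahead of gemini-2.5-flash at mid-range, and gemini-2.5-pro ties gpt-5 at the flagship tier; a different input/output ratio can change the ordering.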

Gemini API Monthly Cost Estimates

How much will the Gemini API cost you per month? Google's pricing is among the most competitive, especially at the budget tier. Use our LLM cost calculator for a precise estimate.

Light Use

$1-10/mo

  • Personal projects
  • <1K requests/day
  • Flash Lite for most tasks

Medium Use

$10-75/mo

  • Small team apps
  • 1-5K requests/day
  • Mix of Flash and Pro

Heavy Use

$75-400/mo

  • Production apps
  • 5-20K requests/day
  • Pro for quality tasks

Enterprise

$400+/mo

  • Large-scale deployments
  • 20K+ requests/day
  • Pro with multimodal
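The bands above depend heavily on request size and model mix. As a sanity check, the sketch below estimates a monthly bill from daily request volume and average token counts. The per-request token counts (1,000 input, 300 output) and the 30-day month are assumptions for illustration, and `monthly_cost` is a hypothetical helper; the rates come from the tables above.

```python
def monthly_cost(
    requests_per_day: int,
    input_tokens: int,
    output_tokens: int,
    input_rate: float,
    output_rate: float,
    days: int = 30,
) -> float:
    """Estimate monthly USD spend from average per-request token counts.

    Rates are USD per 1M tokens; flat pricing is assumed.
    """
    per_request = (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000
    return per_request * requests_per_day * days


# Light use: 1,000 requests/day on Gemini 2.0 Flash Lite ($0.075 / $0.30).
light = monthly_cost(1_000, 1_000, 300, input_rate=0.075, output_rate=0.30)

# Medium use: 2,000 requests/day on Gemini 2.5 Flash ($0.30 / $2.50).
medium = monthly_cost(2_000, 1_000, 300, input_rate=0.30, output_rate=2.50)

print(f"light ${light:.2f}/mo, medium ${medium:.2f}/mo")
# prints light $4.95/mo, medium $63.00/mo
```

Under these assumed token counts both figures land inside the corresponding bands; longer prompts, longer replies, or routing more traffic to Pro will push the totals up quickly.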

Which Gemini Model Should You Use?

Google's model lineup covers everything from ultra-cheap inference to multimodal generation. Here's how to pick the right Gemini model for your use case.

Use Case | Recommended Model | Est. Monthly Cost | Why This Model
High-volume classification | Gemini 2.0 Flash Lite | $1-10 | One of the cheapest LLM APIs available
General chatbot | Gemini 2.5 Flash | $10-50 | Strong reasoning at very low cost
Code & complex reasoning | Gemini 2.5 Pro | $30-150 | Best Gemini model for hard tasks
Image generation | Gemini 3 Pro Image | $50-300 | Native multimodal output
Real-time applications | Gemini 2.0 Flash | $3-20 | Fastest inference, low latency
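In application code, these recommendations can be encoded as a simple lookup so that each request type is routed to its suggested model. This is only a sketch: the model IDs are my reading of which rows in the model table match the names above, and `RECOMMENDED`, `pick_model`, and the fallback choice are hypothetical.

```python
# Use case -> model ID, following the recommendations in the table above.
RECOMMENDED = {
    "high-volume classification": "gemini-2.0-flash-lite",
    "general chatbot": "gemini-2.5-flash",
    "code & complex reasoning": "gemini-2.5-pro",
    "image generation": "gemini-3-pro-image-preview",
    "real-time applications": "gemini-2.0-flash",
}


def pick_model(use_case: str, default: str = "gemini-2.5-flash") -> str:
    """Return the recommended model ID, or a mid-tier default if unknown."""
    return RECOMMENDED.get(use_case.strip().lower(), default)


print(pick_model("Image generation"))  # prints gemini-3-pro-image-preview
print(pick_model("summarization"))     # prints gemini-2.5-flash
```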

5 Ways to Reduce Your Gemini API Costs

1. Use Flash Lite for simple tasks

At $0.075/1M input tokens, Gemini 2.0 Flash Lite is roughly 17x cheaper on input than Gemini 2.5 Pro ($1.25/1M). Use it for classification, extraction, and any task that doesn't need deep reasoning.

2. Stay under the 200K breakpoint

Gemini Pro models double their input rate above 200K input tokens. Split long documents into chunks or use Flash models for long-context tasks.

3. Check free-tier quotas

Google offers rate-limited free-tier access to the Gemini API through AI Studio. Check your dashboard before assuming production traffic will be free.

4. Be mindful of multimodal costs

Image output tokens on Gemini 3 Pro Image cost $120/1M, ten times the model's $12/1M text output rate. Only use multimodal models when you actually need image or audio output.

5. Monitor usage with the ModelPricing API

Track your Gemini API spending programmatically to identify cost spikes and optimize model selection.
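The second tip above (staying under the 200K breakpoint) can be sketched with a little arithmetic. The example compares the input cost of one long Gemini 2.5 Pro request against the same tokens split into equal chunks that each fit under the threshold. It assumes the higher rate applies to a whole request once its input exceeds 200K tokens, and it ignores output tokens and any per-chunk prompt overhead; `input_cost` and `chunked_input_cost` are hypothetical helpers.

```python
import math

THRESHOLD = 200_000                 # input-token breakpoint
LOW_RATE, HIGH_RATE = 1.25, 2.50    # Gemini 2.5 Pro input, USD per 1M tokens


def input_cost(tokens: int) -> float:
    """Input cost of one request; the tier is chosen by request size."""
    rate = HIGH_RATE if tokens > THRESHOLD else LOW_RATE
    return tokens * rate / 1_000_000


def chunked_input_cost(total_tokens: int, chunk_size: int = THRESHOLD) -> float:
    """Input cost when the tokens are split into near-equal chunks of at most chunk_size."""
    if total_tokens <= 0:
        return 0.0
    n_chunks = math.ceil(total_tokens / chunk_size)
    base, extra = divmod(total_tokens, n_chunks)
    # The first `extra` chunks carry one additional token each.
    sizes = [base + 1] * extra + [base] * (n_chunks - extra)
    return sum(input_cost(size) for size in sizes)


print(f"single request: ${input_cost(300_000):.3f}")          # prints single request: $0.750
print(f"two chunks:     ${chunked_input_cost(300_000):.3f}")  # prints two chunks:     $0.375
```

Chunking only pays off when the task tolerates it: each chunk loses the context of the others, and repeated instructions in every chunk add input tokens that this sketch does not count.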

Gemini Model Tiers

Flash Lite

The most affordable tier. Gemini 2.0 Flash Lite starts at just $0.075/1M input tokens, ideal for high-volume tasks like classification, extraction, and real-time applications.

Flash

Fast and cost-effective. Gemini 2.5 Flash and 3 Flash offer strong reasoning at $0.30-0.50/1M input tokens, great for coding, analysis, and multi-step workflows.

Pro

Most capable tier for complex reasoning and research. Gemini 2.5 Pro and 3 Pro use breakpoint pricing starting at $1.25-2/1M input tokens under 200K context.

Frequently Asked Questions

How much does the Gemini API cost?

Gemini API pricing varies by model. Gemini 2.0 Flash Lite starts at $0.075/1M input tokens, while Gemini 2.5 Pro costs $1.25-2.50/1M input tokens depending on context length.

What is the cheapest Gemini model?

Gemini 2.0 Flash Lite is the most affordable at $0.075/1M input tokens and $0.30/1M output tokens, making it one of the cheapest LLM APIs available.

Does Gemini have breakpoint pricing?

Yes. Gemini Pro-tier models such as Gemini 2.5 Pro, Gemini 3 Pro Preview, and Gemini 3.1 Pro Preview use breakpoint pricing where long-context requests above 200K input tokens move to a higher tier.

How does Gemini pricing compare to GPT and Claude?

Gemini Flash and Flash Lite models are very competitive for low-latency and high-volume work. At the flagship tier, gemini-2.5-pro matches gpt-5 at $1.25 input and $10.00 output per 1M tokens and undercuts claude-sonnet-4-6 at $3.00 and $15.00. The best fit depends heavily on context length and multimodal needs.

Does Gemini support multimodal pricing?

Yes. Models like Gemini 3 Pro Image and Gemini 2.5 Flash Image have separate rates for text and image tokens, with image output costing significantly more than text output.

Is the Gemini API free?

Google AI Studio offers free-tier quotas with rate limits, and paid usage starts from the low-cost Flash Lite tiers. Check the Google AI Studio or Cloud billing dashboard for current quotas before relying on free usage.

How much does Gemini 3 cost?

Gemini 3 Flash costs $0.50/1M input tokens and $3/1M output tokens with flat pricing. Gemini 3 Pro Preview uses breakpoint pricing at $2/1M input (under 200K tokens) or $4/1M input (above 200K tokens).

Which Gemini model is best for coding?

Gemini 2.5 Pro is the strongest Gemini model for code generation and debugging, starting at $1.25/1M input tokens. For lighter coding tasks, Gemini 2.5 Flash at $0.30/1M input offers good performance at a much lower cost.

Estimate Your Gemini API Costs

Get accurate, real-time cost estimates for any Gemini model with our API. Or try the LLM cost calculator to compare across all providers.
