Google Gemini API Pricing

Q: How much does the Gemini API cost?

Gemini API pricing varies by model. Gemini 2.0 Flash Lite starts at $0.075/1M input tokens, while Gemini 2.5 Pro costs $1.25-2.50/1M input tokens depending on context length.

Q: What is the cheapest Gemini model?

Gemini 2.0 Flash Lite is the most affordable at $0.075/1M input tokens and $0.30/1M output tokens, making it one of the cheapest LLM APIs available.

Q: Does Gemini have breakpoint pricing?

Yes. Gemini Pro-tier models such as Gemini 2.5 Pro, Gemini 3 Pro Preview, and Gemini 3.1 Pro Preview use breakpoint pricing where long-context requests above 200K input tokens move to a higher tier.

Q: How does Gemini pricing compare to GPT and Claude?

Gemini Flash and Flash Lite models are very competitive for low-latency and high-volume work, while Pro models are priced around the mid-to-flagship tier. The best fit depends heavily on context length and multimodal needs.

Q: Does Gemini support multimodal pricing?

Yes. Models like Gemini 3 Pro Image and Gemini 2.5 Flash Image have separate rates for text and image tokens, with image output costing significantly more than text output.

Q: Is the Gemini API free?

Google AI Studio offers free-tier quotas with rate limits, and paid usage starts from the low-cost Flash Lite tiers. Check the Google AI Studio or Cloud billing dashboard for current quotas before relying on free usage.

Q: How much does Gemini 3 cost?

Gemini 3 Flash costs $0.50/1M input tokens and $3/1M output tokens with flat pricing. Gemini 3 Pro Preview uses breakpoint pricing at $2/1M input (under 200K tokens) or $4/1M input (above 200K tokens).

Q: Which Gemini model is best for coding?

Gemini 2.5 Pro is the strongest Gemini model for code generation and debugging, starting at $1.25/1M input tokens. For lighter coding tasks, Gemini 2.5 Flash at $0.30/1M input offers good performance at a much lower cost.

Google Gemini API pricing covers low-cost Flash Lite models, fast Flash models, Pro models with long-context breakpoints, and multimodal image/audio variants. Here are the standard rates tracked by ModelPricing.ai. Rates sourced from the official Google Gemini pricing page. Compare with Anthropic Claude and OpenAI, or see all models in our LLM pricing comparison.

Google Gemini API Pricing at a Glance

Start here for the main Gemini API cost tradeoffs: Flash Lite for volume, Flash for speed, Pro for deeper reasoning, and image variants when output modality matters.

Model	Input / 1M	Output / 1M	Pricing Note
Gemini 2.0 Flash Lite	$0.075	$0.30	Lowest-cost Google text row
Gemini 2.5 Flash	$0.30	$2.50	Fast mid-tier inference
Gemini 2.5 Pro	$1.25 / $2.50	$10.00 / $15.00	Breakpoint at 200K input tokens
Gemini 3 Flash	$0.50	$3.00	Flat pricing for fast reasoning
Gemini 3.1 Pro Preview	$2.00 / $4.00	$12.00 / $18.00	Breakpoint at 200K input tokens

All Gemini Models

Model	Input $/1M tokens	Output $/1M tokens	Type	Notes
gemini-2.0-flash	$0.150	$0.600	Flat
gemini-2.0-flash-lite	$0.075	$0.300	Flat
gemini-2.5-computer-use	$1.25 / $2.50	$10.00 / $15.00	Breakpoint	Threshold: 200K tokens
gemini-2.5-flash	$0.300	$2.50	Flat
gemini-2.5-flash-image	$0.300	$2.50	Multimodal	text, image
gemini-2.5-flash-lite	$0.100	$0.400	Flat
gemini-2.5-flash-native-audio	$0.500	$2.00	Multimodal	text, audio
gemini-2.5-flash-preview-tts	$0.500	$10.00	Flat
gemini-2.5-pro	$1.25 / $2.50	$10.00 / $15.00	Breakpoint	Threshold: 200K tokens
gemini-2.5-pro-preview-tts	$1.00	$20.00	Flat
gemini-3-flash	$0.500	$3.00	Flat
gemini-3-pro-image-preview	$2.00	$12.00	Multimodal	text, image
gemini-3-pro-preview	$2.00 / $4.00	$12.00 / $12.00	Breakpoint	Threshold: 200K tokens
gemini-3.1-flash-image-preview	$0.500	$3.00	Multimodal	text, image
gemini-3.1-flash-lite-preview	$0.250	$1.50	Flat
gemini-3.1-pro-preview	$2.00 / $4.00	$12.00 / $18.00	Breakpoint	Threshold: 200K tokens
gemini-3.5-flash	$1.50	$9.00	Flat

Gemini API Cost Breakdown

Gemini API costs depend on which model you use and how many tokens you process. The cheapest option is Gemini 2.0 Flash Lite at just $0.075 per million input tokens — one of the lowest-cost LLM APIs on the market. Mid-tier models like Gemini 2.5 Flash offer stronger reasoning at $0.30/1M input, while Gemini 2.5 Pro starts at $1.25/1M input for complex tasks.

For most workloads, output tokens cost 2-8x more than input tokens. Gemini API pricing is competitive with both OpenAI and Anthropic, especially at the budget and mid-range tiers. Use our LLM cost calculator to estimate your specific Gemini API costs.

How Gemini Pricing Works

Gemini models use three pricing structures. Flat pricing charges a fixed rate per million tokens regardless of context length. This applies to Flash and Flash Lite models, making costs simple and predictable.

Breakpoint pricing applies to Pro-tier models like Gemini 2.5 Pro and Gemini 3 Pro. These models have a lower rate for requests under 200K input tokens and a higher rate above that threshold, letting you benefit from lower costs for typical workloads.

Multimodal pricing is used by image and audio models. These have separate rates for each modality — for example, Gemini 3 Pro Image charges $2/1M for text output but $120/1M for image output tokens.

Gemini vs Claude vs GPT: Price Comparison

How does Gemini API pricing stack up against Claude and GPT? Here's a side-by-side comparison at each price tier. See our full LLM pricing comparison for all models.

Tier	Model	Input / 1M	Output / 1M
Budget	Google gemini-2.0-flash-lite	$0.07	$0.30
	Anthropic claude-haiku-3	$0.25	$1.25
	OpenAI gpt-4.1-nano	$0.10	$0.40
Mid-range	Google gemini-2.5-flash	$0.30	$2.50
	Anthropic claude-haiku-4-5	$1.00	$5.00
	OpenAI gpt-5-mini	$0.25	$2.00
Flagship	Google gemini-2.5-pro	$1.25	$10.00
	Anthropic claude-sonnet-4-6	$3.00	$15.00
	OpenAI gpt-5	$1.25	$10.00

Gemini API Monthly Cost Estimates

How much will the Gemini API cost you per month? Google's pricing is among the most competitive, especially at the budget tier. Use our LLM cost calculator for a precise estimate.

Light Use

$1-10/mo

Personal projects
<1K requests/day
Flash Lite for most tasks

Medium Use

$10-75/mo

Small team apps
1-5K requests/day
Mix of Flash and Pro

Heavy Use

$75-400/mo

Production apps
5-20K requests/day
Pro for quality tasks

Enterprise

$400+/mo

Large-scale deployments
20K+ requests/day
Pro with multimodal

Which Gemini Model Should You Use?

Google's model lineup covers everything from ultra-cheap inference to multimodal generation. Here's how to pick the right Gemini model for your use case.

Use Case	Recommended Model	Est. Monthly Cost	Why This Model
High-volume classification	Flash Lite 2.0	$1-10	One of the cheapest LLM APIs available
General chatbot	Flash 2.5	$10-50	Strong reasoning at very low cost
Code & complex reasoning	Gemini 2.5 Pro	$30-150	Best Gemini model for hard tasks
Image generation	Gemini 3 Pro Image	$50-300	Native multimodal output
Real-time applications	Flash 2.0	$3-20	Fastest inference, low latency

5 Ways to Reduce Your Gemini API Costs

Use Flash Lite for simple tasks

At $0.075/1M input tokens, Flash Lite 2.0 is 17x cheaper than Gemini 2.5 Pro. Use it for classification, extraction, and any task that doesn't need deep reasoning.

Stay under the 200K breakpoint

Gemini Pro models double their input cost above 200K tokens. Split long documents into chunks or use Flash models for long-context tasks.

Check free-tier quotas

Google offers free-tier access for Gemini API through AI Studio with rate limits. Check your dashboard before assuming production traffic will be free.

Be mindful of multimodal costs

Image output tokens on Gemini 3 Pro Image cost significantly more than text. Only use multimodal models when you actually need image or audio output.

Monitor usage with the ModelPricing API

Track your Gemini API spending programmatically to identify cost spikes and optimize model selection. Get started free.

Gemini Model Tiers

Flash Lite

The most affordable tier. Gemini 2.0 Flash Lite starts at just $0.075/1M input tokens, ideal for high-volume tasks like classification, extraction, and real-time applications.

Flash

Fast and cost-effective. Gemini 2.5 Flash and 3 Flash offer strong reasoning at $0.30-0.50/1M input tokens, great for coding, analysis, and multi-step workflows.

Pro

Most capable tier for complex reasoning and research. Gemini 2.5 Pro and 3 Pro use breakpoint pricing starting at $1.25-2/1M input tokens under 200K context.

Frequently Asked Questions

How much does the Gemini API cost?

Gemini API pricing varies by model. Gemini 2.0 Flash Lite starts at $0.075/1M input tokens, while Gemini 2.5 Pro costs $1.25-2.50/1M input tokens depending on context length.

What is the cheapest Gemini model?

Gemini 2.0 Flash Lite is the most affordable at $0.075/1M input tokens and $0.30/1M output tokens, making it one of the cheapest LLM APIs available.

Does Gemini have breakpoint pricing?

Yes. Gemini Pro-tier models such as Gemini 2.5 Pro, Gemini 3 Pro Preview, and Gemini 3.1 Pro Preview use breakpoint pricing where long-context requests above 200K input tokens move to a higher tier.

How does Gemini pricing compare to GPT and Claude?

Gemini Flash and Flash Lite models are very competitive for low-latency and high-volume work, while Pro models are priced around the mid-to-flagship tier. The best fit depends heavily on context length and multimodal needs.

Does Gemini support multimodal pricing?

Yes. Models like Gemini 3 Pro Image and Gemini 2.5 Flash Image have separate rates for text and image tokens, with image output costing significantly more than text output.

Is the Gemini API free?

Google AI Studio offers free-tier quotas with rate limits, and paid usage starts from the low-cost Flash Lite tiers. Check the Google AI Studio or Cloud billing dashboard for current quotas before relying on free usage.

How much does Gemini 3 cost?

Gemini 3 Flash costs $0.50/1M input tokens and $3/1M output tokens with flat pricing. Gemini 3 Pro Preview uses breakpoint pricing at $2/1M input (under 200K tokens) or $4/1M input (above 200K tokens).

Which Gemini model is best for coding?

Gemini 2.5 Pro is the strongest Gemini model for code generation and debugging, starting at $1.25/1M input tokens. For lighter coding tasks, Gemini 2.5 Flash at $0.30/1M input offers good performance at a much lower cost.

Estimate Your Gemini API Costs

Get accurate, real-time cost estimates for any Gemini model with our API. Or try the LLM cost calculator to compare across all providers.

Get Started Free