Google Gemini API Pricing
Google Gemini API pricing covers low-cost Flash Lite models, fast Flash models, Pro models with long-context breakpoints, and multimodal image/audio variants. The standard rates below are tracked by ModelPricing.ai and sourced from the official Google Gemini pricing page. Compare with Anthropic Claude and OpenAI, or see all models in our LLM pricing comparison.
Google Gemini API Pricing at a Glance
Start here for the main Gemini API cost tradeoffs: Flash Lite for volume, Flash for speed, Pro for deeper reasoning, and image variants when output modality matters.
| Model | Input / 1M | Output / 1M | Pricing Note |
|---|---|---|---|
| Gemini 2.0 Flash Lite | $0.075 | $0.30 | Lowest-cost Google text row |
| Gemini 2.5 Flash | $0.30 | $2.50 | Fast mid-tier inference |
| Gemini 2.5 Pro | $1.25 / $2.50 | $10.00 / $15.00 | Breakpoint at 200K input tokens |
| Gemini 3 Flash | $0.50 | $3.00 | Flat pricing for fast reasoning |
| Gemini 3.1 Pro Preview | $2.00 / $4.00 | $12.00 / $18.00 | Breakpoint at 200K input tokens |
All Gemini Models
| Model | Input $/1M tokens | Output $/1M tokens | Type | Notes |
|---|---|---|---|---|
| gemini-2.0-flash | $0.150 | $0.600 | Flat | |
| gemini-2.0-flash-lite | $0.075 | $0.300 | Flat | |
| gemini-2.5-computer-use | $1.25 / $2.50 | $10.00 / $15.00 | Breakpoint | Threshold: 200K tokens |
| gemini-2.5-flash | $0.300 | $2.50 | Flat | |
| gemini-2.5-flash-image | $0.300 | $2.50 | Multimodal | text, image |
| gemini-2.5-flash-lite | $0.100 | $0.400 | Flat | |
| gemini-2.5-flash-native-audio | $0.500 | $2.00 | Multimodal | text, audio |
| gemini-2.5-flash-preview-tts | $0.500 | $10.00 | Flat | |
| gemini-2.5-pro | $1.25 / $2.50 | $10.00 / $15.00 | Breakpoint | Threshold: 200K tokens |
| gemini-2.5-pro-preview-tts | $1.00 | $20.00 | Flat | |
| gemini-3-flash | $0.500 | $3.00 | Flat | |
| gemini-3-pro-image-preview | $2.00 | $12.00 | Multimodal | text, image |
| gemini-3-pro-preview | $2.00 / $4.00 | $12.00 / $12.00 | Breakpoint | Threshold: 200K tokens |
| gemini-3.1-flash-image-preview | $0.500 | $3.00 | Multimodal | text, image |
| gemini-3.1-flash-lite-preview | $0.250 | $1.50 | Flat | |
| gemini-3.1-pro-preview | $2.00 / $4.00 | $12.00 / $18.00 | Breakpoint | Threshold: 200K tokens |
| gemini-3.5-flash | $1.50 | $9.00 | Flat | |
Gemini API Cost Breakdown
Gemini API costs depend on which model you use and how many tokens you process. The cheapest option is Gemini 2.0 Flash Lite at just $0.075 per million input tokens — one of the lowest-cost LLM APIs on the market. Mid-tier models like Gemini 2.5 Flash offer stronger reasoning at $0.30/1M input, while Gemini 2.5 Pro starts at $1.25/1M input for complex tasks.
For most text models listed above, output tokens cost roughly 4-8x more than input tokens. Gemini API pricing is competitive with both OpenAI and Anthropic, especially at the budget and mid-range tiers. Use our LLM cost calculator to estimate your specific Gemini API costs.
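The gap between input and output rates can be checked directly against the tables. The sketch below is illustrative only: the rates are copied from this page, while the helper names (`request_cost`, `output_input_ratio`) and the sample token counts are hypothetical.

```python
# Standard (under-200K) rates in USD per 1M tokens, copied from the tables on this page.
RATES = {
    "gemini-2.0-flash-lite": (0.075, 0.30),
    "gemini-2.5-flash": (0.30, 2.50),
    "gemini-2.5-pro": (1.25, 10.00),
    "gemini-3-flash": (0.50, 3.00),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request at the flat per-million rates above."""
    input_rate, output_rate = RATES[model]
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


def output_input_ratio(model: str) -> float:
    """How many times more an output token costs than an input token."""
    input_rate, output_rate = RATES[model]
    return output_rate / input_rate


if __name__ == "__main__":
    for model in RATES:
        print(f"{model}: output is {output_input_ratio(model):.1f}x input")
    # Hypothetical request: 2,000 input tokens and 500 output tokens.
    print(request_cost("gemini-2.5-flash", 2_000, 500))
```

Because output is priced at a multiple of input, trimming response length usually moves the bill more than trimming the prompt by the same number of tokens.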
How Gemini Pricing Works
Gemini models use three pricing structures. Flat pricing charges a fixed rate per million tokens regardless of context length. This applies to Flash and Flash Lite models, making costs simple and predictable.
Breakpoint pricing applies to Pro-tier models like Gemini 2.5 Pro and Gemini 3 Pro. These models have a lower rate for requests under 200K input tokens and a higher rate above that threshold, letting you benefit from lower costs for typical workloads.
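A minimal sketch of how a breakpoint rate might be applied, using the Gemini 2.5 Pro figures from the table. Two assumptions are not stated on this page and should be checked against Google's documentation: that the whole request (input and output) is billed at the tier selected by its input size, and that a request of exactly 200K input tokens stays on the lower tier. The class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BreakpointRates:
    """Per-1M-token USD rates on either side of an input-token threshold."""
    threshold_tokens: int
    input_low: float
    input_high: float
    output_low: float
    output_high: float


# Gemini 2.5 Pro rates as listed in the table on this page.
GEMINI_2_5_PRO = BreakpointRates(200_000, 1.25, 2.50, 10.00, 15.00)


def breakpoint_cost(rates: BreakpointRates, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request under breakpoint pricing.

    Assumption: the input size picks the tier for the entire request,
    and exactly `threshold_tokens` still counts as the lower tier.
    """
    long_context = input_tokens > rates.threshold_tokens
    input_rate = rates.input_high if long_context else rates.input_low
    output_rate = rates.output_high if long_context else rates.output_low
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


if __name__ == "__main__":
    print(breakpoint_cost(GEMINI_2_5_PRO, 100_000, 1_000))  # lower tier
    print(breakpoint_cost(GEMINI_2_5_PRO, 300_000, 1_000))  # higher tier
```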
Multimodal pricing is used by image and audio models. These have separate rates for each modality. For example, Gemini 3 Pro Image charges $12/1M for text output but $120/1M for image output tokens.
Gemini vs Claude vs GPT: Price Comparison
How does Gemini API pricing stack up against Claude and GPT? Here's a side-by-side comparison at each price tier. See our full LLM pricing comparison for all models.
| Tier | Model | Input / 1M | Output / 1M |
|---|---|---|---|
| Budget | Google gemini-2.0-flash-lite | $0.075 | $0.30 |
| Budget | Anthropic claude-haiku-3 | $0.25 | $1.25 |
| Budget | OpenAI gpt-4.1-nano | $0.10 | $0.40 |
| Mid-range | Google gemini-2.5-flash | $0.30 | $2.50 |
| Mid-range | Anthropic claude-haiku-4-5 | $1.00 | $5.00 |
| Mid-range | OpenAI gpt-5-mini | $0.25 | $2.00 |
| Flagship | Google gemini-2.5-pro | $1.25 | $10.00 |
| Flagship | Anthropic claude-sonnet-4-6 | $3.00 | $15.00 |
| Flagship | OpenAI gpt-5 | $1.25 | $10.00 |
Gemini API Monthly Cost Estimates
How much will the Gemini API cost you per month? Google's pricing is among the most competitive, especially at the budget tier. Use our LLM cost calculator for a precise estimate.
Light Use
$1-10/mo
- Personal projects
- <1K requests/day
- Flash Lite for most tasks
Medium Use
$10-75/mo
- Small team apps
- 1-5K requests/day
- Mix of Flash and Pro
Heavy Use
$75-400/mo
- Production apps
- 5-20K requests/day
- Pro for quality tasks
Enterprise
$400+/mo
- Large-scale deployments
- 20K+ requests/day
- Pro with multimodal
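To see how a usage tier translates into a monthly figure, a rough estimate multiplies daily requests by tokens per request and the per-million rates. The sketch below assumes a 30-day month and a hypothetical workload (1,000 requests/day, 500 input and 200 output tokens each) on Gemini 2.0 Flash Lite; only the rates come from this page.

```python
def monthly_cost(
    requests_per_day: int,
    input_tokens_per_request: int,
    output_tokens_per_request: int,
    input_rate: float,   # USD per 1M input tokens
    output_rate: float,  # USD per 1M output tokens
    days: int = 30,      # assumption: a 30-day billing month
) -> float:
    """Rough monthly spend in USD for a steady, flat-priced workload."""
    total_requests = requests_per_day * days
    input_millions = total_requests * input_tokens_per_request / 1_000_000
    output_millions = total_requests * output_tokens_per_request / 1_000_000
    return input_millions * input_rate + output_millions * output_rate


if __name__ == "__main__":
    # Hypothetical light workload on Gemini 2.0 Flash Lite ($0.075 in, $0.30 out).
    estimate = monthly_cost(1_000, 500, 200, input_rate=0.075, output_rate=0.30)
    print(f"Estimated monthly cost: ${estimate:.3f}")
```

With these inputs the estimate comes to a little under $3 per month, inside the Light Use range. Real bills will differ with traffic spikes, retries, and longer prompts.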
Which Gemini Model Should You Use?
Google's model lineup covers everything from ultra-cheap inference to multimodal generation. Here's how to pick the right Gemini model for your use case.
| Use Case | Recommended Model | Est. Monthly Cost | Why This Model |
|---|---|---|---|
| High-volume classification | Gemini 2.0 Flash Lite | $1-10 | One of the cheapest LLM APIs available |
| General chatbot | Gemini 2.5 Flash | $10-50 | Strong reasoning at very low cost |
| Code & complex reasoning | Gemini 2.5 Pro | $30-150 | Best Gemini model for hard tasks |
| Image generation | Gemini 3 Pro Image | $50-300 | Native multimodal output |
| Real-time applications | Gemini 2.0 Flash | $3-20 | Fastest inference, low latency |
5 Ways to Reduce Your Gemini API Costs
Use Flash Lite for simple tasks
At $0.075/1M input tokens, Gemini 2.0 Flash Lite is roughly 17x cheaper on input than Gemini 2.5 Pro ($1.25/1M). Use it for classification, extraction, and any task that doesn't need deep reasoning.
Stay under the 200K breakpoint
Gemini Pro models double their input cost above 200K tokens. Split long documents into chunks or use Flash models for long-context tasks.
Check free-tier quotas
Google offers free-tier access to the Gemini API through AI Studio, subject to rate limits. Check your dashboard before assuming production traffic will be free.
Be mindful of multimodal costs
Image output tokens on Gemini 3 Pro Image cost significantly more than text. Only use multimodal models when you actually need image or audio output.
Monitor usage with the ModelPricing API
Track your Gemini API spending programmatically to identify cost spikes and optimize model selection.
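The 200K breakpoint tip can be made concrete with a small comparison of input cost for one long request versus the same tokens split into chunks. This is a sketch under stated assumptions: Gemini 2.5 Pro input rates from this page, a request billed entirely at the higher rate once it exceeds 200K input tokens, and no overhead from chunking (in practice, repeated instructions or overlapping context add tokens). The function names and the 150K chunk size are hypothetical.

```python
THRESHOLD_TOKENS = 200_000
INPUT_LOW = 1.25   # USD per 1M input tokens at or below the threshold
INPUT_HIGH = 2.50  # USD per 1M input tokens above the threshold


def input_cost(tokens: int) -> float:
    """Input cost in USD of a single request of `tokens` input tokens."""
    rate = INPUT_HIGH if tokens > THRESHOLD_TOKENS else INPUT_LOW
    return tokens * rate / 1_000_000


def chunked_input_cost(total_tokens: int, chunk_size: int = 150_000) -> float:
    """Input cost in USD when the same tokens are sent as separate chunks.

    Assumption: chunking adds no extra tokens, which is optimistic.
    """
    if not 0 < chunk_size <= THRESHOLD_TOKENS:
        raise ValueError("chunk_size must be positive and at most the threshold")
    full_chunks, remainder = divmod(total_tokens, chunk_size)
    cost = full_chunks * input_cost(chunk_size)
    if remainder:
        cost += input_cost(remainder)
    return cost


if __name__ == "__main__":
    document_tokens = 300_000  # hypothetical long document
    print("single request:", input_cost(document_tokens))
    print("two chunks:    ", chunked_input_cost(document_tokens))
```

Under these assumptions the chunked version costs half as much on input, but splitting only helps when the task does not need the whole document in a single context.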
Gemini Model Tiers
Flash Lite
The most affordable tier. Gemini 2.0 Flash Lite starts at just $0.075/1M input tokens, ideal for high-volume tasks like classification, extraction, and real-time applications.
Flash
Fast and cost-effective. Gemini 2.5 Flash and 3 Flash offer strong reasoning at $0.30-0.50/1M input tokens, great for coding, analysis, and multi-step workflows.
Pro
Most capable tier for complex reasoning and research. Gemini 2.5 Pro and 3 Pro use breakpoint pricing starting at $1.25-2/1M input tokens under 200K context.
Frequently Asked Questions
How much does the Gemini API cost?
Gemini API pricing varies by model. Gemini 2.0 Flash Lite starts at $0.075/1M input tokens, while Gemini 2.5 Pro costs $1.25-2.50/1M input tokens depending on context length.
What is the cheapest Gemini model?
Gemini 2.0 Flash Lite is the most affordable at $0.075/1M input tokens and $0.30/1M output tokens, making it one of the cheapest LLM APIs available.
Does Gemini have breakpoint pricing?
Yes. Gemini Pro-tier models such as Gemini 2.5 Pro, Gemini 3 Pro Preview, and Gemini 3.1 Pro Preview use breakpoint pricing where long-context requests above 200K input tokens move to a higher tier.
How does Gemini pricing compare to GPT and Claude?
Gemini Flash and Flash Lite models are very competitive for low-latency and high-volume work, while Pro models are priced around the mid-to-flagship tier. The best fit depends heavily on context length and multimodal needs.
Does Gemini support multimodal pricing?
Yes. Models like Gemini 3 Pro Image and Gemini 2.5 Flash Image have separate rates for text and image tokens, with image output costing significantly more than text output.
Is the Gemini API free?
Google AI Studio offers free-tier quotas with rate limits, and paid usage starts from the low-cost Flash Lite tiers. Check the Google AI Studio or Cloud billing dashboard for current quotas before relying on free usage.
How much does Gemini 3 cost?
Gemini 3 Flash costs $0.50/1M input tokens and $3/1M output tokens with flat pricing. Gemini 3 Pro Preview uses breakpoint pricing at $2/1M input (under 200K tokens) or $4/1M input (above 200K tokens).
Which Gemini model is best for coding?
Gemini 2.5 Pro is the strongest Gemini model for code generation and debugging, starting at $1.25/1M input tokens. For lighter coding tasks, Gemini 2.5 Flash at $0.30/1M input offers good performance at a much lower cost.
Estimate Your Gemini API Costs
Get accurate, real-time cost estimates for any Gemini model with our API. Or try the LLM cost calculator to compare across all providers.