Question 1

What is the cheapest LLM API?

Accepted Answer

In the current standard-rate table, the lowest input prices are GPT-5 Nano ($0.05/1M), Gemini 2.0 Flash Lite ($0.075/1M), and nano/flash models around $0.10/1M. They are best for high-volume classification, extraction, and routing.

Question 2

How do LLM API prices compare across providers?

Accepted Answer

All three major providers now mix budget, mid-range, flagship, and special-mode pricing. OpenAI has the lowest standard input floor, Gemini is especially competitive in Flash tiers, and Anthropic is simpler on long-context Claude pricing for Sonnet 4.6, Opus 4.8, Fable 5, and Mythos 5.

Question 3

How much do LLM APIs cost per token?

Accepted Answer

LLM APIs typically charge $0.05-15 per million input tokens and $0.30-75 per million output tokens. Costs vary by model tier: budget models cost under $0.15/1M input, mid-range models cost $0.25-3/1M, and flagship models cost $1.25-15/1M input.

Question 4

Which LLM is cheapest for high-volume usage?

Accepted Answer

For high-volume usage, start with the cheapest rows in the live table and validate quality against your workload. GPT-5 Nano and Gemini Flash Lite are good low-cost baselines; route harder requests to Sonnet, Opus, Fable, GPT, or Gemini Pro only when needed.

Question 5

Which is cheaper, OpenAI or Anthropic?

Accepted Answer

At the budget tier, OpenAI usually has the lowest standard input price. At mid-range and flagship tiers, the right answer depends on context length, output volume, latency, quality, and whether cache, batch, priority, fast mode, or data residency pricing applies.

Question 6

What is the cheapest AI API in 2026?

Accepted Answer

The lowest standard input price currently tracked is GPT-5 Nano at $0.05 per million input tokens, with Gemini Flash Lite close behind. Treat this as a starting point: total cost still depends on output length, cache use, and model quality.

Question 7

How much does it cost to run an AI chatbot?

Accepted Answer

Running an AI chatbot typically costs $5-100/month for small-scale deployments using budget models, $100-500/month for production apps with mid-range models, and $500+/month for high-volume enterprise deployments. The exact cost depends on your model choice, message volume, and average conversation length.

Question 8

Are there free LLM APIs?

Accepted Answer

Free access changes often. Google AI Studio has free-tier quotas for Gemini, and some providers offer trial credits or promos, but production API usage is usually paid. Check each provider dashboard before relying on free usage.

Question 9

How do I estimate my LLM API costs?

Accepted Answer

Use our LLM cost calculator to estimate costs based on expected token usage, request volume, and model choice. For programmatic estimates, the ModelPricing API returns costs from the tracked standard rate table.

Model	Input $/1M tokens	Output $/1M tokens	Type	Notes
claude-3-7-sonnet	$3.00	$15.00	Flat
claude-fable-5	$10.00	$50.00	Flat
claude-haiku-3	$0.250	$1.25	Flat
claude-haiku-3-5	$0.800	$4.00	Flat
claude-haiku-4-5	$1.00	$5.00	Flat
claude-mythos-5	$10.00	$50.00	Flat
claude-opus-3	$15.00	$75.00	Flat
claude-opus-4-0	$15.00	$75.00	Flat
claude-opus-4-1	$15.00	$75.00	Flat
claude-opus-4-5	$5.00	$25.00	Flat
claude-opus-4-6	$5.00	$25.00	Flat
claude-opus-4-7	$5.00	$25.00	Flat
claude-opus-4-8	$5.00	$25.00	Flat
claude-sonnet-4-0	$3.00	$15.00	Flat
claude-sonnet-4-5	$3.00	$15.00	Flat
claude-sonnet-4-6	$3.00	$15.00	Flat
gemini-2.0-flash	$0.150	$0.600	Flat
gemini-2.0-flash-lite	$0.075	$0.300	Flat
gemini-2.5-computer-use	$1.25 / $2.50	$10.00 / $15.00	Breakpoint	Threshold: 200K tokens
gemini-2.5-flash	$0.300	$2.50	Flat
gemini-2.5-flash-image	$0.300	$2.50	Multimodal	text, image
gemini-2.5-flash-lite	$0.100	$0.400	Flat
gemini-2.5-flash-native-audio	$0.500	$2.00	Multimodal	text, audio
gemini-2.5-flash-preview-tts	$0.500	$10.00	Flat
gemini-2.5-pro	$1.25 / $2.50	$10.00 / $15.00	Breakpoint	Threshold: 200K tokens
gemini-2.5-pro-preview-tts	$1.00	$20.00	Flat
gemini-3-flash	$0.500	$3.00	Flat
gemini-3-pro-image-preview	$2.00	$12.00	Multimodal	text, image
gemini-3-pro-preview	$2.00 / $4.00	$12.00 / $12.00	Breakpoint	Threshold: 200K tokens
gemini-3.1-flash-image-preview	$0.500	$3.00	Multimodal	text, image
gemini-3.1-flash-lite-preview	$0.250	$1.50	Flat
gemini-3.1-pro-preview	$2.00 / $4.00	$12.00 / $18.00	Breakpoint	Threshold: 200K tokens
gemini-3.5-flash	$1.50	$9.00	Flat
gpt-4.1	$2.00	$8.00	Flat
gpt-4.1-mini	$0.400	$1.60	Flat
gpt-4.1-nano	$0.100	$0.400	Flat
gpt-4o	$2.50	$10.00	Flat
gpt-4o-mini	$0.150	$0.600	Flat
gpt-5	$1.25	$10.00	Flat
gpt-5-codex	$1.25	$10.00	Flat
gpt-5-mini	$0.250	$2.00	Flat
gpt-5-nano	$0.050	$0.400	Flat
gpt-5-pro	$15.00	$120.00	Flat
gpt-5.1	$1.25	$10.00	Flat
gpt-5.1-codex	$1.25	$10.00	Flat
gpt-5.1-codex-max	$1.25	$10.00	Flat
gpt-5.2	$1.75	$14.00	Flat
gpt-5.2-codex	$1.75	$14.00	Flat
gpt-5.2-pro	$21.00	$168.00	Flat
gpt-5.3-codex	$1.75	$14.00	Flat
gpt-5.4	$2.50 / $5.00	$15.00 / $22.50	Breakpoint	Threshold: 272K tokens
gpt-5.4-mini	$0.750	$4.50	Flat
gpt-5.4-nano	$0.200	$1.25	Flat
gpt-5.4-pro	$30.00 / $60.00	$180.00 / $270.00	Breakpoint	Threshold: 272K tokens
gpt-5.5	$5.00 / $10.00	$30.00 / $45.00	Breakpoint	Threshold: 272K tokens
gpt-5.5-pro	$30.00	$180.00	Flat
o1	$15.00	$60.00	Flat
o1-mini	$1.10	$4.40	Flat
o1-pro	$150.00	$600.00	Flat
o3	$2.00	$8.00	Flat
o3-deep-research	$10.00	$40.00	Flat
o3-mini	$1.10	$4.40	Flat
o3-pro	$20.00	$80.00	Flat
o4-mini	$1.10	$4.40	Flat
o4-mini-deep-research	$2.00	$8.00	Flat

Rank	Model	Provider	Input / 1M	Output / 1M
1	gpt-5-nano	OpenAI	$0.050	$0.40
2	gemini-2.0-flash-lite	Google Gemini	$0.075	$0.30
3	gemini-2.5-flash-lite	Google Gemini	$0.100	$0.40
4	gpt-4.1-nano	OpenAI	$0.100	$0.40
5	gemini-2.0-flash	Google Gemini	$0.150	$0.60
6	gpt-4o-mini	OpenAI	$0.150	$0.60
7	gpt-5.4-nano	OpenAI	$0.200	$1.25
8	gemini-3.1-flash-lite-preview	Google Gemini	$0.250	$1.50
9	claude-haiku-3	Anthropic Claude	$0.250	$1.25
10	gpt-5-mini	OpenAI	$0.250	$2.00

Tier	OpenAI	Anthropic	Google
Budget	GPT-5 Nano — $0.05/$0.40	Haiku 3 — $0.25/$1.25	Flash Lite — $0.075/$0.30
Mid-range	GPT-5 — $1.25/$10	Sonnet 4.6 — $3/$15	Gemini 2.5 Pro — $1.25/$10
Flagship	GPT-5.4 — $2.50/$15	Fable 5 / Mythos 5 — $10/$50	Gemini 3.1 Pro — $2/$12
Reasoning	o3 — $2/$8	Sonnet 4.6 — $3/$15	Gemini 2.5 Pro — $1.25/$10

Use Case	Best Model	Provider	Input / 1M	Output / 1M
High-volume chatbot	GPT-5 Nano	OpenAI	$0.05	$0.40
Code generation	Gemini 2.5 Pro	Google	$1.25	$10.00
Document analysis	Gemini 2.5 Flash	Google	$0.30	$2.50
Complex reasoning	Claude Fable 5	Anthropic	$10.00	$50.00
Classification / extraction	Gemini Flash Lite	Google	$0.075	$0.30
Agentic workflows	Claude Sonnet 4.6	Anthropic	$3.00	$15.00

LLM Pricing Comparison — Compare AI API Costs

Side-by-Side Provider Comparison

Cheapest LLM APIs in 2026

OpenAI vs Anthropic vs Google: Quick Comparison

Cheapest LLM API by Use Case

Compare by Provider

Google Gemini

Anthropic Claude

OpenAI

How to Choose the Right Model

Frequently Asked Questions