LLM Economics

Gemini Cost Calculator

Estimate Google Gemini API spend per call, day, month and year, then compare against OpenAI and Claude to see how much switching could save.

Inputs

Choose the model tier you plan to use in production.

Average prompt size per API call. A page of text ≈ 750 tokens.

Average completion length. Short answers ≈ 200–500 tokens.

Total API calls per month across all users / jobs.

Fraction of input tokens served from the prompt cache (cached tokens cost roughly one-tenth as much). 0 = no caching.

Estimated yearly cost: $750,000.00/yr
Switch to OpenAI GPT-5: at these inputs, OpenAI GPT-5 would cost $750,000.00/yr, the same as the current $750,000.00/yr, so switching would not reduce spend.
Monthly cost: $62,500.00/mo
Daily cost: $2,083.33/day
Cost per request: $6.2500
Model: Gemini 2.5 Pro
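
The displayed figures are consistent with simple period conversions: yearly = monthly × 12, daily = monthly ÷ 30, and per request = monthly ÷ monthly requests. The sketch below reproduces them under two assumptions that the page does not state explicitly: a 30-day month, and 10,000 requests per month (inferred from $62,500.00 ÷ $6.25). The function name period_breakdown is hypothetical.

```python
def period_breakdown(monthly_cost, requests_per_month, days_per_month=30):
    """Convert a monthly cost into per-request, daily and yearly figures.

    days_per_month=30 is an assumption inferred from the displayed numbers.
    """
    if requests_per_month <= 0:
        raise ValueError("requests_per_month must be positive")
    return {
        "per_request": monthly_cost / requests_per_month,
        "daily": monthly_cost / days_per_month,
        "monthly": monthly_cost,
        "yearly": monthly_cost * 12,
    }


# 10,000 requests/month is inferred, not stated on the page.
breakdown = period_breakdown(62_500.00, 10_000)
for label, value in breakdown.items():
    print(f"{label}: ${value:,.2f}")
```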

Cost breakdown

Item | Monthly | Yearly
Input tokens | $10,000.00 | $120,000.00
Output tokens | $5,000.00 | $60,000.00
Spend | $62,500.00 | $750,000.00

Comparison

Option | Monthly | Yearly
OpenAI GPT-5 (cheapest) | $62,500.00 | $750,000.00
Claude Sonnet 4.6 | $105,000.00 | $1,260,000.00
Gemini 2.5 Pro (current) | $62,500.00 | $750,000.00

Data updated 2026-06-30 · Sources: openai.com/api/pricing · platform.claude.com/docs/about-claude/pricing · ai.google.dev/gemini-api/docs/pricing

Industry Benchmark

Output price vs. peer average ($/1M tokens)
Industry avg: $11.67 per 1M tokens
You are at the 43rd percentile.

Trends & comparison

[Charts not reproduced: Trend; Comparison (monthly vs. yearly)]

How to use this calculator

Pick a Gemini model, then enter the token volumes per request and the number of monthly requests. The result compares Gemini against comparable OpenAI and Claude tiers and recommends the cheapest option, along with the annual saving from switching to it.
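
The recommendation step can be sketched as picking the lowest monthly cost and multiplying the difference from the current option by 12. This is a minimal illustration, not the calculator's actual code: the function name recommend is hypothetical, the monthly figures are the ones shown in the Comparison table, and ties are broken by insertion order here because the page does not say how it breaks them.

```python
def recommend(monthly_costs, current):
    """Return the cheapest option and the annual saving versus `current`.

    monthly_costs maps option name -> monthly cost in dollars.
    On a tie, min() returns the first option in insertion order
    (an assumption of this sketch).
    """
    cheapest = min(monthly_costs, key=monthly_costs.get)
    annual_saving = (monthly_costs[current] - monthly_costs[cheapest]) * 12
    return cheapest, annual_saving


# Monthly figures as displayed in the Comparison table.
costs = {
    "OpenAI GPT-5": 62_500.00,
    "Claude Sonnet 4.6": 105_000.00,
    "Gemini 2.5 Pro": 62_500.00,
}
cheapest, saving = recommend(costs, current="Gemini 2.5 Pro")
print(f"Cheapest: {cheapest}, annual saving: ${saving:,.2f}")
```

With the displayed figures the two lowest options tie, so the annual saving is zero even though an alternative is labelled cheapest.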

Gemini pricing tiers

Gemini 2.5 Pro, Flash and Flash Lite trade capability for cost. Flash tiers are often the cheapest comparable option for high-volume workloads.

Benchmarks & sources

Prices come from a versioned JSON config that is validated at build time. The date of the last price update is displayed in the footer.
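
As an illustration of what build-time validation of such a config might look like, here is a minimal sketch. The schema is entirely hypothetical: the field names version, updated, models, input_per_m and output_per_m, the model name and the sample prices are assumptions made for this example and are not taken from the site's real config.

```python
import json
from datetime import date

# Hypothetical config; field names and prices are placeholders.
SAMPLE = """
{
  "version": "example-1",
  "updated": "2026-06-30",
  "models": {
    "example-model": {"input_per_m": 1.0, "output_per_m": 2.0}
  }
}
"""


def validate_pricing(raw):
    """Parse a pricing config and raise ValueError if it is malformed."""
    config = json.loads(raw)
    for field in ("version", "updated", "models"):
        if field not in config:
            raise ValueError(f"missing field: {field}")
    # fromisoformat raises ValueError on a malformed date string.
    date.fromisoformat(config["updated"])
    for name, prices in config["models"].items():
        for key in ("input_per_m", "output_per_m"):
            value = prices.get(key)
            is_number = isinstance(value, (int, float)) and not isinstance(value, bool)
            if not is_number or value < 0:
                raise ValueError(f"{name}.{key} must be a non-negative number")
    return config


config = validate_pricing(SAMPLE)
print(f"Pricing data version {config['version']}, updated {config['updated']}")
```

Failing the build on a ValueError keeps a malformed or negative price from reaching the published page.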

Frequently asked questions

How is Gemini API cost calculated?

Monthly cost = [(input tokens per request ÷ 1,000,000 × input price) + (output tokens per request ÷ 1,000,000 × output price)] × monthly requests, with cache and batch discounts applied where enabled. Prices are quoted per 1M tokens.
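
The formula can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the function name monthly_cost is hypothetical, the prices in the usage example are illustrative placeholders rather than quoted rates, and the cache multiplier of 0.1 reflects the "roughly 10× less" figure given in the Inputs section. The batch discount is left out because the page does not state its rate.

```python
def monthly_cost(input_tokens, output_tokens, requests_per_month,
                 input_price_per_m, output_price_per_m,
                 cached_fraction=0.0, cache_multiplier=0.1):
    """Estimate monthly API spend in dollars.

    input_tokens / output_tokens are per-request averages.
    Prices are dollars per 1M tokens.
    cached_fraction is the share of input tokens served from cache;
    cache_multiplier=0.1 is an assumption (cached tokens ~10x cheaper).
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    # Blend the full and cached input price by the cached share.
    blended_input_price = input_price_per_m * (
        (1.0 - cached_fraction) + cached_fraction * cache_multiplier
    )
    per_request = (
        input_tokens / 1_000_000 * blended_input_price
        + output_tokens / 1_000_000 * output_price_per_m
    )
    return per_request * requests_per_month


# Placeholder prices ($1.25 in, $10.00 out per 1M tokens); check the
# official pricing page for current rates.
no_cache = monthly_cost(1_000_000, 500_000, 10_000, 1.25, 10.00)
half_cached = monthly_cost(1_000_000, 500_000, 10_000, 1.25, 10.00,
                           cached_fraction=0.5)
print(f"No caching:  ${no_cache:,.2f}/mo")
print(f"50% cached:  ${half_cached:,.2f}/mo")
```

Because caching only discounts input tokens, its effect is small when output tokens dominate the bill, as they do in this example.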

Pro vs Flash vs Flash Lite?

Pro is highest quality, Flash is fast and cheap for most production tasks, and Flash Lite is the lowest-cost option for very high volume. The calculator shows the trade-off instantly.

How reliable are the OpenAI and Claude comparisons?

They use comparable-tier pricing taken from official provider documentation. Prices change, so verify the figures against the official price pages before relying on them. The footer shows the data version and source.
