Gemini Cost Calculator

Estimate Google Gemini API spend per call, day, month and year — then compare against OpenAI and Claude and see how much switching could save.

Inputs

Model

Choose the model tier you plan to use in production.

Input tokens / request

Average prompt size per API call. A page of text ≈ 750 tokens.

Output tokens / request

Average completion length. Short answers ≈ 200–500 tokens.

Monthly requests

Total API calls per month across all users / jobs.

Cache hit ratio: 0.00

Fraction of input tokens served from prompt cache (costs ~10× less). 0 = no caching.

Apply Batch API discount (50%)(Async batch jobs run cheaper. Only available for non-realtime workloads.)

Estimated yearly cost

$750,000.00/yr

Switch to OpenAI GPT-5

Switching to OpenAI GPT-5 would cut this workload from $750,000.00/yr to $750,000.00/yr.

Monthly cost

$62,500.00/mo

Daily cost

$2,083.33/day

Cost per request

$6.2500

Model

Gemini 2.5 Pro

Cost breakdown

Item	Monthly	Yearly
Input tokens	$10,000.00	$120,000.00
Output tokens	$5,000.00	$60,000.00
Spend	$62,500.00	$750,000.00

Comparison

Option	Monthly	Yearly
OpenAI GPT-5cheapest	$62,500.00	$750,000.00
Claude Claude Sonnet 4.6	$105,000.00	$1,260,000.00
Gemini Gemini 2.5 Procurrent	$62,500.00	$750,000.00

Data updated 2026-06-30 · openai.com/api/pricing openai.com/api/pricing · platform.claude.com/docs/about-claude/pricing platform.claude.com/docs/about-claude/pricing · ai.google.dev/gemini-api/docs/pricing ai.google.dev/gemini-api/docs/pricing

Industry Benchmark

Output price vs. peer average ($/1M tokens)Industry avg: 11.67 $/1M

You are at the 43th percentile

Trends & comparison

Trend

Comparison (monthly vs. yearly)

How to use this calculator

Pick a Gemini model, enter token volumes and monthly requests. The result compares Gemini against OpenAI and Claude comparable tiers and recommends the cheapest with the annual saving.

Gemini pricing tiers

Gemini 2.5 Pro, Flash and Flash Lite trade capability for cost. Flash tiers are often the cheapest comparable option for high-volume workloads.

Benchmarks & sources

Prices come from a versioned JSON config validated at build time and displayed with their update date in the footer.

Frequently asked questions

How is Gemini API cost calculated?▾

Cost = (input tokens ÷ 1,000,000 × input price) + (output tokens ÷ 1,000,000 × output price), times monthly requests, with cache and batch discounts where enabled.

Pro vs Flash vs Flash Lite?▾

Pro is highest quality, Flash is fast and cheap for most production tasks, and Flash Lite is the lowest-cost option for very high volume. The calculator shows the trade-off instantly.

How reliable are the OpenAI and Claude comparisons?▾

They use comparable-tier pricing sourced directly from official provider documentation; verify against official price pages. The footer shows the data version and source.