Gemini Cost Calculator
Estimate Google Gemini API spend per call, day, month and year — then compare against OpenAI and Claude and see how much switching could save.
Inputs
Choose the model tier you plan to use in production.
Average prompt size per API call. A page of text ≈ 750 tokens.
Average completion length. Short answers ≈ 200–500 tokens.
Total API calls per month across all users / jobs.
Fraction of input tokens served from prompt cache (costs ~10× less). 0 = no caching.
Cost breakdown
| Item | Monthly | Yearly |
|---|---|---|
| Input tokens | $10,000.00 | $120,000.00 |
| Output tokens | $5,000.00 | $60,000.00 |
| Spend | $62,500.00 | $750,000.00 |
Comparison
| Option | Monthly | Yearly |
|---|---|---|
| OpenAI GPT-5cheapest | $62,500.00 | $750,000.00 |
| Claude Claude Sonnet 4.6 | $105,000.00 | $1,260,000.00 |
| Gemini Gemini 2.5 Procurrent | $62,500.00 | $750,000.00 |
Data updated 2026-06-30 · openai.com/api/pricing openai.com/api/pricing · platform.claude.com/docs/about-claude/pricing platform.claude.com/docs/about-claude/pricing · ai.google.dev/gemini-api/docs/pricing ai.google.dev/gemini-api/docs/pricing
Industry Benchmark
Data updated 2026-06-30 · openai.com/api/pricing openai.com/api/pricing · platform.claude.com/docs/about-claude/pricing platform.claude.com/docs/about-claude/pricing · ai.google.dev/gemini-api/docs/pricing ai.google.dev/gemini-api/docs/pricing
Trends & comparison
Trend
Comparison (monthly vs. yearly)
How to use this calculator
Pick a Gemini model, enter token volumes and monthly requests. The result compares Gemini against OpenAI and Claude comparable tiers and recommends the cheapest with the annual saving.
Gemini pricing tiers
Gemini 2.5 Pro, Flash and Flash Lite trade capability for cost. Flash tiers are often the cheapest comparable option for high-volume workloads.
Benchmarks & sources
Prices come from a versioned JSON config validated at build time and displayed with their update date in the footer.
Frequently asked questions
How is Gemini API cost calculated?▾
Cost = (input tokens ÷ 1,000,000 × input price) + (output tokens ÷ 1,000,000 × output price), times monthly requests, with cache and batch discounts where enabled.
Pro vs Flash vs Flash Lite?▾
Pro is highest quality, Flash is fast and cheap for most production tasks, and Flash Lite is the lowest-cost option for very high volume. The calculator shows the trade-off instantly.
How reliable are the OpenAI and Claude comparisons?▾
They use comparable-tier pricing sourced directly from official provider documentation; verify against official price pages. The footer shows the data version and source.