Google AI API Pricing — Gemini 2.5 Pro, Flash & More
Google's Gemini models combine strong performance with low prices. With context windows of up to 1M tokens on the models listed below and aggressive pricing on the Flash line, they're a top choice for developers building cost-sensitive AI applications at scale.
All Google AI Models
7 models · Prices in USD per 1M tokens
| Model | Tier | Input $/1M | Output $/1M |
|---|---|---|---|
| Gemini 2.5 Pro | Flagship | $1.25 | $10.00 |
| Gemini 2.5 Flash | Mid-tier | $0.30 | $2.50 |
| Gemini 2.5 Flash-Lite | Budget | $0.10 | $0.40 |
| Gemini 2.0 Flash | Budget | $0.10 | $0.40 |
| Gemini 3.1 Pro Preview | Flagship | $2.00 | $12.00 |
| Gemini 3 Flash Preview | Mid-tier | $0.50 | $3.00 |
| Gemini 2.0 Flash-Lite | Budget | $0.07 | $0.30 |
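As a rough illustration of how these per-1M-token rates turn into a bill, the sketch below estimates the cost of a single request. The prices are copied from the table above. The `PRICES` dictionary and the `estimate_cost` helper are hypothetical names used for illustration only and are not part of any Google SDK. The calculation assumes flat per-token billing and ignores any tiering, caching, or free quota.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the table above.
PRICES = {
    "Gemini 2.5 Pro": (Decimal("1.25"), Decimal("10.00")),
    "Gemini 2.5 Flash": (Decimal("0.30"), Decimal("2.50")),
    "Gemini 2.5 Flash-Lite": (Decimal("0.10"), Decimal("0.40")),
    "Gemini 2.0 Flash": (Decimal("0.10"), Decimal("0.40")),
    "Gemini 3.1 Pro Preview": (Decimal("2.00"), Decimal("12.00")),
    "Gemini 3 Flash Preview": (Decimal("0.50"), Decimal("3.00")),
    "Gemini 2.0 Flash-Lite": (Decimal("0.07"), Decimal("0.30")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # A request with 2,000 input tokens and 500 output tokens on Gemini 2.5 Pro.
    print(estimate_cost("Gemini 2.5 Pro", 2_000, 500))
```

Because output tokens cost several times more than input tokens on every model listed, the output length usually dominates the estimate for generation-heavy workloads.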
About Google's Gemini Models
Google's Gemini family is built by Google DeepMind and represents Google's most advanced AI technology. Available through Google AI Studio and Vertex AI, these models are deeply integrated with the Google Cloud ecosystem and support multimodal inputs including text, images, audio, and video.
Gemini 2.5 Pro is Google's flagship model in the 2.5 generation, offering strong reasoning with a 1M-token context window at $1.25 input and $10.00 output per million tokens, which makes it one of the more competitively priced flagship models. It supports a thinking mode for complex multi-step problems.
The Flash models are where Google competes hardest on cost. Gemini 2.5 Flash is listed at $0.30/$2.50, while Gemini 2.5 Flash-Lite and Gemini 2.0 Flash both come in at $0.10/$0.40. For budget-conscious teams, Gemini 2.0 Flash-Lite has the lowest rates in the table at $0.07/$0.30 with a 1M-token context window. All models support context caching to reduce repeated input costs.
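To see why context caching matters for repeated prompts, the sketch below compares input cost with and without a cached prefix. This page does not state the cached-token rate, so `cached_price_ratio` is a placeholder assumption that you would replace with the figure from Google's pricing page. The function name is hypothetical, and any cache storage fees are ignored.

```python
def input_cost_with_cache(
    input_price_per_million: float,
    total_input_tokens: int,
    cached_tokens: int,
    cached_price_ratio: float,
) -> float:
    """Estimate USD input cost when part of the prompt is served from cache.

    cached_price_ratio is the fraction of the normal input price charged
    for cached tokens. It is an assumption here, not a published figure.
    """
    if not 0 <= cached_tokens <= total_input_tokens:
        raise ValueError("cached_tokens must be between 0 and total_input_tokens")
    fresh_tokens = total_input_tokens - cached_tokens
    fresh_cost = fresh_tokens * input_price_per_million
    cached_cost = cached_tokens * input_price_per_million * cached_price_ratio
    return (fresh_cost + cached_cost) / 1_000_000


if __name__ == "__main__":
    # Gemini 2.5 Pro input price from the table; the 0.25 ratio is a placeholder.
    without_cache = input_cost_with_cache(1.25, 100_000, 0, 0.25)
    with_cache = input_cost_with_cache(1.25, 100_000, 80_000, 0.25)
    print(f"without cache: ${without_cache:.4f}, with cache: ${with_cache:.4f}")
```

The saving scales with the share of each prompt that is reused, so caching pays off most for long, stable system prompts or shared documents sent with every request.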
Disclosure: Pricing data on this page is sourced from Google's official AI pricing page and was last verified on 2026-02-28. Prices are subject to change without notice. Always confirm current pricing on the provider's website.