Gemini 2.5 Flash pricing
Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning). Below you will find 6 current rows with input and output dollars per million. Right now the lowest input is $0.300 and the lowest output is $2.50.
Pricing across providers
Every row is a seller of Gemini 2.5 Flash with token pricing we track. The cheapest input in this snapshot is from Google Vertex. The bar chart shows the same input and output dollars per million for a quick scan.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
GV Google Vertex | $0.300 | $2.50 | $0.030 | — |
G Googlenative | $0.300 | $2.50 | $0.030 | — |
VA Vercel Ai Gateway | $0.300 | $2.50 | — | — |
D DeepInfra | $0.300 | $2.50 | — | — |
O Openrouter | $0.300 | $2.50 | — | — |
R Replicate | $2.50 | $2.50 | — | — |
Input vs output · per provider
Cost calculator
Pick any of the providers above and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for Gemini 2.5 Flash.
Provider
0.030000¢ / req
0.125000¢ / req
Model specifications
Quick spec sheet for Gemini 2.5 Flash before you dive back into pricing. Family: Gemini 2. Reported under Google.
- Context window
- 1,000,000 tokens
- Max output
- 1,000,000 tokens
- Vision (images)
- Yes
- Tool / function calling
- Yes
- Streaming
- No
- Released
- Jun 2025
- Primary provider
- Model family
- Gemini 2
Compare Gemini 2.5 Flash
Open a pair page to see Gemini 2.5 Flash next to another model with a shared provider matrix. 6 shortcuts below.
- Gemini 2.5 Flash vs Gemini 2.0 Flash
Gemini 2.0 Flash 84% cheaper on output
- Gemini 2.5 Flash vs Gemini 1.5 Pro
Gemini 2.5 Flash 50% cheaper on output
- Gemini 2.5 Flash vs Gemini 1.5 Flash
Gemini 1.5 Flash 88% cheaper on output
- Gemini 2.5 Flash vs GPT-4o
Gemini 2.5 Flash 75% cheaper on output
- Gemini 2.5 Flash vs GPT-4o mini
GPT-4o mini 76% cheaper on output
- Gemini 2.5 Flash vs Claude Sonnet 4.6
Gemini 2.5 Flash 83% cheaper on output
Frequently asked questions
Answers pull from the same numbers you see on this page. The short model note from our index: Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide...
Also from Google
Other models by Google with live pricing in our catalog.