Gemini 2.5 Flash Lite pricing
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance... The table below lists 3 current provider rows with input and output prices in dollars per million tokens. The lowest input price is currently $0.100 per million tokens and the lowest output price is $0.400 per million tokens.
Pricing across providers
Each row is a provider selling Gemini 2.5 Flash Lite whose token pricing we track. In this snapshot all three providers list the same input and output rates, so Google is tied for the cheapest input. The bar chart shows the same input and output prices per million tokens for a quick scan.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
| Google (native) | $0.100 | $0.400 | $0.010 | — |
| OpenRouter | $0.100 | $0.400 | — | — |
| Google Vertex | $0.100 | $0.400 | $0.010 | — |
Input vs output · per provider
Cost calculator
Use this block to stress test Gemini 2.5 Flash Lite cost without a spreadsheet. All estimates come from public list rates in this page.
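The same per-request arithmetic can be sketched in a few lines of Python. This is a minimal sketch, not the calculator this page uses: the names `Rates` and `request_cost_usd` are hypothetical, the rates are copied from the Google row of the table above, and it assumes cached input tokens are billed at the cached rate in place of the regular input rate.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Rates:
    """List prices in US dollars per 1 million tokens."""
    input_per_m: float
    output_per_m: float
    cached_input_per_m: Optional[float] = None  # None when no cached rate is listed


# Rates from the Google row of the pricing table on this page.
GOOGLE = Rates(input_per_m=0.100, output_per_m=0.400, cached_input_per_m=0.010)


def request_cost_usd(rates: Rates, input_tokens: int, output_tokens: int,
                     cached_tokens: int = 0) -> float:
    """Estimate the dollar cost of one request.

    Assumption: `cached_tokens` is the part of `input_tokens` served from
    cache, billed at the cached rate instead of the regular input rate.
    """
    if cached_tokens < 0 or cached_tokens > input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    if cached_tokens and rates.cached_input_per_m is None:
        raise ValueError("this provider lists no cached input rate")

    fresh_tokens = input_tokens - cached_tokens
    total = fresh_tokens * rates.input_per_m + output_tokens * rates.output_per_m
    if cached_tokens:
        total += cached_tokens * rates.cached_input_per_m
    return total / 1_000_000  # rates are quoted per million tokens


if __name__ == "__main__":
    cost = request_cost_usd(GOOGLE, input_tokens=1_000, output_tokens=500)
    print(f"${cost:.6f} per request")
```

At these list rates, a request with 1,000 input tokens and 500 output tokens costs $0.0001 for input plus $0.0002 for output, or $0.0003 in total. Multiplying by an expected request volume gives a rough monthly figure.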
Model specifications
Quick spec sheet for Gemini 2.5 Flash Lite before you dive back into pricing. Family: Gemini 2. Reported under Google.
- Context window: 1,048,576 tokens
- Max output: 65,535 tokens
- Vision (images): Yes
- Tool / function calling: Yes
- Streaming: No
- Released: Jul 2025
- Primary provider: Google
- Model family: Gemini 2
Compare Gemini 2.5 Flash Lite
Open a pair page to see Gemini 2.5 Flash Lite next to another model with a shared provider matrix. 6 shortcuts below.
- Gemini 2.5 Flash Lite vs Gemini 2.0 Flash: Same output pricing
- Gemini 2.5 Flash Lite vs Gemini 1.5 Pro: Gemini 2.5 Flash Lite 92% cheaper on output
- Gemini 2.5 Flash Lite vs Gemini 1.5 Flash: Gemini 1.5 Flash 25% cheaper on output
- Gemini 2.5 Flash Lite vs GPT-4o: Gemini 2.5 Flash Lite 96% cheaper on output
- Gemini 2.5 Flash Lite vs GPT-4o mini: Gemini 2.5 Flash Lite 33% cheaper on output
- Gemini 2.5 Flash Lite vs Claude Sonnet 4.6: Gemini 2.5 Flash Lite 97% cheaper on output
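The "X% cheaper on output" labels above are presumably relative differences between two output prices. A minimal sketch of that calculation follows, assuming the percentage is taken relative to the more expensive model's price. The function name `percent_cheaper` is hypothetical, and the comparison prices in the example ($0.60 and $0.30 per million output tokens) are illustrative assumptions, not figures from this page. Only the $0.400 output rate for Gemini 2.5 Flash Lite comes from the table above.

```python
def percent_cheaper(price: float, other_price: float) -> float:
    """Return how much cheaper `price` is than `other_price`,
    as a percentage of `other_price`.

    A negative result means `price` is the more expensive of the two.
    """
    if other_price <= 0:
        raise ValueError("other_price must be positive")
    return (other_price - price) / other_price * 100


if __name__ == "__main__":
    flash_lite_output = 0.400  # $ per 1M output tokens, from the table on this page

    # Hypothetical comparison prices, chosen only to illustrate the formula.
    pricier_model_output = 0.60
    cheaper_model_output = 0.30

    # Flash Lite against a pricier model: Flash Lite is the cheaper one.
    print(round(percent_cheaper(flash_lite_output, pricier_model_output)))
    # Flash Lite against a cheaper model: the other model is the cheaper one.
    print(round(percent_cheaper(cheaper_model_output, flash_lite_output)))
```

Under these assumed prices the two calls round to 33 and 25, matching the form of the labels in the list. Equal prices give 0, which corresponds to "Same output pricing".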
Frequently asked questions
Quick answers to common questions about Gemini 2.5 Flash Lite pricing and limits.
Also from Google
Other models by Google with live pricing in our catalog.