GoogleGemini 2VisionTool use

Gemini 2.5 Flash pricing

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning). Below you will find 6 current rows with input and output dollars per million. Right now the lowest input is $0.300 and the lowest output is $2.50.

1.0M context·6 providers·verified Apr 7, 2026
Best input$0.300per 1M tokens · Google Vertex
Best output$2.50per 1M tokens · Google Vertex

Pricing across providers

Every row is a seller of Gemini 2.5 Flash with token pricing we track. The cheapest input in this snapshot is from Google Vertex. The bar chart shows the same input and output dollars per million for a quick scan.

GV
Google Vertex
Input / 1M
$0.300
Output / 1M
$2.50
Cached in: $0.030
G
Googlenative
Input / 1M
$0.300
Output / 1M
$2.50
Cached in: $0.030
VA
Vercel Ai Gateway
Input / 1M
$0.300
Output / 1M
$2.50
D
DeepInfra
Input / 1M
$0.300
Output / 1M
$2.50
O
Openrouter
Input / 1M
$0.300
Output / 1M
$2.50
R
Replicate
Input / 1M
$2.50
Output / 1M
$2.50

Input vs output · per provider

Cost calculator

Pick any of the providers above and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for Gemini 2.5 Flash.

Provider

In: $0.300/M·Out: $2.50/M·Cache: $0.030/M

0.030000¢ / req

0.125000¢ / req

Daily
$16
Monthly
$465
Annual
$5.7K

Model specifications

Quick spec sheet for Gemini 2.5 Flash before you dive back into pricing. Family: Gemini 2. Reported under Google.

Context window
1,000,000 tokens
Max output
1,000,000 tokens
Vision (images)
Yes
Tool / function calling
Yes
Streaming
No
Released
Jun 2025
Primary provider
Google
Model family
Gemini 2

Compare Gemini 2.5 Flash

Open a pair page to see Gemini 2.5 Flash next to another model with a shared provider matrix. 6 shortcuts below.

Frequently asked questions

Answers pull from the same numbers you see on this page. The short model note from our index: Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide...

Gemini 2.5 Flash costs $0.30 per million input tokens and $2.50 per million output tokens via the native API. Prompt caching reduces input costs to $0.03/M tokens.

Also from Google

Other models by Google with live pricing in our catalog.