Gemini 3.5 Flash pricing
Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro-level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution... This page tracks 3 listings. The lowest listed prices are $1.50 per million input tokens and $9.00 per million output tokens, and all three providers in the table list these rates.
Pricing across providers
Use this table to read Gemini 3.5 Flash list prices. We show 3 sources right now, and all three list the same input price of $1.50 per million tokens. The chart below the table helps when output prices are much higher than input prices; here output costs six times as much as input.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
| Google (native) | $1.50 | $9.00 | $0.150 | — |
| Google Vertex | $1.50 | $9.00 | $0.150 | — |
| OpenRouter | $1.50 | $9.00 | — | — |
Input vs output · per provider
Cost calculator
Pick any of the providers above and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for Gemini 3.5 Flash.
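The arithmetic behind such a calculator can be sketched in a few lines. This is a minimal illustration, not the calculator this page actually runs: the names `Rates`, `daily_cost`, and `projected_costs` are hypothetical, the rates are the $1.50 and $9.00 list prices from the table, a year is assumed to be 365 days, and cached-input or batch pricing is ignored.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class Rates:
    """USD per 1M tokens, copied from the provider table on this page."""
    input_per_m: float = 1.50
    output_per_m: float = 9.00


def daily_cost(input_tokens: int, output_tokens: int, rates: Rates = Rates()) -> float:
    """Rough USD cost for one day's worth of input and output tokens."""
    total = input_tokens * rates.input_per_m + output_tokens * rates.output_per_m
    return total / TOKENS_PER_MILLION


def projected_costs(input_per_day: int, output_per_day: int, rates: Rates = Rates()) -> dict:
    """Scale a daily estimate to a week and a year (assumes 365 days, flat usage)."""
    per_day = daily_cost(input_per_day, output_per_day, rates)
    return {"day": per_day, "week": per_day * 7, "year": per_day * 365}


if __name__ == "__main__":
    # Example workload (an assumption): 2M input and 500k output tokens per day.
    for period, usd in projected_costs(2_000_000, 500_000).items():
        print(f"{period}: ${usd:,.2f}")
```

With the example workload, input contributes $3.00 and output $4.50 per day, so output dominates the bill even though it is a quarter of the token volume.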
Model specifications
Quick spec sheet for Gemini 3.5 Flash before you dive back into pricing. Reported under Google.
- Context window: 1,048,576 tokens
- Max output: 65,536 tokens
- Vision (images): Yes
- Tool / function calling: Yes
- Streaming: Yes
- Released: May 2026
- Primary provider
- Model family: N/A
Compare Gemini 3.5 Flash
Open a pair page to see Gemini 3.5 Flash next to another model with a shared provider matrix. 6 shortcuts below.
Popular Gemini 3.5 Flash comparisons
- Gemini 3.5 Flash vs Gemini 2.0 Flash: Gemini 2.0 Flash is 95% cheaper on output
- Gemini 3.5 Flash vs Gemini 1.5 Pro: Gemini 1.5 Pro is 44% cheaper on output
- Gemini 3.5 Flash vs Gemini 1.5 Flash: Gemini 1.5 Flash is 96% cheaper on output
- Gemini 3.5 Flash vs GPT-4o: Gemini 3.5 Flash is 10% cheaper on output
- Gemini 3.5 Flash vs GPT-4o mini: GPT-4o mini is 93% cheaper on output
- Gemini 3.5 Flash vs Claude Sonnet 4.6: Gemini 3.5 Flash is 40% cheaper on output
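A "percent cheaper on output" figure like those above is presumably a relative difference between two output prices. The sketch below shows one plausible way to compute it; the function name `output_discount` is hypothetical, the $10.00 comparison price is a placeholder rather than any listed model's rate, and the page's exact rounding rule is not stated, so none is applied here.

```python
def output_discount(this_price: float, other_price: float) -> tuple[str, float]:
    """Return which side is cheaper ("this" or "other") and by what percent.

    The percentage is measured relative to the more expensive price,
    which is an assumption about how the page derives its figures.
    """
    if this_price <= 0 or other_price <= 0:
        raise ValueError("prices must be positive")
    low, high = sorted((this_price, other_price))
    percent = (high - low) / high * 100
    cheaper = "this" if this_price <= other_price else "other"
    return cheaper, percent


if __name__ == "__main__":
    # $9.00 is this page's output price; $10.00 is a hypothetical comparison price.
    side, pct = output_discount(9.00, 10.00)
    print(f"{side} model is {pct:.0f}% cheaper on output")
```

Measuring against the higher price keeps the result between 0 and 100 regardless of which model is cheaper.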
Frequently asked questions
Answers pull from the same numbers you see on this page.
Also from Google
Other models by Google with live pricing in our catalog.