Gemini 3.1 Flash Lite pricing
Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic... Live index: 1 priced offer. Best input $0.250 per million tokens from Openrouter. Best output $1.50 per million tokens from Openrouter.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. 1 offer is listed for Gemini 3.1 Flash Lite. Best input in this view: Openrouter.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O Openrouter | $0.250 | $1.50 | — | — |
Input vs output · 1M tokens
Cost calculator
Pick any provider row and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for Gemini 3.1 Flash Lite.
0.025000¢ / req
0.075000¢ / req
Model specifications
These fields describe Gemini 3.1 Flash Lite as we store it (source: Google). They sit next to price so buyers can check limits and tools in one place.
- Context window
- 1,048,576 tokens
- Max output
- 65,536 tokens
- Vision (images)
- Yes
- Tool / function calling
- Yes
- Streaming
- Yes
- Released
- May 2026
- Primary provider
- Model family
- N/A
Compare Gemini 3.1 Flash Lite
Jump into a comparison when you want one table for two models instead of two tabs. 6 curated matches for Gemini 3.1 Flash Lite.
- Gemini 3.1 Flash Lite vs Gemini 2.0 Flash
Compare pricing side by side
- Gemini 3.1 Flash Lite vs Gemini 1.5 Pro
Compare pricing side by side
- Gemini 3.1 Flash Lite vs Gemini 1.5 Flash
Compare pricing side by side
- Gemini 3.1 Flash Lite vs GPT-4o
Compare pricing side by side
- Gemini 3.1 Flash Lite vs GPT-4o mini
Compare pricing side by side
- Gemini 3.1 Flash Lite vs Claude Sonnet 4.6
Compare pricing side by side
Frequently asked questions
Quick frequently asked items for Gemini 3.1 Flash Lite pricing and limits. The short model note from our index: Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...
Also from Google
Other models by Google with live pricing in our catalog.