Gemma 4 31B pricing
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function... Live index: 1 priced offer. Best input $0.120 per million tokens from Openrouter. Best output $0.370 per million tokens from Openrouter.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. 1 offer is listed for Gemma 4 31B. Best input in this view: Openrouter.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O Openrouter | $0.120 | $0.370 | — | — |
Input vs output · 1M tokens
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Gemma 4 31B cost scales with traffic.
0.012000¢ / req
0.018500¢ / req
Model specifications
Context length, caps, and capability flags for Gemma 4 31B. Values follow the main provider (Google) record in our index.
- Context window
- 262,144 tokens
- Max output
- 131,072 tokens
- Vision (images)
- Yes
- Tool / function calling
- Yes
- Streaming
- Yes
- Released
- Apr 2026
- Primary provider
- Model family
- N/A
Compare Gemma 4 31B
Open a pair page to see Gemma 4 31B next to another model with a shared provider matrix. 6 shortcuts below.
Locked
Compare with
Pick a model on both sides.
Popular Gemma 4 31B comparisons
- Gemma 4 31B vs Gemini 2.0 Flash
Compare pricing side by side
- Gemma 4 31B vs Gemini 1.5 Pro
Compare pricing side by side
- Gemma 4 31B vs Gemini 1.5 Flash
Compare pricing side by side
- Gemma 4 31B vs GPT-4o
Compare pricing side by side
- Gemma 4 31B vs GPT-4o mini
Compare pricing side by side
- Gemma 4 31B vs Claude Sonnet 4.6
Compare pricing side by side
Frequently asked questions
Quick frequently asked items for Gemma 4 31B pricing and limits. The short model note from our index: Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...
Also from Google
Other models by Google with live pricing in our catalog.