o4 Mini pricing
OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning... Live index: 5 priced offers. Best input $1.00 per million tokens from Replicate. Best output $4.00 per million tokens from Replicate.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. 5 offers are listed for o4 Mini. Best input in this view: Replicate.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O OpenAInative | $1.10 | $4.40 | $0.275 | — |
O Openrouter | $1.10 | $4.40 | — | — |
VA Vercel Ai Gateway | $1.10 | $4.40 | $0.275 | — |
R Replicate | $1.00 | $4.00 | — | — |
A Azure | $1.10 | $4.40 | $0.275 | — |
Input vs output · per provider
Cost calculator
Pick any of the providers above and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for o4 Mini.
Provider
0.110000¢ / req
0.220000¢ / req
Model specifications
Quick spec sheet for o4 Mini before you dive back into pricing. Reported under Azure.
- Context window
- 200,000 tokens
- Max output
- 100,000 tokens
- Vision (images)
- Yes
- Tool / function calling
- Yes
- Streaming
- No
- Released
- Apr 2025
- Primary provider
- Azure
- Model family
- N/A
Compare o4 Mini
These links open full side by side pages for o4 Mini. We picked pairs that people often shop together. 6 ready to open.
- o4 Mini vs GPT-4o
o4 Mini 55% cheaper on output
- o4 Mini vs GPT-4o mini
GPT-4o mini 86% cheaper on output
- o4 Mini vs Claude Sonnet 4.6
o4 Mini 70% cheaper on output
- o4 Mini vs Gemini 2.0 Flash
Gemini 2.0 Flash 90% cheaper on output
- o4 Mini vs o3
o4 Mini 44% cheaper on output
- o4 Mini vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Answers pull from the same numbers you see on this page. The short model note from our index: OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive re...
Also from Azure
Other models by Azure with live pricing in our catalog.