GPT-4o-mini (2024-07-18) pricing
GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable than other recent frontier models, and more than 60% cheaper than [GPT-3.5 Turbo](/models/openai/gpt-3.5-turbo). It maintains SOTA intelligence, while being significantly more cost-effective. GPT-4o mini achieves an 82% score on MMLU and presently ranks higher than GPT-4 on chat preferen. Below you will find 4 current rows with input and output dollars per million. Right now the lowest input is $0.150 and the lowest output is $0.600.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. 4 offers are listed for GPT-4o-mini (2024-07-18). Best input in this view: Openrouter.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O Openrouter | $0.150 | $0.600 | — | — |
GC Github Copilot | N/A | N/A | — | — |
A Azure | $0.165 | $0.660 | $0.083 | — |
O OpenAInative | $0.150 | $0.600 | $0.075 | — |
Input vs output · per provider
Cost calculator
Use this block to stress test GPT-4o-mini (2024-07-18) cost without a spreadsheet. All estimates come from public list rates in this page.
Provider
0.015000¢ / req
0.030000¢ / req
Model specifications
Quick spec sheet for GPT-4o-mini (2024-07-18) before you dive back into pricing. Family: GPT-4o mini. Reported under OpenAI.
- Context window
- 128,000 tokens
- Max output
- 16,384 tokens
- Vision (images)
- Yes
- Tool / function calling
- Yes
- Streaming
- No
- Released
- Jul 2024
- Primary provider
- OpenAI
- Model family
- GPT-4o mini
Compare GPT-4o-mini (2024-07-18)
Open a pair page to see GPT-4o-mini (2024-07-18) next to another model with a shared provider matrix. 6 shortcuts below.
- GPT-4o-mini (2024-07-18) vs GPT-4o
GPT-4o-mini (2024-07-18) 94% cheaper on output
- GPT-4o-mini (2024-07-18) vs GPT-4o mini
Same output pricing
- GPT-4o-mini (2024-07-18) vs o3
GPT-4o-mini (2024-07-18) 92% cheaper on output
- GPT-4o-mini (2024-07-18) vs Claude Sonnet 4.6
GPT-4o-mini (2024-07-18) 96% cheaper on output
- GPT-4o-mini (2024-07-18) vs Gemini 2.0 Flash
Gemini 2.0 Flash 33% cheaper on output
- GPT-4o-mini (2024-07-18) vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Read these after the table if you want plain language around GPT-4o-mini (2024-07-18) rates. The short model note from our index: GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable than othe...
Also from OpenAI
Other models by OpenAI with live pricing in our catalog.