GPT-4o Audio pricing
The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input and $80 per million output audio tokens. Live index: 2 priced offers. Best input $2.50 per million tokens from Openrouter. Best output $10.00 per million tokens from Openrouter.
Pricing across providers
Every row is a seller of GPT-4o Audio with token pricing we track. The cheapest input in this snapshot is from Openrouter. The bar chart shows the same input and output dollars per million for a quick scan.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O Openrouter | $2.50 | $10.00 | — | — |
O OpenAInative | $2.50 | $10.00 | — | — |
Input vs output · per provider
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how GPT-4o Audio cost scales with traffic.
Provider
0.250000¢ / req
0.500000¢ / req
Model specifications
These fields describe GPT-4o Audio as we store it (Family: GPT-4o. source: OpenAI). They sit next to price so buyers can check limits and tools in one place.
- Context window
- 128,000 tokens
- Max output
- 16,384 tokens
- Vision (images)
- No
- Tool / function calling
- Yes
- Streaming
- No
- Released
- Aug 2025
- Primary provider
- OpenAI
- Model family
- GPT-4o
Compare GPT-4o Audio
Open a pair page to see GPT-4o Audio next to another model with a shared provider matrix. 6 shortcuts below.
- GPT-4o Audio vs GPT-4o
Same output pricing
- GPT-4o Audio vs GPT-4o mini
GPT-4o mini 94% cheaper on output
- GPT-4o Audio vs o3
o3 20% cheaper on output
- GPT-4o Audio vs Claude Sonnet 4.6
GPT-4o Audio 33% cheaper on output
- GPT-4o Audio vs Gemini 2.0 Flash
Gemini 2.0 Flash 96% cheaper on output
- GPT-4o Audio vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Read these after the table if you want plain language around GPT-4o Audio rates. The short model note from our index: The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currentl...
Also from OpenAI
Other models by OpenAI with live pricing in our catalog.