GPT Audio pricing
The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per million input tokens and $64 per million output tokens. Live index: 2 priced offers. Best input $2.50 per million tokens from Openrouter. Best output $10.00 per million tokens from Openrouter.
Pricing across providers
Use this table to read GPT Audio list prices. We show 2 sources right now. Lowest input in the grid: Openrouter. The chart below the table helps when output prices are much higher than input prices.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O Openrouter | $2.50 | $10.00 | — | — |
O OpenAInative | $2.50 | $10.00 | — | — |
Input vs output · per provider
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how GPT Audio cost scales with traffic.
Provider
0.250000¢ / req
0.500000¢ / req
Model specifications
Quick spec sheet for GPT Audio before you dive back into pricing. Reported under OpenAI.
- Context window
- 128,000 tokens
- Max output
- 16,384 tokens
- Vision (images)
- No
- Tool / function calling
- Yes
- Streaming
- No
- Released
- Jan 2026
- Primary provider
- OpenAI
- Model family
- N/A
Compare GPT Audio
These links open full side by side pages for GPT Audio. We picked pairs that people often shop together. 6 ready to open.
- GPT Audio vs GPT-4o
Same output pricing
- GPT Audio vs GPT-4o mini
GPT-4o mini 94% cheaper on output
- GPT Audio vs o3
o3 20% cheaper on output
- GPT Audio vs Claude Sonnet 4.6
GPT Audio 33% cheaper on output
- GPT Audio vs Gemini 2.0 Flash
Gemini 2.0 Flash 96% cheaper on output
- GPT Audio vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Read these after the table if you want plain language around GPT Audio rates. The short model note from our index: The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced at $32 per mil...
Also from OpenAI
Other models by OpenAI with live pricing in our catalog.