GPT-3.5 Turbo 16k pricing
This model offers four times the context length of gpt-3.5-turbo, allowing it to support approximately 20 pages of text in a single request at a higher cost. Training data: up to Sep 2021. This page tracks 2 listings in total. Highlighted lows are $3.00 per million input and $4.00 per million output (see table for which seller matches each).
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. 2 offers are listed for GPT-3.5 Turbo 16k. Best input in this view: OpenAI.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O OpenAInative | $3.00 | $4.00 | — | — |
O Openrouter | $3.00 | $4.00 | — | — |
Input vs output · per provider
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how GPT-3.5 Turbo 16k cost scales with traffic.
Provider
0.300000¢ / req
0.200000¢ / req
Model specifications
Context length, caps, and capability flags for GPT-3.5 Turbo 16k. Values follow the main provider (OpenAI) record in our index.
- Context window
- 16,385 tokens
- Max output
- 4,096 tokens
- Vision (images)
- No
- Tool / function calling
- Yes
- Streaming
- No
- Released
- Aug 2023
- Primary provider
- OpenAI
- Model family
- N/A
Compare GPT-3.5 Turbo 16k
Open a pair page to see GPT-3.5 Turbo 16k next to another model with a shared provider matrix. 6 shortcuts below.
- GPT-3.5 Turbo 16k vs GPT-4o
GPT-3.5 Turbo 16k 60% cheaper on output
- GPT-3.5 Turbo 16k vs GPT-4o mini
GPT-4o mini 85% cheaper on output
- GPT-3.5 Turbo 16k vs o3
GPT-3.5 Turbo 16k 50% cheaper on output
- GPT-3.5 Turbo 16k vs Claude Sonnet 4.6
GPT-3.5 Turbo 16k 73% cheaper on output
- GPT-3.5 Turbo 16k vs Gemini 2.0 Flash
Gemini 2.0 Flash 90% cheaper on output
- GPT-3.5 Turbo 16k vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Quick frequently asked items for GPT-3.5 Turbo 16k pricing and limits. The short model note from our index: This model offers four times the context length of gpt-3.5-turbo, allowing it to support approximately 20 pages of text in a single request at a higher cost. Training data: up to Sep 2021.
Also from OpenAI
Other models by OpenAI with live pricing in our catalog.