Llama 3 3 70b Versatile pricing
If you are budgeting for Llama 3 3 70b Versatile, start with the numbers below. We index one provider price. The cheapest input price is $0.590 per million tokens, and the cheapest output price is $0.790 per million tokens. Our data lists a 128K-token context window for the model.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. One offer is listed for Llama 3 3 70b Versatile, from Groq, which therefore also has the best input price in this view.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
| Groq | $0.590 | $0.790 | N/A | N/A |
Chart: input vs output price per 1M tokens.
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Llama 3 3 70b Versatile cost scales with traffic.
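Input: 0.059000¢ / req (equivalent to 1,000 input tokens at $0.590 per million)
Output: 0.039500¢ / req (equivalent to 500 output tokens at $0.790 per million)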
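The arithmetic behind the calculator can be sketched in a few lines of Python. This is a minimal illustration, not the site's implementation: the function names `request_cost` and `monthly_cost`, the 30-day month, and the example workload are assumptions. Only the two rates come from the table.

```python
# Groq list prices from the table, in dollars per 1M tokens.
INPUT_USD_PER_M = 0.590
OUTPUT_USD_PER_M = 0.790


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the list-price cost of one request in dollars."""
    input_cost = input_tokens * INPUT_USD_PER_M / 1_000_000
    output_cost = output_tokens * OUTPUT_USD_PER_M / 1_000_000
    return input_cost + output_cost


def monthly_cost(requests_per_day: int, input_tokens: int,
                 output_tokens: int, days: int = 30) -> float:
    """Scale the per-request cost to a month of traffic (assumed 30 days)."""
    return requests_per_day * days * request_cost(input_tokens, output_tokens)


if __name__ == "__main__":
    # Hypothetical workload: 1,000 input and 500 output tokens per request.
    per_request = request_cost(1_000, 500)
    print(f"per request: ${per_request:.6f}")
    print(f"per month at 10,000 requests/day: "
          f"${monthly_cost(10_000, 1_000, 500):.2f}")
```

Cost is linear in both token counts, so doubling traffic or prompt length doubles the corresponding part of the bill. Cached-input and batch discounts are left out because none are listed for this offer.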
Model specifications
Quick spec sheet for Llama 3 3 70b Versatile before you dive back into pricing. Family: Llama 3.3. Reported under Meta.
- Context window: 128,000 tokens
- Max output: 32,768 tokens
- Vision (images): No
- Tool / function calling: Yes
- Streaming: No
- Released: N/A
- Primary provider: Meta
- Model family: Llama 3.3
Compare Llama 3 3 70b Versatile
Open a pair page to see Llama 3 3 70b Versatile next to another model in a shared provider matrix and compare pricing side by side. Six shortcuts are listed below.
- Llama 3 3 70b Versatile vs Llama 3.1 70B
- Llama 3 3 70b Versatile vs Llama 3.1 8B
- Llama 3 3 70b Versatile vs GPT-4o
- Llama 3 3 70b Versatile vs GPT-4o mini
- Llama 3 3 70b Versatile vs Claude Sonnet 4.6
- Llama 3 3 70b Versatile vs Gemini 2.0 Flash