Llama3 1 70b pricing
If you are budgeting for Llama3 1 70b, start with the numbers below. We index 2 provider prices. Cheapest input is $0.600 per million tokens. Cheapest output is $0.600 per million tokens. The model lists a 128K context window in our data.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. 2 offers are listed for Llama3 1 70b. Best input in this view: Cerebras.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
S Snowflake | N/A | N/A | — | — |
C Cerebras | $0.600 | $0.600 | — | — |
Input vs output · 1M tokens
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Llama3 1 70b cost scales with traffic.
Provider
Model specifications
Quick spec sheet for Llama3 1 70b before you dive back into pricing. Reported under Meta.
- Context window
- 128,000 tokens
- Max output
- 128,000 tokens
- Vision (images)
- No
- Tool / function calling
- Yes
- Streaming
- No
- Released
- N/A
- Primary provider
- Meta
- Model family
- N/A
Compare Llama3 1 70b
Jump into a comparison when you want one table for two models instead of two tabs. 6 curated matches for Llama3 1 70b.
- Llama3 1 70b vs Llama 3.1 70B
Compare pricing side by side
- Llama3 1 70b vs Llama 3.1 8B
Compare pricing side by side
- Llama3 1 70b vs GPT-4o
Compare pricing side by side
- Llama3 1 70b vs GPT-4o mini
Compare pricing side by side
- Llama3 1 70b vs Claude Sonnet 4.6
Compare pricing side by side
- Llama3 1 70b vs Gemini 2.0 Flash
Compare pricing side by side
Frequently asked questions
Quick frequently asked items for Llama3 1 70b pricing and limits.
Also from Meta
Other models by Meta with live pricing in our catalog.