Llama V2 70b pricing
Compare Llama V2 70b API pricing across 1 listed source. The best input rate we show is $0.100 per million tokens from Fireworks AI. The best output rate is $0.100 per million tokens from Fireworks AI.
Pricing across providers
Use this table to read Llama V2 70b list prices. We show 1 source right now. Lowest input in the grid: Fireworks AI. The chart below the table helps when output prices are much higher than input prices.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
FA Fireworks AI | $0.100 | $0.100 | — | — |
Input vs output · 1M tokens
Cost calculator
Pick any provider row and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for Llama V2 70b.
0.010000¢ / req
0.005000¢ / req
Model specifications
Context length, caps, and capability flags for Llama V2 70b. Values follow the main provider (Meta) record in our index.
- Context window
- 4,096 tokens
- Max output
- 4,096 tokens
- Vision (images)
- No
- Tool / function calling
- No
- Streaming
- No
- Released
- N/A
- Primary provider
- Meta
- Model family
- N/A
Compare Llama V2 70b
Open a pair page to see Llama V2 70b next to another model with a shared provider matrix. 6 shortcuts below.
- Llama V2 70b vs Llama 3.1 70B
Compare pricing side by side
- Llama V2 70b vs Llama 3.1 8B
Compare pricing side by side
- Llama V2 70b vs GPT-4o
Compare pricing side by side
- Llama V2 70b vs GPT-4o mini
Compare pricing side by side
- Llama V2 70b vs Claude Sonnet 4.6
Compare pricing side by side
- Llama V2 70b vs Gemini 2.0 Flash
Compare pricing side by side
Frequently asked questions
Answers pull from the same numbers you see on this page.
Also from Meta
Other models by Meta with live pricing in our catalog.