Meta

Llama 2 70b pricing

If you are budgeting for Llama 2 70b, start with the numbers below. We index 2 provider prices. Cheapest input is $0.650 per million tokens. Cheapest output is $2.75 per million tokens. The model lists a 4K context window in our data.

4K context·2 providers·verified Mar 13, 2026
Best input$0.650per 1M tokens · Replicate
Best output$2.75per 1M tokens · Replicate

Pricing across providers

All figures are list prices per million tokens unless a column says otherwise. 2 offers are listed for Llama 2 70b. Best input in this view: Replicate.

R
Replicate
Input / 1M
$0.650
Output / 1M
$2.75
P
Perplexity
Input / 1M
$0.700
Output / 1M
$2.80

Input vs output · per provider

Cost calculator

Use this block to stress test Llama 2 70b cost without a spreadsheet. All estimates come from public list rates in this page.

Provider

In: $0.650/M·Out: $2.75/M

0.065000¢ / req

0.137500¢ / req

Daily
$20
Monthly
$608
Annual
$7.4K

Model specifications

These fields describe Llama 2 70b as we store it (source: Meta). They sit next to price so buyers can check limits and tools in one place.

Context window
4,096 tokens
Max output
4,096 tokens
Vision (images)
No
Tool / function calling
No
Streaming
No
Released
N/A
Primary provider
Meta
Model family
N/A

Compare Llama 2 70b

Open a pair page to see Llama 2 70b next to another model with a shared provider matrix. 6 shortcuts below.

Frequently asked questions

Answers pull from the same numbers you see on this page.

Yes. Llama 2 70b is available on Replicate, Perplexity.

Also from Meta

Other models by Meta with live pricing in our catalog.