Meta

Llama 2 70b pricing

If you are budgeting for Llama 2 70b, start with the numbers below. We index prices from 2 providers. The cheapest input rate is $0.650 per million tokens and the cheapest output rate is $2.75 per million tokens, both from Replicate. Our data lists a 4K (4,096-token) context window for the model.

4K context · 2 providers · verified Mar 13, 2026


Best input: $0.650 per 1M tokens · Replicate

Best output: $2.75 per 1M tokens · Replicate

Pricing across providers

All figures are list prices per million tokens unless a column says otherwise. 2 offers are listed for Llama 2 70b. Best input in this view: Replicate.


Provider	Input / 1M	Output / 1M	Cached input	Batch
Replicate	$0.650	$2.75	—	—
Perplexity	$0.700	$2.80	—	—


Cost calculator

Use this block to stress-test Llama 2 70b cost without a spreadsheet. All estimates come from the public list rates on this page.

Provider: Replicate (In: $0.650/M · Out: $2.75/M)
Inputs: requests / day, average input tokens, average output tokens
Input cost: 0.065000¢ / req
Output cost: 0.137500¢ / req
Daily: $20
Monthly: $608
Annual: $7.4K
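
The arithmetic behind the calculator can be sketched in a few lines of Python. This is a minimal illustration, not the page's actual implementation. The rates are the Replicate list prices shown above. The workload (10,000 requests per day, 1,000 input tokens and 500 output tokens per request) and the 30-day month and 365-day year multipliers are assumptions chosen because they are consistent with the per-request, Daily, Monthly, and Annual figures displayed. The names `Rates`, `cost_per_request`, and `project` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """List prices in USD per one million tokens."""
    input_per_million: float
    output_per_million: float


# Replicate list prices taken from the provider table on this page.
REPLICATE = Rates(input_per_million=0.650, output_per_million=2.75)


def cost_per_request(rates: Rates, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at the given list rates."""
    input_cost = input_tokens * rates.input_per_million / 1_000_000
    output_cost = output_tokens * rates.output_per_million / 1_000_000
    return input_cost + output_cost


def project(
    rates: Rates,
    requests_per_day: int,
    input_tokens: int,
    output_tokens: int,
    days_per_month: int = 30,   # assumption: matches the Monthly figure shown
    days_per_year: int = 365,   # assumption: matches the Annual figure shown
) -> dict[str, float]:
    """Project daily, monthly, and annual spend in USD."""
    daily = requests_per_day * cost_per_request(rates, input_tokens, output_tokens)
    return {
        "daily": daily,
        "monthly": daily * days_per_month,
        "annual": daily * days_per_year,
    }


if __name__ == "__main__":
    # Assumed workload: 10,000 requests/day, 1,000 input and 500 output tokens each.
    estimate = project(REPLICATE, 10_000, 1_000, 500)
    for period, usd in estimate.items():
        print(f"{period}: ${usd:,.2f}")
```

Under these assumptions the projection comes to about $20.25 per day, $607.50 per month, and $7,391.25 per year, which round to the figures shown. Swapping in Perplexity's rates ($0.700 input, $2.80 output) gives the equivalent estimate for the other listed provider.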

Model specifications

These fields describe Llama 2 70b as we store it (source: Meta). They sit next to price so buyers can check limits and tools in one place.

Context window: 4,096 tokens
Max output: 4,096 tokens
Vision (images): No
Tool / function calling: No
Streaming: No
Released: N/A
Primary provider: Meta
Model family: N/A
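
When sizing requests against these limits, a quick budget check can help. The sketch below assumes that prompt and completion tokens share the 4,096-token context window, which is typical for models of this kind but is not stated on this page. The function name `max_completion_tokens` is hypothetical.

```python
CONTEXT_WINDOW = 4096  # tokens, from the specification list above
MAX_OUTPUT = 4096      # tokens, from the specification list above


def max_completion_tokens(
    prompt_tokens: int,
    context_window: int = CONTEXT_WINDOW,
    max_output: int = MAX_OUTPUT,
) -> int:
    """Return how many output tokens remain for a prompt of the given size.

    Assumes the prompt and the completion share one context window.
    Returns 0 when the prompt already fills or exceeds the window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = context_window - prompt_tokens
    return max(0, min(remaining, max_output))


if __name__ == "__main__":
    for prompt in (0, 1_000, 4_096, 5_000):
        print(prompt, max_completion_tokens(prompt))
```

A 1,000-token prompt, for example, would leave room for at most 3,096 output tokens under this assumption.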

Compare Llama 2 70b

Open a pair page to see Llama 2 70b next to another model with a shared provider matrix. 6 shortcuts below.


Popular Llama 2 70b comparisons

Frequently asked questions

Answers pull from the same numbers you see on this page.

Llama 2 70b is available through two providers: Replicate and Perplexity.

Also from Meta

Other models by Meta with live pricing in our catalog.

$0.40/M out · 128K ctx

$0.03/M out · 128K ctx

$0.00/M out · 33K ctx

$0.00/M out · 4K ctx

$0.20/M out · 16K ctx