Meta

Llama 2 7b pricing

If you are budgeting for Llama 2 7b, start with the numbers below. We index 1 provider price. Cheapest input is $0.050 per million tokens. Cheapest output is $0.250 per million tokens. The model lists a 4K context window in our data.

4K context·1 provider·verified Mar 13, 2026
Best input$0.050per 1M tokens · Replicate
Best output$0.250per 1M tokens · Replicate

Pricing across providers

All figures are list prices per million tokens unless a column says otherwise. 1 offer is listed for Llama 2 7b. Best input in this view: Replicate.

R
Replicate
Input / 1M
$0.050
Output / 1M
$0.250

Input vs output · 1M tokens

Cost calculator

Use this block to stress test Llama 2 7b cost without a spreadsheet. All estimates come from public list rates in this page.

In: $0.050/M·Out: $0.250/M

0.005000¢ / req

0.012500¢ / req

Daily
$1.75
Monthly
$53
Annual
$639

Model specifications

Context length, caps, and capability flags for Llama 2 7b. Values follow the main provider (Meta) record in our index.

Context window
4,096 tokens
Max output
4,096 tokens
Vision (images)
No
Tool / function calling
No
Streaming
No
Released
N/A
Primary provider
Meta
Model family
N/A

Compare Llama 2 7b

These links open full side by side pages for Llama 2 7b. We picked pairs that people often shop together. 6 ready to open.

Frequently asked questions

Quick frequently asked items for Llama 2 7b pricing and limits.

Yes. Llama 2 7b is available on Replicate.

Also from Meta

Other models by Meta with live pricing in our catalog.