Meta

Llama 2 7b Chat Int8 pricing

If you are budgeting for Llama 2 7b Chat Int8, start with the numbers below. We index 1 provider price. Cheapest input is $1.92 per million tokens. Cheapest output is $1.92 per million tokens. The model lists a 2K context window in our data.

2K context·1 provider·verified Apr 7, 2026
Best input$1.92per 1M tokens · Cloudflare
Best output$1.92per 1M tokens · Cloudflare

Pricing across providers

Every row is a seller of Llama 2 7b Chat Int8 with token pricing we track. The cheapest input in this snapshot is from Cloudflare. The bar chart shows the same input and output dollars per million for a quick scan.

C
Cloudflare
Input / 1M
$1.92
Output / 1M
$1.92

Input vs output · 1M tokens

Cost calculator

The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Llama 2 7b Chat Int8 cost scales with traffic.

In: $1.92/M·Out: $1.92/M

0.192300¢ / req

0.096150¢ / req

Daily
$29
Monthly
$865
Annual
$10.5K

Model specifications

These fields describe Llama 2 7b Chat Int8 as we store it (source: Meta). They sit next to price so buyers can check limits and tools in one place.

Context window
2,048 tokens
Max output
2,048 tokens
Vision (images)
No
Tool / function calling
No
Streaming
No
Released
N/A
Primary provider
Meta
Model family
N/A

Compare Llama 2 7b Chat Int8

Open a pair page to see Llama 2 7b Chat Int8 next to another model with a shared provider matrix. 6 shortcuts below.

Frequently asked questions

Read these after the table if you want plain language around Llama 2 7b Chat Int8 rates.

Yes. Llama 2 7b Chat Int8 is available on Cloudflare.

Also from Meta

Other models by Meta with live pricing in our catalog.