Llama 2 7b Chat Int8 pricing
If you are budgeting for Llama 2 7b Chat Int8, start with the numbers below. We index 1 provider price. Cheapest input is $1.92 per million tokens. Cheapest output is $1.92 per million tokens. The model lists a 2K context window in our data.
Pricing across providers
Every row is a seller of Llama 2 7b Chat Int8 with token pricing we track. The cheapest input in this snapshot is from Cloudflare. The bar chart shows the same input and output dollars per million for a quick scan.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
C Cloudflare | $1.92 | $1.92 | — | — |
Input vs output · 1M tokens
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Llama 2 7b Chat Int8 cost scales with traffic.
0.192300¢ / req
0.096150¢ / req
Model specifications
These fields describe Llama 2 7b Chat Int8 as we store it (source: Meta). They sit next to price so buyers can check limits and tools in one place.
- Context window
- 2,048 tokens
- Max output
- 2,048 tokens
- Vision (images)
- No
- Tool / function calling
- No
- Streaming
- No
- Released
- N/A
- Primary provider
- Meta
- Model family
- N/A
Compare Llama 2 7b Chat Int8
Open a pair page to see Llama 2 7b Chat Int8 next to another model with a shared provider matrix. 6 shortcuts below.
- Llama 2 7b Chat Int8 vs Llama 3.1 70B
Compare pricing side by side
- Llama 2 7b Chat Int8 vs Llama 3.1 8B
Compare pricing side by side
- Llama 2 7b Chat Int8 vs GPT-4o
Compare pricing side by side
- Llama 2 7b Chat Int8 vs GPT-4o mini
Compare pricing side by side
- Llama 2 7b Chat Int8 vs Claude Sonnet 4.6
Compare pricing side by side
- Llama 2 7b Chat Int8 vs Gemini 2.0 Flash
Compare pricing side by side
Frequently asked questions
Read these after the table if you want plain language around Llama 2 7b Chat Int8 rates.
Also from Meta
Other models by Meta with live pricing in our catalog.