Llama 2 7b pricing
If you are budgeting for Llama 2 7b, start with the numbers below. We index 1 provider price. Cheapest input is $0.050 per million tokens. Cheapest output is $0.250 per million tokens. The model lists a 4K context window in our data.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. 1 offer is listed for Llama 2 7b. Best input in this view: Replicate.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
R Replicate | $0.050 | $0.250 | — | — |
Input vs output · 1M tokens
Cost calculator
Use this block to stress test Llama 2 7b cost without a spreadsheet. All estimates come from public list rates in this page.
0.005000¢ / req
0.012500¢ / req
Model specifications
Context length, caps, and capability flags for Llama 2 7b. Values follow the main provider (Meta) record in our index.
- Context window
- 4,096 tokens
- Max output
- 4,096 tokens
- Vision (images)
- No
- Tool / function calling
- No
- Streaming
- No
- Released
- N/A
- Primary provider
- Meta
- Model family
- N/A
Compare Llama 2 7b
These links open full side by side pages for Llama 2 7b. We picked pairs that people often shop together. 6 ready to open.
- Llama 2 7b vs Llama 3.1 70B
Compare pricing side by side
- Llama 2 7b vs Llama 3.1 8B
Compare pricing side by side
- Llama 2 7b vs GPT-4o
Compare pricing side by side
- Llama 2 7b vs GPT-4o mini
Compare pricing side by side
- Llama 2 7b vs Claude Sonnet 4.6
Compare pricing side by side
- Llama 2 7b vs Gemini 2.0 Flash
Compare pricing side by side
Frequently asked questions
Quick frequently asked items for Llama 2 7b pricing and limits.
Also from Meta
Other models by Meta with live pricing in our catalog.