Llama V2 7b pricing
If you are budgeting for Llama V2 7b, start with the numbers below. We index 1 provider price. Cheapest input is $0.200 per million tokens. Cheapest output is $0.200 per million tokens. The model lists a 4K context window in our data.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. 1 offer is listed for Llama V2 7b. Best input in this view: Fireworks AI.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
FA Fireworks AI | $0.200 | $0.200 | — | — |
Input vs output · 1M tokens
Cost calculator
Use this block to stress test Llama V2 7b cost without a spreadsheet. All estimates come from public list rates in this page.
0.020000¢ / req
0.010000¢ / req
Model specifications
These fields describe Llama V2 7b as we store it (source: Meta). They sit next to price so buyers can check limits and tools in one place.
- Context window
- 4,096 tokens
- Max output
- 4,096 tokens
- Vision (images)
- No
- Tool / function calling
- No
- Streaming
- No
- Released
- N/A
- Primary provider
- Meta
- Model family
- N/A
Compare Llama V2 7b
Open a pair page to see Llama V2 7b next to another model with a shared provider matrix. 6 shortcuts below.
- Llama V2 7b vs Llama 3.1 70B
Compare pricing side by side
- Llama V2 7b vs Llama 3.1 8B
Compare pricing side by side
- Llama V2 7b vs GPT-4o
Compare pricing side by side
- Llama V2 7b vs GPT-4o mini
Compare pricing side by side
- Llama V2 7b vs Claude Sonnet 4.6
Compare pricing side by side
- Llama V2 7b vs Gemini 2.0 Flash
Compare pricing side by side
Frequently asked questions
Answers pull from the same numbers you see on this page.
Also from Meta
Other models by Meta with live pricing in our catalog.