Llama 3 8b pricing
If you are budgeting for Llama 3 8b, start with the numbers below. We index 4 provider prices. The cheapest input is $0.030 per million tokens, and the cheapest output is $0.040 per million tokens. The model lists an 8K context window in our data.
Pricing across providers
All figures are list prices in USD per million tokens unless a column says otherwise. Four offers are listed for Llama 3 8b. Lowest input price in this view: OpenRouter.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
| OpenRouter | $0.030 | $0.040 | — | — |
| Novita | $0.040 | $0.040 | — | — |
| Vercel AI Gateway | $0.050 | $0.080 | — | — |
| Replicate | $0.050 | $0.250 | — | — |
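Which provider is cheapest overall depends on how a workload splits between input and output tokens. The sketch below ranks the four offers by a blended price per million tokens. The prices are copied from the table; the names `blended_price`, `rank_providers`, and `input_share` are hypothetical, and the 75% input share is only an illustrative assumption, not a figure from this page.

```python
# List prices in USD per million tokens, copied from the table above.
# Each value is (input price, output price).
PRICES = {
    "OpenRouter": (0.030, 0.040),
    "Novita": (0.040, 0.040),
    "Vercel AI Gateway": (0.050, 0.080),
    "Replicate": (0.050, 0.250),
}


def blended_price(input_price, output_price, input_share):
    """Weighted USD price per million tokens for a given input/output mix.

    input_share is the fraction of all tokens that are input tokens (0 to 1).
    """
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_price * input_share + output_price * (1.0 - input_share)


def rank_providers(input_share):
    """Return (provider, blended price) pairs, cheapest first."""
    ranked = [
        (name, blended_price(inp, out, input_share))
        for name, (inp, out) in PRICES.items()
    ]
    return sorted(ranked, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Assumed mix: 75% of tokens are input, 25% are output.
    for name, price in rank_providers(input_share=0.75):
        print(f"{name}: ${price:.4f} per 1M blended tokens")
```

Output-heavy workloads widen the gap between providers, because the output prices in the table vary far more ($0.040 to $0.250) than the input prices ($0.030 to $0.050).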
Input vs output · per provider
Cost calculator
Use this block to stress test Llama 3 8b cost without a spreadsheet. All estimates come from public list rates in this page.
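The same estimate can be reproduced offline. This is a minimal sketch, assuming list prices are billed linearly per token with no minimums or rounding; the function names and the sample token counts (1,000 input, 500 output) are hypothetical and chosen only for illustration.

```python
def request_cost_usd(input_tokens, output_tokens,
                     input_price_per_m, output_price_per_m):
    """Return (input cost, output cost) in USD for a single request.

    Prices are list prices in USD per million tokens.
    """
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost, output_cost


def usd_to_cents(amount_usd):
    """Convert a USD amount to US cents."""
    return amount_usd * 100


if __name__ == "__main__":
    # Lowest list prices on this page: $0.030 input, $0.040 output per 1M tokens.
    # The token counts are an assumed example request.
    in_cost, out_cost = request_cost_usd(1_000, 500, 0.030, 0.040)
    print(f"input:  {usd_to_cents(in_cost):.6f} cents / req")
    print(f"output: {usd_to_cents(out_cost):.6f} cents / req")
    print(f"total:  {usd_to_cents(in_cost + out_cost):.6f} cents / req")
```

Multiplying the per-request total by an expected monthly request count gives a rough budget figure; actual invoices may differ if a provider applies fees or discounts that are not reflected in list prices.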
Model specifications
Context length, caps, and capability flags for Llama 3 8b. Family: Llama 3. Values follow the main provider (Meta) record in our index.
- Context window: 8,086 tokens
- Max output: 8,086 tokens
- Vision (images): No
- Tool / function calling: No
- Streaming: No
- Released: N/A
- Primary provider: Meta
- Model family: Llama 3
Compare Llama 3 8b
These links open full side-by-side comparison pages for Llama 3 8b. We picked pairs that people often shop together. Six comparisons are available.
- Llama 3 8b vs Llama 3.1 70B
- Llama 3 8b vs Llama 3.1 8B
- Llama 3 8b vs GPT-4o
- Llama 3 8b vs GPT-4o mini
- Llama 3 8b vs Claude Sonnet 4.6
- Llama 3 8b vs Gemini 2.0 Flash
Frequently asked questions
Quick answers to common questions about Llama 3 8b pricing and limits.
Also from Meta
Other models by Meta with live pricing in our catalog.