Llama3 1 8b pricing
If you are budgeting for Llama3 1 8b, start with the numbers below. We index 3 provider prices. Cheapest input is $0.025 per million tokens. Cheapest output is $0.040 per million tokens. The model lists a 128K context window in our data.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. 3 offers are listed for Llama3 1 8b. Best input in this view: Lambda.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
S Snowflake | N/A | N/A | — | — |
L Lambda | $0.025 | $0.040 | — | — |
C Cerebras | $0.100 | $0.100 | — | — |
Input vs output · per provider
Cost calculator
Use this block to stress test Llama3 1 8b cost without a spreadsheet. All estimates come from public list rates in this page.
Provider
Model specifications
These fields describe Llama3 1 8b as we store it (source: Meta). They sit next to price so buyers can check limits and tools in one place.
- Context window
- 128,000 tokens
- Max output
- 128,000 tokens
- Vision (images)
- No
- Tool / function calling
- Yes
- Streaming
- No
- Released
- N/A
- Primary provider
- Meta
- Model family
- N/A
Compare Llama3 1 8b
Open a pair page to see Llama3 1 8b next to another model with a shared provider matrix. 6 shortcuts below.
- Llama3 1 8b vs Llama 3.1 70B
Compare pricing side by side
- Llama3 1 8b vs Llama 3.1 8B
Compare pricing side by side
- Llama3 1 8b vs GPT-4o
Compare pricing side by side
- Llama3 1 8b vs GPT-4o mini
Compare pricing side by side
- Llama3 1 8b vs Claude Sonnet 4.6
Compare pricing side by side
- Llama3 1 8b vs Gemini 2.0 Flash
Compare pricing side by side
Frequently asked questions
Answers pull from the same numbers you see on this page.
Also from Meta
Other models by Meta with live pricing in our catalog.