Llama V3p1 8b pricing
If you are budgeting for Llama V3p1 8b, start with the numbers below. We index 1 provider price. Cheapest input is $0.100 per million tokens. Cheapest output is $0.100 per million tokens. The model lists a 16K context window in our data.
Pricing across providers
Every row is a seller of Llama V3p1 8b with token pricing we track. The cheapest input in this snapshot is from Fireworks AI. The bar chart shows the same input and output dollars per million for a quick scan.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
FA Fireworks AI | $0.100 | $0.100 | — | — |
Input vs output · 1M tokens
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Llama V3p1 8b cost scales with traffic.
0.010000¢ / req
0.005000¢ / req
Model specifications
These fields describe Llama V3p1 8b as we store it (source: Meta). They sit next to price so buyers can check limits and tools in one place.
- Context window
- 16,384 tokens
- Max output
- 16,384 tokens
- Vision (images)
- No
- Tool / function calling
- No
- Streaming
- No
- Released
- N/A
- Primary provider
- Meta
- Model family
- N/A
Compare Llama V3p1 8b
These links open full side by side pages for Llama V3p1 8b. We picked pairs that people often shop together. 6 ready to open.
- Llama V3p1 8b vs Llama 3.1 70B
Compare pricing side by side
- Llama V3p1 8b vs Llama 3.1 8B
Compare pricing side by side
- Llama V3p1 8b vs GPT-4o
Compare pricing side by side
- Llama V3p1 8b vs GPT-4o mini
Compare pricing side by side
- Llama V3p1 8b vs Claude Sonnet 4.6
Compare pricing side by side
- Llama V3p1 8b vs Gemini 2.0 Flash
Compare pricing side by side
Frequently asked questions
Answers pull from the same numbers you see on this page.
Also from Meta
Other models by Meta with live pricing in our catalog.