Llama 3.1 8B pricing
Meta's compact open-weight model for efficient inference at low cost. This page tracks 12 listings in total. Highlighted lows are $0.020 per million input and $0.030 per million output (see table for which seller matches each).
Pricing across providers
Every row is a seller of Llama 3.1 8B with token pricing we track. The cheapest input in this snapshot is from Openrouter. The bar chart shows the same input and output dollars per million for a quick scan.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O Openrouter | $0.020 | $0.050 | — | — |
O Ovhcloud | $0.100 | $0.100 | — | — |
VA Vercel Ai Gateway | $0.050 | $0.080 | — | — |
W Wandb | $22000.00 | $22000.00 | — | — |
N Novita | $0.020 | $0.050 | — | — |
L Llamagate | $0.030 | $0.050 | — | — |
N Nscale | $0.030 | $0.030 | — | — |
G Groq | $0.050 | $0.080 | — | — |
FA Fireworks AI | $0.200 | $0.200 | — | — |
D DeepInfra | $0.060 | $0.060 | — | — |
P Perplexity | $0.200 | $0.200 | — | — |
TA Together AI | $0.180 | $0.180 | — | — |
Input vs output · per provider
Cost calculator
Use this block to stress test Llama 3.1 8B cost without a spreadsheet. All estimates come from public list rates in this page.
Provider
0.002000¢ / req
0.002500¢ / req
Model specifications
These fields describe Llama 3.1 8B as we store it (Family: Llama 3.1. source: Meta). They sit next to price so buyers can check limits and tools in one place.
- Context window
- 128,000 tokens
- Max output
- 4,096 tokens
- Vision (images)
- No
- Tool / function calling
- Yes
- Streaming
- Yes
- Released
- Jul 2024
- Primary provider
- Meta
- Model family
- Llama 3.1
Compare Llama 3.1 8B
Open a pair page to see Llama 3.1 8B next to another model with a shared provider matrix. 6 shortcuts below.
- Llama 3.1 8B vs Llama 3.1 70B
Compare pricing side by side
- Llama 3.1 8B vs GPT-4o
Compare pricing side by side
- Llama 3.1 8B vs GPT-4o mini
Compare pricing side by side
- Llama 3.1 8B vs Claude Sonnet 4.6
Compare pricing side by side
- Llama 3.1 8B vs Gemini 2.0 Flash
Compare pricing side by side
- Llama 3.1 8B vs o3
Compare pricing side by side
Frequently asked questions
Quick frequently asked items for Llama 3.1 8B pricing and limits. The short model note from our index: Meta's compact open-weight model for efficient inference at low cost.
Also from Meta
Other models by Meta with live pricing in our catalog.