Llama3 3 70b Instruct Fp8 pricing
If you are budgeting for Llama3 3 70b Instruct Fp8, start with the numbers below. We index 1 provider price. Cheapest input is $0.120 per million tokens. Cheapest output is $0.300 per million tokens. The model lists a 131K context window in our data.
Pricing across providers
Every row is a seller of Llama3 3 70b Instruct Fp8 with token pricing we track. The cheapest input in this snapshot is from Lambda. The bar chart shows the same input and output dollars per million for a quick scan.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
L Lambda | $0.120 | $0.300 | — | — |
Input vs output · 1M tokens
Cost calculator
Pick any provider row and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for Llama3 3 70b Instruct Fp8.
0.012000¢ / req
0.015000¢ / req
Model specifications
Context length, caps, and capability flags for Llama3 3 70b Instruct Fp8. Values follow the main provider (Meta) record in our index.
- Context window
- 131,072 tokens
- Max output
- 131,072 tokens
- Vision (images)
- No
- Tool / function calling
- Yes
- Streaming
- No
- Released
- N/A
- Primary provider
- Meta
- Model family
- N/A
Compare Llama3 3 70b Instruct Fp8
Jump into a comparison when you want one table for two models instead of two tabs. 6 curated matches for Llama3 3 70b Instruct Fp8.
- Llama3 3 70b Instruct Fp8 vs Llama 3.1 70B
Compare pricing side by side
- Llama3 3 70b Instruct Fp8 vs Llama 3.1 8B
Compare pricing side by side
- Llama3 3 70b Instruct Fp8 vs GPT-4o
Compare pricing side by side
- Llama3 3 70b Instruct Fp8 vs GPT-4o mini
Compare pricing side by side
- Llama3 3 70b Instruct Fp8 vs Claude Sonnet 4.6
Compare pricing side by side
- Llama3 3 70b Instruct Fp8 vs Gemini 2.0 Flash
Compare pricing side by side
Frequently asked questions
Read these after the table if you want plain language around Llama3 3 70b Instruct Fp8 rates.
Also from Meta
Other models by Meta with live pricing in our catalog.