Meta

Llama V3p1 8b pricing

If you are budgeting for Llama V3p1 8b, start with the numbers below. We track pricing from one provider, Fireworks AI, so the cheapest input and the cheapest output are both $0.100 per million tokens. Our data lists a 16K (16,384-token) context window for the model.

16K context · 1 provider · verified Mar 13, 2026
Best input: $0.100 per 1M tokens · Fireworks AI
Best output: $0.100 per 1M tokens · Fireworks AI

Pricing across providers

Every row is a seller of Llama V3p1 8b with token pricing we track. The cheapest input in this snapshot is from Fireworks AI. The bar chart shows the same input and output dollars per million for a quick scan.

Fireworks AI · Input / 1M: $0.100 · Output / 1M: $0.100

[Bar chart: input vs output price per 1M tokens]

Cost calculator

The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Llama V3p1 8b cost scales with traffic.

Rates: input $0.100 / 1M tokens · output $0.100 / 1M tokens
Per-request cost components: 0.010000¢ / req and 0.005000¢ / req

Daily: $1.50
Monthly: $45
Annual: $548
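
The arithmetic behind these totals can be sketched in a few lines. The slider positions are not preserved on this page, so the traffic values used here (1,000 input tokens and 500 output tokens per request, 10,000 requests per day) are assumptions inferred from the per-request and daily figures above. The 30-day month and 365-day year are also assumptions, and the function names are illustrative only.

```python
# Rates from the table above, in dollars per million tokens.
INPUT_USD_PER_M = 0.100
OUTPUT_USD_PER_M = 0.100


def request_cost_usd(input_tokens, output_tokens,
                     input_per_m=INPUT_USD_PER_M,
                     output_per_m=OUTPUT_USD_PER_M):
    """Dollar cost of one request at per-million-token rates."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


def projected_costs(requests_per_day, input_tokens, output_tokens):
    """Scale the per-request cost to daily, monthly, and annual totals.

    Assumes a 30-day month and a 365-day year (not stated on the page).
    """
    per_request = request_cost_usd(input_tokens, output_tokens)
    daily = per_request * requests_per_day
    return {
        "per_request": per_request,
        "daily": daily,
        "monthly": daily * 30,
        "annual": daily * 365,
    }


# Assumed traffic: 10,000 requests/day, 1,000 input + 500 output tokens each.
costs = projected_costs(requests_per_day=10_000, input_tokens=1_000, output_tokens=500)
for period, usd in costs.items():
    print(f"{period}: ${usd:,.6f}")
```

Under those assumed inputs the sketch gives about $1.50 per day and $45 per month, and an annual figure of about $547.50, which matches the $548 shown once rounded to whole dollars.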

Model specifications

These fields describe Llama V3p1 8b as we store it (source: Meta). They sit next to price so buyers can check limits and tools in one place.

Context window: 16,384 tokens
Max output: 16,384 tokens
Vision (images): No
Tool / function calling: No
Streaming: No
Released: N/A
Primary provider: Meta
Model family: N/A

Frequently asked questions

Answers pull from the same numbers you see on this page.

Llama V3p1 8b is available on Fireworks AI, the only provider indexed on this page.
