MetaLlama 3.1Tool use

Llama 3.1 8B pricing

Meta's compact open-weight model for efficient inference at low cost. This page tracks 12 listings in total. Highlighted lows are $0.020 per million input and $0.030 per million output (see table for which seller matches each).

128K context·12 providers·verified Apr 7, 2026
Best input$0.020per 1M tokens · Openrouter
Best output$0.030per 1M tokens · Nscale

Pricing across providers

Every row is a seller of Llama 3.1 8B with token pricing we track. The cheapest input in this snapshot is from Openrouter. The bar chart shows the same input and output dollars per million for a quick scan.

O
Openrouter
Input / 1M
$0.020
Output / 1M
$0.050
O
Ovhcloud
Input / 1M
$0.100
Output / 1M
$0.100
VA
Vercel Ai Gateway
Input / 1M
$0.050
Output / 1M
$0.080
W
Wandb
Input / 1M
$22000.00
Output / 1M
$22000.00
N
Novita
Input / 1M
$0.020
Output / 1M
$0.050
L
Llamagate
Input / 1M
$0.030
Output / 1M
$0.050
N
Nscale
Input / 1M
$0.030
Output / 1M
$0.030
G
Groq
Input / 1M
$0.050
Output / 1M
$0.080
FA
Fireworks AI
Input / 1M
$0.200
Output / 1M
$0.200
D
DeepInfra
Input / 1M
$0.060
Output / 1M
$0.060
P
Perplexity
Input / 1M
$0.200
Output / 1M
$0.200
TA
Together AI
Input / 1M
$0.180
Output / 1M
$0.180

Input vs output · per provider

Cost calculator

Use this block to stress test Llama 3.1 8B cost without a spreadsheet. All estimates come from public list rates in this page.

Provider

In: $0.020/M·Out: $0.050/M

0.002000¢ / req

0.002500¢ / req

Daily
$0.45
Monthly
$14
Annual
$164

Model specifications

These fields describe Llama 3.1 8B as we store it (Family: Llama 3.1. source: Meta). They sit next to price so buyers can check limits and tools in one place.

Context window
128,000 tokens
Max output
4,096 tokens
Vision (images)
No
Tool / function calling
Yes
Streaming
Yes
Released
Jul 2024
Primary provider
Meta
Model family
Llama 3.1

Compare Llama 3.1 8B

Open a pair page to see Llama 3.1 8B next to another model with a shared provider matrix. 6 shortcuts below.

Frequently asked questions

Quick frequently asked items for Llama 3.1 8B pricing and limits. The short model note from our index: Meta's compact open-weight model for efficient inference at low cost.

Yes. Llama 3.1 8B is available on Openrouter, Ovhcloud, Vercel Ai Gateway, Wandb, Novita, Llamagate, Nscale, Groq, Fireworks AI, DeepInfra, Perplexity, Together AI.

Also from Meta

Other models by Meta with live pricing in our catalog.