Eleutherai

Llemma 7b pricing

Llemma 7B is a language model for mathematics. It was initialized with Code Llama 7B weights, and trained on the Proof-Pile-2 for 200B tokens. Llemma models are particularly strong at... Below you will find 1 current row with input and output dollars per million. Right now the lowest input is $0.800 and the lowest output is $1.20.

4K context·1 provider·verified Apr 7, 2026
Best input$0.800per 1M tokens · Openrouter
Best output$1.20per 1M tokens · Openrouter

Pricing across providers

All figures are list prices per million tokens unless a column says otherwise. 1 offer is listed for Llemma 7b. Best input in this view: Openrouter.

O
Openrouter
Input / 1M
$0.800
Output / 1M
$1.20

Input vs output · 1M tokens

Cost calculator

The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how Llemma 7b cost scales with traffic.

In: $0.800/M·Out: $1.20/M

0.080000¢ / req

0.060000¢ / req

Daily
$14
Monthly
$420
Annual
$5.1K

Model specifications

These fields describe Llemma 7b as we store it (source: Eleutherai). They sit next to price so buyers can check limits and tools in one place.

Context window
4,096 tokens
Max output
4,096 tokens
Vision (images)
No
Tool / function calling
No
Streaming
Yes
Released
Apr 2025
Primary provider
Eleutherai
Model family
N/A

Compare Llemma 7b

These links open full side by side pages for Llemma 7b. We picked pairs that people often shop together. 6 ready to open.

Locked

Compare with

Pick a model on both sides.

Popular Llemma 7b comparisons

Frequently asked questions

Answers pull from the same numbers you see on this page. The short model note from our index: Llemma 7B is a language model for mathematics. It was initialized with Code Llama 7B weights, and trained on the Proof-Pile-2 for 200B tokens. Llemma models are particularly strong at...

Yes. Llemma 7b is available on Openrouter.