Llemma 7b pricing
Llemma 7B is a language model for mathematics. It was initialized with Code Llama 7B weights and trained on the Proof-Pile-2 for 200B tokens. Llemma models are particularly strong at... The table below lists one current offer, with input and output prices in dollars per million tokens. The lowest input price is currently $0.800 and the lowest output price is $1.20.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. One offer is listed for Llemma 7b. Lowest input price in this view: OpenRouter.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
| OpenRouter | $0.800 | $1.20 | — | — |
Input vs output · 1M tokens
Cost calculator
The calculator uses the same dollars-per-million-token prices as the table. Adjust the sliders to see how Llemma 7b cost scales with traffic.
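The arithmetic behind a per-million-token price can be sketched in a few lines of Python. This is a minimal illustration, not the page's actual calculator: the function names, the 1,000-token prompt, the 500-token completion, and the monthly request count are all hypothetical values chosen for the example. Only the two prices come from the table.

```python
# Prices in USD per one million tokens, taken from the pricing table.
PRICES_PER_MILLION = {"input": 0.80, "output": 1.20}


def request_cost_usd(input_tokens, output_tokens, prices=PRICES_PER_MILLION):
    """Return (input_cost, output_cost, total_cost) in USD for one request."""
    input_cost = input_tokens * prices["input"] / 1_000_000
    output_cost = output_tokens * prices["output"] / 1_000_000
    return input_cost, output_cost, input_cost + output_cost


def monthly_cost_usd(requests_per_month, input_tokens, output_tokens):
    """Scale the per-request total linearly with traffic."""
    _, _, total = request_cost_usd(input_tokens, output_tokens)
    return requests_per_month * total


# Hypothetical workload: 1,000 prompt tokens and 500 completion tokens.
inp, out, total = request_cost_usd(1_000, 500)
print(f"input:  {inp * 100:.6f}¢ / req")
print(f"output: {out * 100:.6f}¢ / req")
print(f"total:  {total * 100:.6f}¢ / req")

# Hypothetical traffic: 100,000 such requests per month.
print(f"monthly: ${monthly_cost_usd(100_000, 1_000, 500):.2f}")
```

Because the price is linear in tokens, doubling either the token counts or the request volume doubles the corresponding part of the bill.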
Model specifications
These fields describe Llemma 7b as we store it (source: EleutherAI). They appear alongside pricing so buyers can check limits and tool support in one place.
- Context window: 4,096 tokens
- Max output: 4,096 tokens
- Vision (images): No
- Tool / function calling: No
- Streaming: Yes
- Released: Apr 2025
- Primary provider: EleutherAI
- Model family: N/A
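As a rough way to check a request against the limits listed above, the sketch below computes how many completion tokens remain for a given prompt. It assumes that prompt and completion share the 4,096-token context window, which is typical for decoder-only models but is not stated on this page. The function name is hypothetical.

```python
# Limits taken from the specifications list, in tokens.
CONTEXT_WINDOW = 4_096
MAX_OUTPUT = 4_096


def max_completion_tokens(prompt_tokens,
                          context_window=CONTEXT_WINDOW,
                          max_output=MAX_OUTPUT):
    """Largest completion budget for a prompt of the given length.

    Assumption: prompt and completion together must fit in the context
    window, and the completion is additionally capped by max_output.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = context_window - prompt_tokens
    return max(0, min(max_output, remaining))


# Hypothetical prompt lengths, in tokens.
for prompt in (0, 1_000, 4_096, 5_000):
    print(prompt, "->", max_completion_tokens(prompt))
```

Under that assumption, a prompt longer than the context window leaves no room for output, so it would need to be shortened before sending.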
Compare Llemma 7b
These links open full side-by-side pages for Llemma 7b. We picked pairs that people often shop together. Six comparisons are available.
Popular Llemma 7b comparisons
- Llemma 7b vs GPT-4o
- Llemma 7b vs GPT-4o mini
- Llemma 7b vs Claude Sonnet 4.6
- Llemma 7b vs Gemini 2.0 Flash
- Llemma 7b vs o3
- Llemma 7b vs Llama 3.1 70B
Frequently asked questions
Answers draw on the same figures shown on this page.