GPT-4.1 Mini pricing
GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard instruction evals, 35.8% on MultiChallenge, and 84.1% on IFEval. Mini also shows strong coding ability (e.g., 31.6% on Aider’s polyglot diff benchmark) and vision understanding, making it suitable for interactive applications with tight performance constraints. Live index: 5 priced offers. Best input $0.400 per million tokens from Openrouter. Best output $1.60 per million tokens from Openrouter.
Pricing across providers
Use this table to read GPT-4.1 Mini list prices. We show 5 sources right now. Lowest input in the grid: Openrouter. The chart below the table helps when output prices are much higher than input prices.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O Openrouter | $0.400 | $1.60 | $0.100 | — |
VA Vercel Ai Gateway | $0.400 | $1.60 | $0.100 | — |
A Azure | $0.400 | $1.60 | $0.100 | — |
R Replicate | $0.400 | $1.60 | — | — |
O OpenAInative | $0.400 | $1.60 | $0.100 | — |
Input vs output · per provider
Cost calculator
Pick any of the providers above and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for GPT-4.1 Mini.
Provider
0.040000¢ / req
0.080000¢ / req
Model specifications
Context length, caps, and capability flags for GPT-4.1 Mini. Family: GPT-4. Values follow the main provider (OpenAI) record in our index.
- Context window
- 1,047,576 tokens
- Max output
- 32,768 tokens
- Vision (images)
- Yes
- Tool / function calling
- Yes
- Streaming
- No
- Released
- Apr 2025
- Primary provider
- OpenAI
- Model family
- GPT-4
Compare GPT-4.1 Mini
Open a pair page to see GPT-4.1 Mini next to another model with a shared provider matrix. 6 shortcuts below.
- GPT-4.1 Mini vs GPT-4o
GPT-4.1 Mini 84% cheaper on output
- GPT-4.1 Mini vs GPT-4o mini
GPT-4o mini 62% cheaper on output
- GPT-4.1 Mini vs o3
GPT-4.1 Mini 80% cheaper on output
- GPT-4.1 Mini vs Claude Sonnet 4.6
GPT-4.1 Mini 89% cheaper on output
- GPT-4.1 Mini vs Gemini 2.0 Flash
Gemini 2.0 Flash 75% cheaper on output
- GPT-4.1 Mini vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Answers pull from the same numbers you see on this page. The short model note from our index: GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard instruction evals, 35.8% on M...
Also from OpenAI
Other models by OpenAI with live pricing in our catalog.