GPT-4.1 pricing
GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and GPT-4.5 across coding (54.6% SWE-bench Verified), instruction compliance (87.4% IFEval), and multimodal understanding benchmarks. It is tuned for precise code diffs, agent reliability, and high recall in large document contexts, making it ideal for agents, IDE tooling, and ente. Live index: 6 priced offers. Best input $2.00 per million tokens from Openrouter. Best output $8.00 per million tokens from Openrouter.
Pricing across providers
All figures are list prices per million tokens unless a column says otherwise. 6 offers are listed for GPT-4.1. Best input in this view: Openrouter.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
O Openrouter | $2.00 | $8.00 | $0.500 | — |
A Azure | $2.00 | $8.00 | $0.500 | — |
GC Github Copilot | N/A | N/A | — | — |
VA Vercel Ai Gateway | $2.00 | $8.00 | $0.500 | — |
R Replicate | $2.00 | $8.00 | — | — |
O OpenAInative | $2.00 | $8.00 | $0.500 | — |
Input vs output · per provider
Cost calculator
Pick any of the providers above and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for GPT-4.1.
Provider
0.200000¢ / req
0.400000¢ / req
Model specifications
Context length, caps, and capability flags for GPT-4.1. Family: GPT-4. Values follow the main provider (OpenAI) record in our index.
- Context window
- 1,047,576 tokens
- Max output
- 32,768 tokens
- Vision (images)
- Yes
- Tool / function calling
- Yes
- Streaming
- No
- Released
- Apr 2025
- Primary provider
- OpenAI
- Model family
- GPT-4
Compare GPT-4.1
Jump into a comparison when you want one table for two models instead of two tabs. 6 curated matches for GPT-4.1.
- GPT-4.1 vs GPT-4o
GPT-4.1 20% cheaper on output
- GPT-4.1 vs GPT-4o mini
GPT-4o mini 92% cheaper on output
- GPT-4.1 vs o3
Same output pricing
- GPT-4.1 vs Claude Sonnet 4.6
GPT-4.1 46% cheaper on output
- GPT-4.1 vs Gemini 2.0 Flash
Gemini 2.0 Flash 95% cheaper on output
- GPT-4.1 vs Llama 3.1 70B
Compare pricing side by side
Frequently asked questions
Answers pull from the same numbers you see on this page. The short model note from our index: GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o an...
Also from OpenAI
Other models by OpenAI with live pricing in our catalog.