OpenAIGPT-4VisionTool use

GPT-4.1 pricing

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and GPT-4.5 across coding (54.6% SWE-bench Verified), instruction compliance (87.4% IFEval), and multimodal understanding benchmarks. It is tuned for precise code diffs, agent reliability, and high recall in large document contexts, making it ideal for agents, IDE tooling, and ente. Live index: 6 priced offers. Best input $2.00 per million tokens from Openrouter. Best output $8.00 per million tokens from Openrouter.

1.0M context·6 providers·verified May 2, 2026
Best input$2.00per 1M tokens · Openrouter
Best output$8.00per 1M tokens · Openrouter

Pricing across providers

All figures are list prices per million tokens unless a column says otherwise. 6 offers are listed for GPT-4.1. Best input in this view: Openrouter.

O
Openrouter
Input / 1M
$2.00
Output / 1M
$8.00
Cached in: $0.500
A
Azure
Input / 1M
$2.00
Output / 1M
$8.00
Cached in: $0.500
GC
Github Copilot
Input / 1M
N/A
Output / 1M
N/A
VA
Vercel Ai Gateway
Input / 1M
$2.00
Output / 1M
$8.00
Cached in: $0.500
R
Replicate
Input / 1M
$2.00
Output / 1M
$8.00
O
OpenAInative
Input / 1M
$2.00
Output / 1M
$8.00
Cached in: $0.500

Input vs output · per provider

Cost calculator

Pick any of the providers above and type how many tokens you expect per day, week, or year. We turn that into rough dollar totals for GPT-4.1.

Provider

In: $2.00/M·Out: $8.00/M·Cache: $0.500/M

0.200000¢ / req

0.400000¢ / req

Daily
$60
Monthly
$1.8K
Annual
$21.9K

Model specifications

Context length, caps, and capability flags for GPT-4.1. Family: GPT-4. Values follow the main provider (OpenAI) record in our index.

Context window
1,047,576 tokens
Max output
32,768 tokens
Vision (images)
Yes
Tool / function calling
Yes
Streaming
No
Released
Apr 2025
Primary provider
OpenAI
Model family
GPT-4

Compare GPT-4.1

Jump into a comparison when you want one table for two models instead of two tabs. 6 curated matches for GPT-4.1.

Frequently asked questions

Answers pull from the same numbers you see on this page. The short model note from our index: GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o an...

GPT-4.1 costs $2.00 per million input tokens and $8.00 per million output tokens via the native API. Prompt caching reduces input costs to $0.50/M tokens.

Also from OpenAI

Other models by OpenAI with live pricing in our catalog.