OpenAITool use

gpt-oss-120b pricing

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized to run on a single H100 GPU with native MXFP4 quantization. The model supports configurable reasoning depth, full chain-of-thought access, and native tool use, including function calling, browsing, and structured output generation. Below you will find 15 current rows with input and output dollars per million. Right now the lowest input is $0.039 and the lowest output is $0.180.

131K context·15 providers·verified May 2, 2026
Best input$0.039per 1M tokens · Openrouter
Best output$0.180per 1M tokens · Openrouter

Pricing across providers

Every row is a seller of gpt-oss-120b with token pricing we track. The cheapest input in this snapshot is from Openrouter. The bar chart shows the same input and output dollars per million for a quick scan.

O
Openrouter
Input / 1M
$0.180
Output / 1M
$0.800
O
Openrouter
Input / 1M
$0.039
Output / 1M
$0.180
B
Baseten
Input / 1M
$0.100
Output / 1M
$0.500
A
Azure
Input / 1M
$0.150
Output / 1M
$0.600
C
Cerebras
Input / 1M
$0.350
Output / 1M
$0.750
O
Ovhcloud
Input / 1M
$0.080
Output / 1M
$0.400
S
Sambanova
Input / 1M
$3.00
Output / 1M
$4.50
W
Wandb
Input / 1M
$15000.00
Output / 1M
$60000.00
IW
Ibm Watsonx
Input / 1M
$0.150
Output / 1M
$0.600
N
Novita
Input / 1M
$0.050
Output / 1M
$0.250
G
Groq
Input / 1M
$0.150
Output / 1M
$0.600
Cached in: $0.075
D
DeepInfra
Input / 1M
$0.050
Output / 1M
$0.450
R
Replicate
Input / 1M
$0.180
Output / 1M
$0.720
TA
Together AI
Input / 1M
$0.150
Output / 1M
$0.600
FA
Fireworks AI
Input / 1M
$0.150
Output / 1M
$0.600

Input vs output · per provider

Cost calculator

Use this block to stress test gpt-oss-120b cost without a spreadsheet. All estimates come from public list rates in this page.

Provider

In: $0.180/M·Out: $0.800/M

0.018000¢ / req

0.040000¢ / req

Daily
$5.80
Monthly
$174
Annual
$2.1K

Model specifications

These fields describe gpt-oss-120b as we store it (source: OpenAI). They sit next to price so buyers can check limits and tools in one place.

Context window
131,072 tokens
Max output
131,072 tokens
Vision (images)
No
Tool / function calling
Yes
Streaming
No
Released
Aug 2025
Primary provider
OpenAI
Model family
N/A

Compare gpt-oss-120b

These links open full side by side pages for gpt-oss-120b. We picked pairs that people often shop together. 6 ready to open.

Frequently asked questions

Read these after the table if you want plain language around gpt-oss-120b rates. The short model note from our index: gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward ...

Yes. gpt-oss-120b is available on Openrouter, Openrouter, Baseten, Azure, Cerebras, Ovhcloud, Sambanova, Wandb, Ibm Watsonx, Novita, Groq, DeepInfra, Replicate, Together AI, Fireworks AI.

Also from OpenAI

Other models by OpenAI with live pricing in our catalog.