UI-TARS 7B pricing
UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement... The table below has one provider row with input and output prices in US dollars per million tokens. The lowest input price is currently $0.100 and the lowest output price is $0.200.
Pricing across providers
Each row is a provider that serves UI-TARS 7B and whose token pricing we track. The cheapest input price in this snapshot is from Openrouter. The bar chart shows the same input and output prices per million tokens for a quick scan.
| Provider | Input / 1M | Output / 1M | Cached input | Batch |
|---|---|---|---|---|
| Openrouter | $0.100 | $0.200 | — | — |
Input vs output · 1M tokens
Cost calculator
The calculator uses the same dollars per million tokens as the table. Adjust sliders to see how UI-TARS 7B cost scales with traffic.
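The arithmetic behind a calculator like this can be sketched in a few lines of Python. This is a minimal illustration, not the page's actual implementation: the function names and the 30-day month are assumptions, and the only values taken from this page are the $0.100 input and $0.200 output prices per million tokens.

```python
from decimal import Decimal

# Prices from the provider table on this page (USD per 1M tokens).
INPUT_USD_PER_M = Decimal("0.100")
OUTPUT_USD_PER_M = Decimal("0.200")
TOKENS_PER_MILLION = Decimal(1_000_000)


def request_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Cost of a single request in USD at the listed per-million rates."""
    total = (Decimal(input_tokens) * INPUT_USD_PER_M
             + Decimal(output_tokens) * OUTPUT_USD_PER_M)
    return total / TOKENS_PER_MILLION


def monthly_cost_usd(input_tokens: int, output_tokens: int,
                     requests_per_day: int, days: int = 30) -> Decimal:
    """Projected cost in USD; the 30-day default is an illustrative assumption."""
    return request_cost_usd(input_tokens, output_tokens) * requests_per_day * days


if __name__ == "__main__":
    per_request = request_cost_usd(1_000, 500)
    print(f"per request: ${per_request} ({per_request * 100}¢)")
    print(f"per month at 10,000 req/day: ${monthly_cost_usd(1_000, 500, 10_000)}")
```

`Decimal` is used instead of floats so that very small per-request amounts are not distorted by binary rounding. Cached-input and batch discounts are not modeled, since the table lists none for this provider.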
Model specifications
These fields describe UI-TARS 7B as we store it (source: Openrouter). They sit next to price so buyers can check limits and tools in one place.
- Context window: 131,072 tokens
- Max output: 2,048 tokens
- Vision (images): Yes
- Tool / function calling: No
- Streaming: No
- Released: Jul 2025
- Primary provider: Openrouter
- Model family: N/A
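When sizing requests against these limits, a small pre-flight check can help. The sketch below is hypothetical: `plan_output_budget` is an illustrative helper, not part of any provider SDK, and it assumes the context window covers prompt and output tokens together, which this page does not state. Only the 131,072 and 2,048 token figures come from the specifications above.

```python
# Limits from the specification list on this page.
CONTEXT_WINDOW_TOKENS = 131_072
MAX_OUTPUT_TOKENS = 2_048


def plan_output_budget(prompt_tokens: int, requested_output_tokens: int) -> int:
    """Return the largest output budget that fits the listed limits.

    Assumption: prompt and output tokens share the context window.
    Raises ValueError if the prompt leaves no room for any output.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_WINDOW_TOKENS - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt fills or exceeds the context window")
    # Clamp to the per-response cap and to what is left in the window.
    return min(requested_output_tokens, MAX_OUTPUT_TOKENS, remaining)


if __name__ == "__main__":
    print(plan_output_budget(1_000, 4_096))    # capped by max output
    print(plan_output_budget(131_000, 4_096))  # capped by remaining window
```

Clamping rather than rejecting oversized output requests is a design choice for this sketch; a stricter caller could raise an error instead.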
Compare UI-TARS 7B
Jump into a comparison when you want one table for two models instead of two tabs. 6 curated matches for UI-TARS 7B.
Popular UI-TARS 7B comparisons
- UI-TARS 7B vs GPT-4o
- UI-TARS 7B vs GPT-4o mini
- UI-TARS 7B vs Claude Sonnet 4.6
- UI-TARS 7B vs Gemini 2.0 Flash
- UI-TARS 7B vs o3
- UI-TARS 7B vs Llama 3.1 70B
Frequently asked questions
Quick answers to frequently asked questions about UI-TARS 7B pricing and limits.
Also from Openrouter
Other models by Openrouter with live pricing in our catalog.