ByteDance: UI-TARS 7B

ByteDance: UI-TARS 7B

bytedance · Released Jul 22, 2025 Legacy
Intelligence #459 / 576
24.6 Our Score
Speed
— Not reported
Input #195 / 576
$0.100 per 1M tokens
Output #180 / 576
$0.200 per 1M tokens
Context #329 / 576
128,000 tokens

Analysis Summary

ByteDance's UI-TARS 7B is a 7-billion parameter vision model released July 2025, designed for UI understanding and interaction tasks with image input support and a 128K context window. No benchmark data is available, so its reasoning, instruction-following, or task completion performance cannot be assessed.

The vision capability and UI-focused design suggest potential value for screen-based automation, GUI testing, or visual workflow agents. However, without benchmark evidence, it is not possible to confirm whether the model performs reliably enough for production use in these scenarios.

At $0.10 input / $0.20 output, pricing is low and worth experimenting with for teams building UI automation pipelines. Independent evaluation on representative tasks is essential before committing to production deployment.

Assessed June 6, 2026

Editorial notes

ByteDance UI-TARS 7B is a vision-capable model with no benchmark data; its specialised UI understanding focus may suit narrow automation tasks, but capability cannot be confirmed without performance evidence.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence0Technical0Value7.8Content3
Intelligence 0/10
Technical 0/10
Content 3/10
Value 7.8/10

How ByteDance: UI-TARS 7B compares

Its 128K-token context window is larger than 43% of the models we list. At $0.10 per million input tokens it is cheaper than 66% of comparable models.

About ByteDance: UI-TARS 7B

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement..

7B Parameters

Capabilities

Vision

How does ByteDance: UI-TARS 7B stack up?

Compare side-by-side with other legacy models.

Compare Models

Model Information

OpenRouter ID bytedance/ui-tars-1.5-7b
Providerbytedance
Release Date July 22, 2025
Context Length128,000 tokens
Max Completion2,048 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.10 $0.000100
Output $0.20 $0.000200

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

100%
Avg Uptime
1,003ms
Best Latency (TTFT)
12 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Parasail

Frequently asked questions about ByteDance: UI-TARS 7B

How much does ByteDance: UI-TARS 7B cost?

ByteDance: UI-TARS 7B costs $0.10 per million input tokens and $0.20 per million output tokens.

What is the context window of ByteDance: UI-TARS 7B?

ByteDance: UI-TARS 7B has a context window of 128,000 tokens (128K).

What can ByteDance: UI-TARS 7B do?

ByteDance: UI-TARS 7B supports image/vision input.

Who created ByteDance: UI-TARS 7B?

ByteDance: UI-TARS 7B is developed by ByteDance and was released on July 22, 2025.