ByteDance: UI-TARS 7B

ByteDance: UI-TARS 7B

bytedance · Released Jul 22, 2025 Legacy
Awaiting
Review
Benchmarks pending

Performance Profile

Intelligence0Technical0Value7.8Content3
Intelligence 0/10
Technical 0/10
Content 3/10
Value 7.8/10

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement..

$0.10 / 1M
Input Price
$0.20 / 1M
Output Price
128,000 tokens
Context Window
2,048 tokens
Max Output
7B Parameters

Capabilities

Vision

Architecture

ModalityText + Image → Text
TokenizerOther
Parameters7B

How does ByteDance: UI-TARS 7B stack up?

Compare side-by-side with other legacy models.

Compare Models

Model Information

OpenRouter ID bytedance/ui-tars-1.5-7b
Providerbytedance
Release Date July 22, 2025
Context Length128,000 tokens
Max Completion2,048 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.10 $0.000100
Output $0.20 $0.000200

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
14,723ms
Best Latency (TTFT)
1 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Parasail