ByteDance: UI-TARS 7B
Analysis Summary
ByteDance: UI-TARS 7B sits in the Legacy tier on our leaderboard, ranked #340 of 551 published models on overall intelligence. At $0.100 input and $0.200 output per 1M tokens, it is among the most expensive on the market. It offers a standard large context window and supports vision.
Editorial notes
ByteDance UI-TARS 7B is a vision-capable model with no benchmark data available; its 7B parameter size and $0.1/$0.2 pricing suggest a lightweight specialist tool rather than a general-purpose business model.
Assessed May 5, 2026
Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?
Performance Profile
UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement..
Capabilities
How does ByteDance: UI-TARS 7B stack up?
Compare side-by-side with other legacy models.
Model Information
| OpenRouter ID |
bytedance/ui-tars-1.5-7b
|
| Provider | bytedance |
| Release Date | July 22, 2025 |
| Context Length | 128,000 tokens |
| Max Completion | 2,048 tokens |
| Status | Active |
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $0.10 | $0.000100 |
| Output | $0.20 | $0.000200 |
Live Performance
Live endpoint metrics — refreshed every 30 minutes.
External Resources
Explore Related Models
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: May 11, 2026 8:38 pm