UI TARS Desktop
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
At a glance.
A compact read before the deeper capability notes and official setup links.
Core features.
Feature cards focus on what the tool helps users do, not generated setup commands.
TARS \ is a Multimodal AI Agent stack, currently shipping two projects: Agent TARS and UI-TARS-desktop:
Agent TARS is a general multimodal AI Agent stack, it brings the power of GUI Agent and Vision into your terminal, computer, browser and product.
It aims to provide a workflow that is closer to human-like task completion through cutting-edge multimodal LLMs and seamless integration with various real-world MCP tools.
UI-TARS Desktop is a desktop application that provides a native GUI Agent based on the UI-TARS model.
remote computer as well as browser operators.
\[2025-11-05\] ๐ We're excited to announce the release of Agent TARS CLI v0.3.0!
Additionally, it features exclusive support for AIO agent Sandbox as isolated all-in-one tools execution environment.
\[2025-04-17\] - ๐ We're thrilled to announce the release of new UI-TARS Desktop application v0.1.0, featuring a redesigned Agent UI.
Agent / Skill / MCP / Workflow fit.
This panel keeps technical format separate from the user-facing AI category.
Official setup path.
Generated install snippets are intentionally not mirrored here because they drift. The page links to source-owned setup docs instead.
Evidence and adoption notes.
These notes help a user decide whether to investigate the official project further.
Source repository last pushed at 2026-05-18T02:55:49Z.
Generated from source metadata; confirm operational details in the official project before adopting it.
Review the upstream license, maintenance activity, and issue history before using it in production.