
Agent TARS is an open-source multimodal AI agent that leverages browser operations by visually interpreting web pages and seamlessly integrating with command lines and file systems.
Caution
DISCLAIMER: Agent TARS is still in Technical Preview stage and not stable yet. It's not recommended to use it in production.
Tip
Introduction Blog: https://agent-tars.com/2025/03/18/announcing-agent-tars-app
agent-tars-demo-01.mp4
For more showcases please head: https://agent-tars.com/showcase
- 🌐 Advanced Browser Operations: Executes sophisticated tasks like Deep Research and Operator functions through an agent framework, enabling comprehensive planning and execution.
- 🛠️ Comprehensive Tool Support: Integrates with search, file editing, command line, and Model Context Protocol (MCP) tools to handle complex workflows.
- 💻️ Enhanced Desktop App: A revamped UI with displays for browsers, multimodal elements, session management, model configuration, dialogue flow visualization, and browser/search status tracking.
- 🔄 Workflow Orchestration: Seamlessly connects GUI Agent tools—search, browse, explore links, and synthesize information into final outputs.
- ⚙️ Developer-Friendly Framework: Simplifies integration with UI-TARS and custom workflow creation for GUI Agent projects.
You can download the latest release version of Agent TARS from our releases page.
Note: If you have Homebrew installed, you can install UI-TARS Desktop by running the following command:
brew install --cask agent-tars
See Quick Start.
Please read the contributing guide and let's build Agent TARS together.
This repo has adopted the ByteDance Open Source Code of Conduct. Please check Code of conduct for more details.
Agent TARS is more than a tool —— it’s a platform for the future of multimodal agents. Upcoming enhancements include:
- Ongoing optimization of agent framework —— GUI Agent synergy with expanded model compatibility.
- Expansion to mobile device operations with cross-platform framework.
- Integration with game environments for AI-driven gameplay.
Thanks to:
- The browser-use project whose work inspired us to better operate browsers
- @alexchenzl for developing the innovative nanobrowser Chrome extension, which provided valuable technical references during our browser control in Electron
- @EGOIST for creating the remarkable AI chatbot ChatWise, from which we drew significant inspiration for local browser detection and local browser search.
- Anthropic for building the Model Context Protocol to help us better manage local tools
- puppeteer team for their excellent browser automation toolkit that greatly enhanced our workflow
- Web Infra team and the Rslib project helps us build our libraries better.
- The UI-TARS and UI-TARS-desktop development teams for laying crucial foundational frameworks
- All contributors and members of the open-source community who supported this journey with their expertise and encouragement
Agent TARS is Apache License 2.0 licensed.