by trycua
We’re hosting the Computer-Use Agents SOTA Challenge at Hack the North and online!
Track A (On-site @ UWaterloo): Reserved for participants accepted to Hack the North. 🏆 Prize: YC interview guaranteed.
Track B (Remote): Open to everyone worldwide. 🏆 Prize: Cash award.👉 Sign up here: trycua.com/hackathon
cua ("koo-ah") is Docker for Computer-Use Agents - it enables AI agents to control full operating systems in virtual containers and deploy them locally or to the cloud.
With the Computer SDK, you can:
With the Agent SDK, you can:
openai/
, openrouter/
, etc.) or our included local providers (huggingface-local/
, mlx/
)anthropic/claude-opus-4-1-20250805
(using Computer-Use Models)openai/computer-use-preview
openrouter/z-ai/glm-4.5v
huggingface-local/ByteDance-Seed/UI-TARS-1.5-7B
omniparser+{any LLM}
(using Composed Agents)huggingface-local/HelloKKMe/GTA1-7B+{any LLM}
huggingface/HelloKKMe/GTA1-32B+{any LLM}
vllm_hosted/HelloKKMe/GTA1-72B+{any LLM}
human/human
(using Human-in-the-Loop)Missing a model? Raise a feature request or contribute!
pip install cua-agent[all]
from agent import ComputerAgent agent = ComputerAgent( model="anthropic/claude-3-5-sonnet-20241022", tools=[computer], max_trajectory_budget=5.0 ) messages = [{"role": "user", "content": "Take a screenshot and tell me what you see"}] async for result in agent.run(messages): for item in result["output"]: if item["type"] == "message": print(item["content"][0]["text"])
*Configuration content*
pip install cua-computer[all]
from computer import Computer async with Computer( os_type="linux", provider_type="cloud", name="your-container-name", api_key="your-api-key" ) as computer: # Take screenshot screenshot = await computer.interface.screenshot() # Click and type await computer.interface.left_click(100, 100) await computer.interface.type("Hello!")
Module | Description | Installation |
---|---|---|
Lume | VM management for macOS/Linux using Apple's Virtualization.Framework | curl -fsSL https://raw.githubusercontent.com/trycua/cua/main/libs/lume/scripts/install.sh | bash |
Lumier | Docker interface for macOS and Linux VMs | docker pull trycua/lumier:latest |
Computer (Python) | Python Interface for controlling virtual machines | pip install "cua-computer[all]" |
Computer (Typescript) | Typescript Interface for controlling virtual machines | npm install @trycua/computer |
Agent | AI agent framework for automating tasks | pip install "cua-agent[all]" |
MCP Server | MCP server for using CUA with Claude Desktop | pip install cua-mcp-server |
SOM | Self-of-Mark library for Agent | pip install cua-som |
Computer Server | Server component for Computer | pip install cua-computer-server |
Core (Python) | Python Core utilities | pip install cua-core |
Core (Typescript) | Typescript Core utilities | npm install @trycua/core |
Join our Discord community to discuss ideas, get assistance, or share your demos!
Cua is open-sourced under the MIT License - see the LICENSE file for details.
The base image kasmweb/core-ubuntu-jammy
is maintained by Kasm Technologies and distributed under the Apache License 2.0. Usage of that image is subject to its own license terms.
Microsoft's OmniParser, which is used in this project, is licensed under the Creative Commons Attribution 4.0 International License (CC-BY-4.0) - see the OmniParser LICENSE file for details.
Some optional extras for this project depend on third-party packages that are licensed under terms different from the MIT License.
pip install "cua-agent[omni]"
) installs the cua-som
module, which includes ultralytics
and is licensed under the AGPL-3.0.When you choose to install and use such optional extras, your use, modification, and distribution of those third-party components are governed by their respective licenses (e.g., AGPL-3.0 for ultralytics
).
We welcome contributions to Cua! Please refer to our Contributing Guidelines for details.
Apple, macOS, and Apple Silicon are trademarks of Apple Inc.
Ubuntu and Canonical are registered trademarks of Canonical Ltd.
Microsoft is a registered trademark of Microsoft Corporation.
This project is not affiliated with, endorsed by, or sponsored by Apple Inc., Canonical Ltd., Microsoft Corporation, or Kasm Technologies.
Thank you to all our supporters!
f-trycua💻 | Pedro Piñera Buendía💻 | Amit Kumar💻 | Dung Duc Huynh (Kaka)💻 | Zayd Krunz💻 | Prashant Raj💻 | Leland Takamine💻 |
---|---|---|---|---|---|---|
ddupont💻 | Ethan Gutierrez💻 | Ricter Zheng💻 | Rahul Karajgikar💻 | trospix💻 | Evan smith💻 |
217 contributors
f-trycua
@f-trycua
ddupont808
@ddupont808
mdean808
@mdean808
jamesmurdza
@jamesmurdza
aktech
@aktech
onel
@onel
pepicrft
@pepicrft
mjspeck
@mjspeck
francedot
@francedot
ggcr
@ggcr
trospix
@trospix
jklapacz
@jklapacz
FinnBorge
@FinnBorge
allcontributors[bot]
@allcontributors