Build computer use agents that see screens, click buttons, type and run code
— with Cua
One unified interface for computer use models. Execute powerful workflows with structured outputs, multi-turn conversations, and custom tools.
# Learn more: cua.ai/docs
import os
from agent import ComputerAgent
from computer import Computer
# Set your API key from cua.ai/dashboard
os.environ["CUA_API_KEY"] = "sk_..."
# Initialize cloud computer (create from dashboard or CLI)
computer = Computer(
os_type="linux",
provider_type="cloud",
name="your-sandbox-name"
)
# Initialize agent with model and tools
agent = ComputerAgent(
model="cua/anthropic/claude-sonnet-4.5",
tools=[computer]
)
# Run agent with task
messages = [{"role": "user", "content": "Take a screenshot"}]
async for result in agent.run(messages):
print(result)Cloud-powered sandboxes for your computer use agents. Simple API integration, unlimited scale.
Create sandboxes, run tasks, and manage everything from your terminal
Run as many sandboxes as you need. No infrastructure management, no resource limits
Choose between Linux, Windows, and macOS sandboxes based on your automation needs
Access 100+ vision-language models from top providers with one API key
Only pay for what you use. Credit-based billing with no upfront costs or monthly minimums
Automatically route requests to the best model for your task, balancing performance and cost
Agent SDK, Computer SDK, macOS virtualization, and Docker images. All open source on GitHub
Generate large-scale UI datasets, capture agent trajectories, and run standardized benchmarks. Everything you need to train and evaluate computer-use agents.
Generate diverse UI screenshots with ground-truth bounding boxes, labels, and metadata for ML training
Capture multi-step agent interactions with full state tracking. Replay, intervene, and create training data
Evaluate agents on safety, click accuracy, and multi-step tasks. From basic interactions to complex workflows
Export datasets as Arrow/Parquet for ML training. Push directly to HuggingFace Hub with one command
Generate datasets across macOS, Windows, and Linux with authentic OS chrome, icons, and layouts
Seeded randomization and deterministic environments. Replay from any checkpoint for debugging
Visual replay of agent runs with step-by-step navigation. Freeze and intervene at any point
Pay only for what you use - credits for cloud compute and VLM inference
I am looking to...
Try before you commit - perfect for testing and evaluation
Flexible credits for cloud compute and Computer-Use VLMs, with managed cloud environments and inference.
100 credits per dollar
Scalable cloud containers tailored for large teams and organizations.
Credits are our unified currency for both compute time and AI inference
Per hour for cloud environments
Billed per minute. Sessions include full desktop access with persistent storage.
Per million tokens processed
Input and output tokens combined. Actual costs vary by model and usage.
Everything you need to know about CUA and computer-use agents
No DevOps. No infrastructure. Just ship.
Agent Runs to Date
No credit card required!