What is the core technology behind Operator?

Xiao Huang: What is the core technology behind Operator?

DOORM: The core technology of Operator is the CUA (Computer-Using Agent) model. CUA combines OpenAI’s multimodal GPT-4o large language model with reinforcement learning techniques, enabling it to “see” and “operate” computer screens like humans.