It can also ask follow-up questions to further personalize the tasks it completes, such as login information for other websites. Users can take control of the screen at any time.
The model underpinning Operator is a Computer-Using Agent (CUA) that combines GPT-4o's vision mode to "see" what's on the user's screen through screenshots with graphical user interfaces (GUIs) that ...
The new tool, called Operator, is an AI agent: It relies on an AI model trained on both text and images to interpret commands and figure out how to use a web browser to execute them. OpenAI claims it ...
Generative artificial intelligence heavyweight OpenAI on Thursday previewed an AI agent that can carry out tasks on the web for users, as it seeks to enhance its chatbot amid intensifying competition.
Get the latest updates on our best-in-class productivity apps and intelligent cloud services. Rethink productivity, streamline business processes, and protect your business with Microsoft 365.
Get free access to Word, Excel, and PowerPoint using Microsoft 365 for the web (formerly Office).