AI translated#Artificial Intelligence #OpenAI

OpenAI Edges Closer to Launching AI Agent

21.01.2025 ForkLog

The programmer Tibor Blaho, renowned for accurate insights into upcoming AI products, has uncovered evidence of an OpenAI agent codenamed “Operator.”

Confirmed — the ChatGPT macOS desktop app has hidden options to define shortcuts for the desktop launcher to “Toggle Operator” and “Force Quit Operator” https://t.co/rSFobi4iPN pic.twitter.com/j19YSlexAS

— Tibor Blaho (@btibor91) January 19, 2025

According to his information, the desktop version of ChatGPT on macOS contains hidden functions to enable and disable “Operator.” Similar information was provided by an X user under the nickname M1.

Blaho also discovered mentions of the AI agent on the OpenAI website and its comparison with competitors’ solutions.

OpenAI website already has references to Operator/OpenAI CUA (Computer Use Agent) — “Operator System Card Table”, “Operator Research Eval Table” and “Operator Refusal Rate Table”

Including comparison to Claude 3.5 Sonnet Computer use, Google Mariner, etc.

(preview of tables… pic.twitter.com/OOBgC3ddkU

— Tibor Blaho (@btibor91) January 20, 2025

The figures in the table indicate that “Operator” is not entirely reliable in certain tasks.

In the OSWorld benchmark, which simulates a real computer environment, OpenAI’s AI agent scores 38.1%. This is higher than Anthropic’s solution but falls short of the 72.4% achieved by humans. Meanwhile, “Operator” surpasses human performance in WebVoyager, which assesses AI’s ability to navigate websites.

The neural network managed to create a Bitcoin wallet in 10% of cases. The success rate for registrations with a cloud provider is higher—at 60%.

Leaked charts indicate good performance of the AI agent in security, resisting attempts to perform “illegal actions” and seek “sensitive personal data.”

Back in November, it was reported that OpenAI planned to launch its own AI agent, “Operator.”

Подписывайтесь на ForkLog в социальных сетях

Telegram (основной канал) Facebook X

Found a mistake? Select it and press CTRL+ENTER

Рассылки ForkLog: держите руку на пульсе биткоин-индустрии!

Set in silicon

Google Enhances AI Features in Docs, Sheets, Slides, and Drive

OpenAI Integrates Shazam into ChatGPT

Nvidia CEO Views AI as a Job Creator, Not a Job Killer

Investors Disillusioned with SoftBank’s Bold Bet on OpenAI

Microsoft Introduces AI Agent Cowork for Microsoft 365

Australian Startup Develops Data Centres Using Human Brain Cells

China Embraces OpenClaw as AI Agents Gain Popularity