The Responses API also includes a file search utility that can quickly scan across files in a company’s databases to retrieve information. (OpenAI claims that it won’t train models on these files.) In addition, developers using the Responses API can tap OpenAI’s Computer-Using Agent (CUA) model, which powers Operator. The model generates mouse and keyboard actions, allowing developers to automate computer use tasks like data entry and app workflows.
Enterprises can optionally run the CUA model, which is releasing in research preview, locally on their own systems, OpenAI said. The consumer version of the CUA available in Operator can only take actions on the web.
To be clear, the Responses API won’t solve all the technical problems plaguing AI agents today.
While AI-powered search tools are more accurate than traditional AI models – a fact that is unsurprising given they can just look up the right answer – web search does not render AI hallucinations a solved problem. GPT-4o search still gets 10% of factual questions wrong. Beyond their accuracy, AI search tools also tend to struggle with short, navigational queries (such as “Lakers score today”), and recent reports suggest that ChatGPT’s citations aren’t always reliable.
In a blog post provided to TechCrunch, OpenAI said that the CUA model is “not yet highly reliable for automating tasks on operating systems,” and that it’s susceptible to making “inadvertent” mistakes.
However, OpenAI said these are early iterations of their agent tools, and it’s constantly working to improve them.
Alongside the Responses API, OpenAI is releasing an open-source toolkit called the Agents SDK, which offers developers free tools to integrate models with their internal systems, put in place safeguards, and monitor AI agent activities for debugging and optimization purposes. The Agents SDK is a follow-up of sorts to OpenAI’s Swarm, a framework for multi-agent orchestration that the company released late last year.
Godement said he hopes OpenAI can bridge the gap between AI agent demos and products this year, and that, in his opinion, “agents are the most impactful application of AI that will happen.” That echoes a proclamation OpenAI CEO Sam Altman made in January: that 2025 is the year AI agents enter the workforce.
Leave feedback about this
You must be logged in to post a comment.