OpenAI pushes AI agent capabilities with new developer API

May Be Interested In:Big tech wants Trump to bully Australia. They should be careful what they wish for



Developers using the Responses API can access the same models that power ChatGPT Search: GPT-4o search and GPT-4o mini search. These models can browse the web to answer questions and cite sources in their responses.

That’s notable because OpenAI says the added web search ability dramatically improves the factual accuracy of its AI models. On OpenAI’s SimpleQA benchmark, which aims to measure confabulation rate, GPT-4o search scored 90 percent, while GPT-4o mini search achieved 88 percent—both substantially outperforming the larger GPT-4.5 model without search, which scored 63 percent.

Despite these improvements, the technology still has significant limitations. Aside from issues with CUA properly navigating websites, the improved search capability doesn’t completely solve the problem of AI confabulations, with GPT-4o search still making factual mistakes 10 percent of the time.

Alongside the Responses API, OpenAI released the open source Agents SDK, providing developers with free tools to integrate models with internal systems, implement safeguards, and monitor agent activities. This toolkit follows OpenAI’s earlier release of Swarm, a framework for orchestrating multiple agents.

These are still early days in the AI agent field, and things will likely improve rapidly. However, at the moment, the AI agent movement remains vulnerable to unrealistic claims, as demonstrated earlier this week when users discovered that Chinese startup Butterfly Effect’s Manus AI agent platform failed to deliver on many of its promises, highlighting the persistent gap between promotional claims and practical functionality in this emerging technology category.

share Share facebook pinterest whatsapp x print

Similar Content

Coco Gauff swapped the court for the red carpet Sunday night when she stunned at the Oscars
Tennis starlet Coco Gauff makes surprise appearance at Oscars 2025 ahead of competing at Indian Wells
Speech milestones to look out for in babies
Speech milestones to look out for in babies
Google News
Google News
David Duchovny and Gillian Anderson in New York in October 2015.
» 50 Scenes That Do Not Appear in the Fox ‘X-Files’ Revival
Asteroid 2024 YR4 no longer poses significant threat to Earth in 2032, NASA's latest analysis determines
Asteroid 2024 YR4 no longer poses significant threat to Earth in 2032, NASA’s latest analysis determines
Much Ado About Ken: Barbie style star is still making menswear shimmer
Much Ado About Ken: Barbie style star is still making menswear shimmer
Critical Watch: Today’s Pivotal Events | © 2025 | Daily News