OpenAI launches new tools to help businesses build AI agents

By TechBrunch | March 11, 2025 | 5 Mins Read


On Tuesday, OpenAI released new tools designed to help developers and businesses build AI agents, automated systems that can accomplish tasks independently, using the company's own AI models and frameworks.

The tools are part of OpenAI's new Responses API, which lets businesses develop custom AI agents that can perform web searches, scan company files, and navigate websites. The Responses API effectively replaces OpenAI's Assistants API, which is scheduled to be sunset in the first half of 2026.

The hype around AI agents has grown dramatically in recent years, even though the tech industry has struggled to show people, or even define, what “AI agents” really are. In the latest example of agent hype running ahead of utility, the Chinese startup Butterfly Effect went viral earlier this week for a new AI agent platform called Manus, which users quickly discovered did not deliver on many of the company's promises.

In other words, the stakes are high for OpenAI to get agents right.

“It's very easy to demonstrate an agent,” Olivier Godement, OpenAI's API product head, told TechCrunch in an interview. “Scaling agents is pretty difficult, and it's very difficult to get people to use it.”

Earlier this year, OpenAI introduced two AI agents in ChatGPT: Operator, which navigates websites on your behalf, and deep research, which compiles research reports for you. Both tools offered a glimpse of what agent technology could achieve, but they left a lot to be desired when it came to autonomy.

Now, with the Responses API, OpenAI wants to sell access to the components that power AI agents, allowing developers to build their own Operator-style and deep research-style agent applications. OpenAI hopes that developers can create applications with its agent technology that feel more autonomous than what is currently available.

Using the Responses API, developers can tap the same AI models (in preview) that run under the hood of OpenAI's ChatGPT Search web search tool: GPT-4o search and GPT-4o mini search. The models can browse the web to answer questions and cite their sources when generating replies.

OpenAI claims that GPT-4o search and GPT-4o mini search are highly accurate. On the company's SimpleQA benchmark, which measures a model's ability to answer short, fact-seeking questions, GPT-4o search scores 90% and GPT-4o mini search scores 88% (higher is better). For comparison, GPT-4.5, OpenAI's much larger and more recently released model, scores just 63%.
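The scores above can be read as error rates by subtracting each from 100. A minimal sketch, using only the three SimpleQA figures quoted in the article (the helper name `error_rate` is made up for illustration):

```python
# SimpleQA scores as quoted in the article: percent of questions answered correctly.
simpleqa_scores = {
    "GPT-4o search": 90,
    "GPT-4o mini search": 88,
    "GPT-4.5": 63,
}


def error_rate(score_percent):
    """Return the percent of questions answered incorrectly (hypothetical helper)."""
    return 100 - score_percent


for model, score in simpleqa_scores.items():
    print(f"{model}: {score}% correct, {error_rate(score)}% wrong")
```

Read this way, even the best-scoring model in the comparison still misses one question in ten.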

It is not necessarily surprising that AI-powered search tools are more accurate than traditional AI models: in theory, GPT-4o search can simply look up the correct answer. However, web search does not make hallucinations a solved problem. Beyond factual accuracy, AI search tools tend to struggle with short navigational queries (such as “Lakers score today”), and recent reports suggest that ChatGPT's citations aren't always reliable.

The Responses API also includes a file search utility that can quickly scan files in a company's databases to retrieve information. (OpenAI says it does not train models on these files.) Additionally, developers using the Responses API can tap OpenAI's Computer-Using Agent (CUA) model. This model generates mouse and keyboard actions, allowing developers to automate computer-use tasks such as data entry and app workflows.
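As a rough illustration of how the web search and file search tools might be combined in one request, the sketch below only builds a request body and sends nothing. The article does not give API details, so the endpoint URL, the `gpt-4o` model name, the `web_search_preview` and `file_search` tool identifiers, and the `vector_store_ids` field are assumptions based on OpenAI's public documentation at launch, and the vector store ID is a made-up placeholder.

```python
import json

# Assumed endpoint for the Responses API; the article does not state it.
RESPONSES_URL = "https://api.openai.com/v1/responses"


def build_request(question, vector_store_ids=None):
    """Build a request body that enables web search and, optionally, file search.

    The tool identifiers and field names are assumptions, not taken from the article.
    """
    tools = [{"type": "web_search_preview"}]
    if vector_store_ids:
        tools.append({"type": "file_search", "vector_store_ids": list(vector_store_ids)})
    return {"model": "gpt-4o", "input": question, "tools": tools}


if __name__ == "__main__":
    # "vs_hypothetical_123" is a placeholder, not a real vector store ID.
    body = build_request("What changed in our refund policy?", ["vs_hypothetical_123"])
    print(f"POST {RESPONSES_URL}")
    print(json.dumps(body, indent=2))
```

Keeping request construction separate from the network call makes the shape of the request easy to inspect before any API key or billing is involved.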

According to OpenAI, companies can optionally run the CUA model, which is being released in research preview, locally on their own systems. The consumer version of CUA available in Operator can only perform actions on the web.

To be clear, the Responses API does not solve all the technical issues that plague AI agents today.

Web search narrows the accuracy gap but does not close it: GPT-4o search still gets 10% of factual questions wrong, on top of the problems with short navigational queries and unreliable citations.

In a blog post provided to TechCrunch, OpenAI said the CUA model is “still not very reliable to automate operating system tasks,” and that it is prone to making “careless” mistakes.

However, OpenAI said these are early iterations of its agent tools and that it is constantly working to improve them.

In addition to the Responses API, OpenAI is releasing an open source toolkit called the Agents SDK. It provides free tools for integrating models with internal systems, implementing safeguards, and monitoring AI agent activity for debugging and optimization. The Agents SDK is a follow-up of sorts to OpenAI's Swarm, a multi-agent orchestration framework released late last year.

Godement hopes OpenAI can close the gap between AI agent demos and products this year, and in his opinion, “agents are AI's most influential application.” That echoes a declaration OpenAI CEO Sam Altman made in January: that 2025 is the year AI agents enter the workforce.

Whether or not 2025 really turns out to be the “year of the AI agent,” OpenAI's latest release shows that the company wants to move from flashy agent demos to impactful tools.


