Close Menu
TechBrunchTechBrunch
  • Home
  • AI
  • Apps
  • Crypto
  • Security
  • Startups
  • TechCrunch
  • Venture

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

OpenAI says AI browsers can always be vulnerable to prompt injection attacks

December 22, 2025

Famous Israeli VC John Medved, who was diagnosed with ALS, championed technology to improve his life.

December 21, 2025

Sequoia partners debunk Brown shooting theory and test new leadership

December 20, 2025
Facebook X (Twitter) Instagram
TechBrunchTechBrunch
  • Home
  • AI

    OpenAI seeks to extend human lifespans with the help of longevity startups

    January 17, 2025

    Farewell to the $200 million woolly mammoth and TikTok

    January 17, 2025

    Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

    January 17, 2025

    Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

    January 16, 2025

    Apple suspends AI notification summaries for news after generating false alerts

    January 16, 2025
  • Apps

    Google brings Pixel 6 and new devices to Material3 Expressive, along with other features, to the Pixel 6 and new devices

    September 3, 2025

    Google's NoteBookLM now allows you to customize the tone of your AI podcasts

    September 3, 2025

    Roblox expands the use of age estimation techniques and introduces standardized assessments

    September 3, 2025

    Instagram finally launches the iPad app

    September 3, 2025

    Complete the 2025 Confusion Builder Stage Agenda with the Maximum Scaling Voice

    September 3, 2025
  • Crypto

    Coinbase resumes user onboarding in India, plans to introduce fiat currency next year

    December 7, 2025

    New report examines how David Sachs benefits from Trump administration role

    November 30, 2025

    Why Benchmark made a rare crypto bet on trading app Fomo with $17 million Series A

    November 6, 2025

    Coinbase CEO Brian Armstrong trolls prediction markets

    November 1, 2025

    Solana co-founder Anatoly Yakovenko is a big fan of agent coding

    October 29, 2025
  • Security

    OpenAI says AI browsers can always be vulnerable to prompt injection attacks

    December 22, 2025

    Hundreds of Cisco customers are vulnerable to new Chinese hacking campaign, researchers say

    December 19, 2025

    Hacking, theft and destruction: 2025's worst data breaches

    December 19, 2025

    UK NHS technology provider confirms data breach

    December 18, 2025

    Cisco announces Chinese hackers are exploiting customers with new zero-day attack

    December 17, 2025
  • Startups

    7 days left: Founders and VCs save over $300 on all stage passes

    March 24, 2025

    AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

    March 24, 2025

    20 Hottest Open Source Startups of 2024

    March 22, 2025

    Andrill may build a weapons factory in the UK

    March 21, 2025

    Startup Weekly: Wiz bets paid off at M&A Rich Week

    March 21, 2025
  • TechCrunch

    OpenSea takes a long-term view with a focus on UX despite NFT sales remaining low

    February 8, 2024

    AI will save software companies' growth dreams

    February 8, 2024

    B2B and B2C are not about who buys, but how you sell

    February 5, 2024

    It's time for venture capital to break away from fast fashion

    February 3, 2024

    a16z's Chris Dixon believes it's time to focus on blockchain use cases rather than speculation

    February 2, 2024
  • Venture

    Famous Israeli VC John Medved, who was diagnosed with ALS, championed technology to improve his life.

    December 21, 2025

    Sequoia partners debunk Brown shooting theory and test new leadership

    December 20, 2025

    Resolve AI, a startup led by former Splunk executives, reaches $1 billion Series A valuation

    December 20, 2025

    Where will investors bet next year? AI, AI, AI.

    December 19, 2025

    Crafton raises stakes in India with new $670 million funding

    December 19, 2025
TechBrunchTechBrunch

OpenAI says AI browsers can always be vulnerable to prompt injection attacks

TechBrunchBy TechBrunchDecember 22, 20255 Mins Read
Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
Share
Facebook Twitter LinkedIn Pinterest Telegram Email


OpenAI is working to harden its Atlas AI browser against cyberattacks and acknowledges that prompt injection is a type of attack that manipulates an AI agent to follow malicious instructions hidden in web pages or emails. This is a risk that isn't going away anytime soon, raising questions about how securely AI agents can operate on the open web.

“As with fraud and social engineering on the web, instant attacks are unlikely to be fully 'solved',” OpenAI said in a blog post on Monday, detailing how the company is hardening Atlas' defenses to counter the constant attacks. The company acknowledged that ChatGPT Atlas' “Agent Mode” “expands the surface of security threats.”

OpenAI announced its ChatGPT Atlas browser in October, and security researchers have rushed to release a demo showing that you can change the behavior of the underlying browser by writing a few words in a Google Doc. On the same day, Brave published a blog post explaining how indirect prompt injection is an organizational challenge for AI-powered browsers, including Perplexity's Comet.

OpenAI isn't the only company to realize that prompt-based injection isn't going away. Britain's National Cyber ​​Security Center warned earlier this month that prompt injection attacks on generative AI applications “may not be fully mitigated”, leaving websites at risk of data breaches. UK government agencies have advised cyber experts to reduce the risk and impact of immediate injections, rather than thinking they can “stop” an attack.

Regarding OpenAI, the company said, “We believe rapid injection is a long-term AI security challenge, and we need to continually strengthen our defenses against it.”

What's the company's answer to this Sisyphean-like challenge? The company says its proactive and rapid response cycle is showing early promise in helping companies discover new attack strategies before they can be exploited “in the wild.”

This is not entirely different from what competitors like Anthropic and Google claim. This means defenses must be layered and continually stress-tested to combat the persistent risk of prompt-based attacks. For example, recent efforts at Google have focused on architectural and policy-level controls for agent systems.

But what OpenAI does differently is its “LLM-based automated attacker.” The attacker is essentially a bot trained by OpenAI using reinforcement learning to play the role of a hacker looking for a way to secretly send malicious instructions to an AI agent.

Bots can test attacks in a simulation before actually using them, and the simulator shows how the target AI will think and act if it recognizes the attack. The bot can then study that response, fine-tune its attack, and try again and again. In theory, OpenAI's bots should be able to discover flaws faster than real-world attackers, since insights into the target AI's internal reasoning are inaccessible to outsiders.

This is a common tactic in AI safety testing. Build an agent to find edge cases and quickly test it in simulation.

“Our [reinforcement learning]”Trained attackers can coax agents into executing long-lasting, sophisticated, and harmful workflows that unfold over dozens (or even hundreds) of steps. We also observed novel attack strategies that had not appeared in human red teaming efforts or external reports,” OpenAI wrote.

Screenshot showing a prompt injection attack on OpenAI browser.Image credit: OpenAI

In a demo (partially pictured above), OpenAI showed how an automated attacker could sneak a malicious email into a user's inbox. Later, when the AI ​​agent scanned the inbox, it followed the instructions hidden in the email and sent a resignation message instead of creating an out-of-office reply. However, the company says that after a security update, “Agent Mode” was able to successfully detect the prompt injection attempt and flag the user.

The company says prompt injections are difficult to defend against in a fool-proof manner, but it relies on extensive testing and faster patch cycles to harden systems before they appear in an actual attack.

An OpenAI spokesperson declined to say whether Atlas' security updates led to a measurable reduction in successful injections, but said the company has been working with third parties to harden Atlas against rapid injections since before its launch.

Rami McCarthy, principal security researcher at cybersecurity firm Wiz, said reinforcement learning is one way to continually adapt to an attacker's behavior, but it's only part of the picture.

“A useful way to infer risk in an AI system is to multiply autonomy with access,” McCarthy told TechCrunch.

“Agent browsers tend to be at the difficult end of the spectrum, which is a combination of moderate autonomy and very high access,” McCarthy said. “Many of the current recommendations reflect that trade-off: Restricting login access primarily reduces risk, but requiring review of confirmation requests constrains autonomy.”

These are two of OpenAI's recommendations to help users reduce their own risks, and a spokesperson said Atlas is also trained to obtain confirmation from users before sending messages or making payments. OpenAI also suggests that users give the agent specific instructions, rather than giving the agent access to their inbox and telling them to “perform the required action.”

According to OpenAI, “The wide tolerance makes it easier for hidden or malicious content to impact agents, even when safety measures are in place.”

OpenAI says protecting Atlas users from prompt injections is a top priority, but McCarthy is skeptical about the return on investment for the risk-prone browser.

“For most everyday use cases, agent browsers still don't provide enough value to justify their current risk profile,” McCarthy told TechCrunch. “Even though that access is what makes them powerful, given their access to sensitive data like email and payment information, the risks are high. That balance will evolve, but the trade-offs are still very real today.”



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

Hundreds of Cisco customers are vulnerable to new Chinese hacking campaign, researchers say

December 19, 2025

Hacking, theft and destruction: 2025's worst data breaches

December 19, 2025

UK NHS technology provider confirms data breach

December 18, 2025

Cisco announces Chinese hackers are exploiting customers with new zero-day attack

December 17, 2025

Hacking group says it's blackmailing Pornhub after stealing users' viewing data

December 16, 2025

Google and Apple release emergency security updates after zero-day attack

December 12, 2025

Leave A Reply Cancel Reply

Top Reviews
Editors Picks

7 days left: Founders and VCs save over $300 on all stage passes

March 24, 2025

AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

March 24, 2025

20 Hottest Open Source Startups of 2024

March 22, 2025

Andrill may build a weapons factory in the UK

March 21, 2025
About Us
About Us

Welcome to Tech Brunch, your go-to destination for cutting-edge insights, news, and analysis in the fields of Artificial Intelligence (AI), Cryptocurrency, Technology, and Startups. At Tech Brunch, we are passionate about exploring the latest trends, innovations, and developments shaping the future of these dynamic industries.

Our Picks

OpenAI says AI browsers can always be vulnerable to prompt injection attacks

December 22, 2025

Famous Israeli VC John Medved, who was diagnosed with ALS, championed technology to improve his life.

December 21, 2025

Sequoia partners debunk Brown shooting theory and test new leadership

December 20, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

© 2025 TechBrunch. Designed by TechBrunch.
  • Home
  • About Tech Brunch
  • Advertise with Tech Brunch
  • Contact us
  • DMCA Notice
  • Privacy Policy
  • Terms of Use

Type above and press Enter to search. Press Esc to cancel.