Close Menu
TechBrunchTechBrunch
  • Home
  • AI
  • Apps
  • Crypto
  • Security
  • Startups
  • TechCrunch
  • Venture

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

Instagram lets you share Spotify songs with your story to your sound

June 30, 2025

At every stage of TechCrunch, Charles Hudson tells us what investors really see

June 30, 2025

Baidu’s AI Breakthrough Hasn’t Been Priced In — Why BIDU Could Hit $120 Soon

June 30, 2025
Facebook X (Twitter) Instagram
TechBrunchTechBrunch
  • Home
  • AI

    OpenAI seeks to extend human lifespans with the help of longevity startups

    January 17, 2025

    Farewell to the $200 million woolly mammoth and TikTok

    January 17, 2025

    Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

    January 17, 2025

    Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

    January 16, 2025

    Apple suspends AI notification summaries for news after generating false alerts

    January 16, 2025
  • Apps

    Instagram lets you share Spotify songs with your story to your sound

    June 30, 2025

    The best iPad app to unleash and explore your creativity

    June 30, 2025

    Privacy-centric app maker Proton sues Apple over anti-competitive practices and charges alleged

    June 30, 2025

    Google is adopting AI in classrooms, including new Gemini tools for educators and chatbots for students

    June 30, 2025

    The former meta engineer has built AI tools to plan every detail of a trip

    June 30, 2025
  • Crypto

    Vitalik Buterin reserves for Sam Altman's global project

    June 28, 2025

    Calci will close a $185 million round as rival Polymeruk reportedly seeks $200 million

    June 25, 2025

    Stablecoin Evangelist: Katie Haun's Battle of Digital Dollars

    June 22, 2025

    Hackers steal and destroy millions of Iran's biggest crypto exchanges

    June 18, 2025

    Unique, a new social media app

    June 17, 2025
  • Security

    US government overthrows North Korea's major “workers” management

    June 30, 2025

    Mexican drug cartel hackers spy on FBI officials' phones to track and kill informants, the report says

    June 30, 2025

    FBI, cybersecurity firms say prolific hacking crews are currently targeting airlines and transportation sectors

    June 28, 2025

    Prolific cybercrime gangs currently targeting the airline and transportation sector

    June 27, 2025

    US and French authorities confirm arrest of a violation form hacker

    June 26, 2025
  • Startups

    7 days left: Founders and VCs save over $300 on all stage passes

    March 24, 2025

    AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

    March 24, 2025

    20 Hottest Open Source Startups of 2024

    March 22, 2025

    Andrill may build a weapons factory in the UK

    March 21, 2025

    Startup Weekly: Wiz bets paid off at M&A Rich Week

    March 21, 2025
  • TechCrunch

    OpenSea takes a long-term view with a focus on UX despite NFT sales remaining low

    February 8, 2024

    AI will save software companies' growth dreams

    February 8, 2024

    B2B and B2C are not about who buys, but how you sell

    February 5, 2024

    It's time for venture capital to break away from fast fashion

    February 3, 2024

    a16z's Chris Dixon believes it's time to focus on blockchain use cases rather than speculation

    February 2, 2024
  • Venture

    At every stage of TechCrunch, Charles Hudson tells us what investors really see

    June 30, 2025

    From $5 to Financial Empowerment: Why Stash co-founder Brandon Krieg is a must-see for TechCrunch All Stage 2025

    June 30, 2025

    A comprehensive list of 2025 tech layoffs

    June 30, 2025

    How to prepare for a second semester salary increase now live in 2025

    June 30, 2025

    Tiffany is lucky to have won a VCS at TC at every stage.

    June 30, 2025
TechBrunchTechBrunch

Google DeepMind launches new organization focused on AI safety

TechBrunchBy TechBrunchFebruary 21, 20246 Mins Read
Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
Share
Facebook Twitter LinkedIn Pinterest Telegram Email


If you ask Gemini, Google's flagship GenAI model, to write deceptive content about the upcoming US presidential election, it will do so, given the right prompts. Ask about future Super Bowl games and play-by-play is invented. Or if you ask about the implosion of the Titan submarine, you'll be given disinformation with quotes that seem convincing but aren't true.

Needless to say, this is a bad look for Google and has drawn the ire of policymakers who are frustrated that GenAI tools can be easily used to disinformation and mislead the public.

So Google has cut thousands of jobs from last quarter and focused its investments on AI safety. At least, that's the official story.

This morning, Gemini and Google DeepMind, the AI ​​research and development arm behind many of Google's recent GenAI projects, announced the creation of a new organization called AI Safety and Alignment. It is made up of existing teams working on AI safety, but has expanded to include new organizations as well. A specialized cohort of GenAI researchers and engineers.

Beyond the job listings posted on DeepMind's site, Google has not disclosed how many people the new organization will hire. However, it has been revealed that AI Safety and Alignment will include a new team focused on safety around artificial general intelligence (AGI), or virtual systems that can perform any task a human can do.

With a similar mission to Super Alignment rival OpenAI, which was founded last July, the new team within AI Safety and Alignment will build on DeepMind's existing AI safety-focused research team in London. We work together with Scalable Alignment. The team is also seeking solutions to control technology challenges. Superintelligent AI that has not yet been realized.

Why are two groups working on the same problem? A fair question, but also a speculative one given Google's current reluctance to reveal details. However, it seems worth noting that the new team (within AI Safety and Alignment) is located in the US rather than across the pond, closer to Google's headquarters. During this period, the company is actively moving to keep pace with its AI rivals while planning responsible projects. , a measured approach to AI.

Other teams in the AI ​​Safety and Alignment organization are responsible for developing and incorporating specific safety measures into Google's Gemini models, both current and in development. Safety covers a wide range of areas. But some of the organization's short-term focuses will be on preventing inappropriate medical advice, keeping children safe and “preventing the spread of bigotry and other injustices.”

Anka Dragan, a former Waymo staff research scientist and computer science professor at the University of California, Berkeley, will lead the team.

“Our work [at the AI Safety and Alignment organization] “We aim to enable models to better and more reliably understand human preferences and values,” Dragan told TechCrunch via email. It is about countering hostile attacks and accounting for the pluralism and dynamic nature of human values ​​and perspectives. ”

Dragan's consulting work with Waymo on AI safety systems may raise some eyebrows given the Google self-driving car venture's recent shaky driving performance.

So is her decision to split her time between DeepMind and the University of California, Berkeley, where she leads a lab focused on algorithms for human-AI and robot interaction. Issues as serious as AGI safety, as well as the long-term risks AI safety and coalitions are looking to study, such as preventing AI from “supporting terrorism” or “destabilizing society,” include: Some may think it requires full-time attention.

But Dragan maintains that the research at UC Berkeley's lab and DeepMind are interconnected and complementary.

“My lab and I have worked to align our values ​​in anticipation of advances in AI capabilities. [and] “My own PhD was interested in robots that could infer human goals and make their own goals transparent to humans, and that's where my interest in this field began,” she said. . “The reason is [DeepMind CEO] with Demis Hassabis [chief AGI scientist] It was partly this research experience that made Shane Legg excited to invite me, and partly because addressing current concerns and catastrophic risks are not mutually exclusive, i.e. Mitigation is often confused with technical aspects, and my stance is that the effort contributes to long-term benefits. Improve the current and vice versa. ”

It's no exaggeration to say that Dragan's job is suitable for her.

Skepticism about GenAI tools is at an all-time high, especially when it comes to deepfakes and misinformation. in poll According to YouGov, 85% of Americans say they are very concerned or somewhat concerned about the prevalence of misleading video and audio deepfakes.another investigation From the Associated Press – Nearly 60% of adults believe AI tools will increase the amount of false and misleading information during the 2024 U.S. election cycle, according to a NORC Center for Public Affairs Research survey.

Businesses — the big names Google and its rivals hope to lure with GenAI innovations — are also wary of the technology's shortcomings and its implications.

Intel subsidiary Cnvrg.io recently conducted a survey of companies piloting or deploying GenAI apps. As a result, nearly a quarter of respondents are concerned about GenAI's compliance and privacy, reliability, high implementation costs, and lack of technical skills needed to get the most out of the tool. I understand.

In another poll from Riskonnect, a risk management software provider, more than half of executives said they were concerned about employees making decisions based on inaccurate information from GenAI apps.

Their concerns are not unreasonable. Last week, the Wall Street Journal reported that Microsoft's Copilot suite, which is powered by a GenAI model that is architecturally similar to Gemini, often makes mistakes in meeting summaries and spreadsheet formulas. The cause is hallucinations (a general term for GenAI's tendency to fabricate), and many experts believe that hallucinations cannot be completely resolved.

Dragan recognizes that the challenge of AI safety is intractable, does not promise perfect models, and said that DeepMind will invest more resources in this area in the future and will improve the safety of GenAI models. He said only that he intended to work on a framework for assessing sexual risks “in the near future.”

“The key is… [account] For human cognitive biases that remain in the data used for training, we've added better uncertainty estimation to know where the gaps are, inference time monitoring that can catch failures, and consequential decisions. A confirmation dialog and tracking will occur. [a] “The model's ability is to engage in potentially dangerous behavior,” she said. “However, there remains an open question, which is difficult to find empirically, how to be sure that the model will not malfunction a small portion of the time, which can occur during deployment. ”

I don't think customers, the general public, or regulators will be that understanding. I think it probably depends on how egregious those wrongdoings are, and who exactly is harmed by them.

“Our users should be able to experience an increasingly convenient and secure model over time,” Dragan said. surely.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

OpenAI seeks to extend human lifespans with the help of longevity startups

January 17, 2025

Farewell to the $200 million woolly mammoth and TikTok

January 17, 2025

Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

January 17, 2025

Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

January 16, 2025

Apple suspends AI notification summaries for news after generating false alerts

January 16, 2025

Nvidia releases more tools and guardrails to help enterprises adopt AI agents

January 16, 2025

Leave A Reply Cancel Reply

Top Reviews
Editors Picks

7 days left: Founders and VCs save over $300 on all stage passes

March 24, 2025

AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

March 24, 2025

20 Hottest Open Source Startups of 2024

March 22, 2025

Andrill may build a weapons factory in the UK

March 21, 2025
About Us
About Us

Welcome to Tech Brunch, your go-to destination for cutting-edge insights, news, and analysis in the fields of Artificial Intelligence (AI), Cryptocurrency, Technology, and Startups. At Tech Brunch, we are passionate about exploring the latest trends, innovations, and developments shaping the future of these dynamic industries.

Our Picks

Instagram lets you share Spotify songs with your story to your sound

June 30, 2025

At every stage of TechCrunch, Charles Hudson tells us what investors really see

June 30, 2025

Baidu’s AI Breakthrough Hasn’t Been Priced In — Why BIDU Could Hit $120 Soon

June 30, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

© 2025 TechBrunch. Designed by TechBrunch.
  • Home
  • About Tech Brunch
  • Advertise with Tech Brunch
  • Contact us
  • DMCA Notice
  • Privacy Policy
  • Terms of Use

Type above and press Enter to search. Press Esc to cancel.