Close Menu
TechBrunchTechBrunch
  • Home
  • AI
  • Apps
  • Crypto
  • Security
  • Startups
  • TechCrunch
  • Venture

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

The court denied requests to suspend awards regarding Apple's App Store payment fees

June 6, 2025

Circle IPOs are giving hope to more startups waiting to be published to more startups

June 5, 2025

Perplexity received 780 million questions last month, the CEO says

June 5, 2025
Facebook X (Twitter) Instagram
TechBrunchTechBrunch
  • Home
  • AI

    OpenAI seeks to extend human lifespans with the help of longevity startups

    January 17, 2025

    Farewell to the $200 million woolly mammoth and TikTok

    January 17, 2025

    Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

    January 17, 2025

    Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

    January 16, 2025

    Apple suspends AI notification summaries for news after generating false alerts

    January 16, 2025
  • Apps

    The court denied requests to suspend awards regarding Apple's App Store payment fees

    June 6, 2025

    Perplexity received 780 million questions last month, the CEO says

    June 5, 2025

    Bonfire's new software allows users to build their own social communities free from platform control

    June 5, 2025

    x Test to highlight posts that users with dissent

    June 5, 2025

    Google says the updated Gemini 2.5 Pro AI model is excellent at coding

    June 5, 2025
  • Crypto

    Circle IPOs are giving hope to more startups waiting to be published to more startups

    June 5, 2025

    GameStop bought $500 million in Bitcoin

    May 28, 2025

    Vote for the session you want to watch in 2025

    May 26, 2025

    Save $900 + 90% from 2 tickets to destroy 2025 in the last 24 hours

    May 25, 2025

    Only 3 days left to save up to $900 to destroy the 2025 pass

    May 23, 2025
  • Security

    Humanity unveils custom AI models for US national security customers

    June 5, 2025

    Unlock phone company Cellebrite to acquire mobile testing startup Corellium for $170 million

    June 5, 2025

    Ransomware Gangs claim responsibility for Kettering Health Hack

    June 4, 2025

    Former CTO of CrowdStrike's cyber-rivals and how automation can undermine security for early-stage startups

    June 4, 2025

    Data breaches at newspaper giant Lee Enterprises impact 40,000 people

    June 4, 2025
  • Startups

    7 days left: Founders and VCs save over $300 on all stage passes

    March 24, 2025

    AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

    March 24, 2025

    20 Hottest Open Source Startups of 2024

    March 22, 2025

    Andrill may build a weapons factory in the UK

    March 21, 2025

    Startup Weekly: Wiz bets paid off at M&A Rich Week

    March 21, 2025
  • TechCrunch

    OpenSea takes a long-term view with a focus on UX despite NFT sales remaining low

    February 8, 2024

    AI will save software companies' growth dreams

    February 8, 2024

    B2B and B2C are not about who buys, but how you sell

    February 5, 2024

    It's time for venture capital to break away from fast fashion

    February 3, 2024

    a16z's Chris Dixon believes it's time to focus on blockchain use cases rather than speculation

    February 2, 2024
  • Venture

    Less than 48 hours left until display at TC at all stages

    June 5, 2025

    TC Session: AI will be on sale today at Berkeley

    June 5, 2025

    North America accounts for the majority of AI VC investment despite the harsh political environment

    June 5, 2025

    3 days left: Charge all your locations in stages on TC Expo Floor

    June 4, 2025

    From $5 to Financial Empowerment: Why Stash co-founder Brandon Krieg is a must-see for TechCrunch All Stage 2025

    June 4, 2025
TechBrunchTechBrunch

Google DeepMind launches new organization focused on AI safety

TechBrunchBy TechBrunchFebruary 21, 20246 Mins Read
Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
Share
Facebook Twitter LinkedIn Pinterest Telegram Email


If you ask Gemini, Google's flagship GenAI model, to write deceptive content about the upcoming US presidential election, it will do so, given the right prompts. Ask about future Super Bowl games and play-by-play is invented. Or if you ask about the implosion of the Titan submarine, you'll be given disinformation with quotes that seem convincing but aren't true.

Needless to say, this is a bad look for Google and has drawn the ire of policymakers who are frustrated that GenAI tools can be easily used to disinformation and mislead the public.

So Google has cut thousands of jobs from last quarter and focused its investments on AI safety. At least, that's the official story.

This morning, Gemini and Google DeepMind, the AI ​​research and development arm behind many of Google's recent GenAI projects, announced the creation of a new organization called AI Safety and Alignment. It is made up of existing teams working on AI safety, but has expanded to include new organizations as well. A specialized cohort of GenAI researchers and engineers.

Beyond the job listings posted on DeepMind's site, Google has not disclosed how many people the new organization will hire. However, it has been revealed that AI Safety and Alignment will include a new team focused on safety around artificial general intelligence (AGI), or virtual systems that can perform any task a human can do.

With a similar mission to Super Alignment rival OpenAI, which was founded last July, the new team within AI Safety and Alignment will build on DeepMind's existing AI safety-focused research team in London. We work together with Scalable Alignment. The team is also seeking solutions to control technology challenges. Superintelligent AI that has not yet been realized.

Why are two groups working on the same problem? A fair question, but also a speculative one given Google's current reluctance to reveal details. However, it seems worth noting that the new team (within AI Safety and Alignment) is located in the US rather than across the pond, closer to Google's headquarters. During this period, the company is actively moving to keep pace with its AI rivals while planning responsible projects. , a measured approach to AI.

Other teams in the AI ​​Safety and Alignment organization are responsible for developing and incorporating specific safety measures into Google's Gemini models, both current and in development. Safety covers a wide range of areas. But some of the organization's short-term focuses will be on preventing inappropriate medical advice, keeping children safe and “preventing the spread of bigotry and other injustices.”

Anka Dragan, a former Waymo staff research scientist and computer science professor at the University of California, Berkeley, will lead the team.

“Our work [at the AI Safety and Alignment organization] “We aim to enable models to better and more reliably understand human preferences and values,” Dragan told TechCrunch via email. It is about countering hostile attacks and accounting for the pluralism and dynamic nature of human values ​​and perspectives. ”

Dragan's consulting work with Waymo on AI safety systems may raise some eyebrows given the Google self-driving car venture's recent shaky driving performance.

So is her decision to split her time between DeepMind and the University of California, Berkeley, where she leads a lab focused on algorithms for human-AI and robot interaction. Issues as serious as AGI safety, as well as the long-term risks AI safety and coalitions are looking to study, such as preventing AI from “supporting terrorism” or “destabilizing society,” include: Some may think it requires full-time attention.

But Dragan maintains that the research at UC Berkeley's lab and DeepMind are interconnected and complementary.

“My lab and I have worked to align our values ​​in anticipation of advances in AI capabilities. [and] “My own PhD was interested in robots that could infer human goals and make their own goals transparent to humans, and that's where my interest in this field began,” she said. . “The reason is [DeepMind CEO] with Demis Hassabis [chief AGI scientist] It was partly this research experience that made Shane Legg excited to invite me, and partly because addressing current concerns and catastrophic risks are not mutually exclusive, i.e. Mitigation is often confused with technical aspects, and my stance is that the effort contributes to long-term benefits. Improve the current and vice versa. ”

It's no exaggeration to say that Dragan's job is suitable for her.

Skepticism about GenAI tools is at an all-time high, especially when it comes to deepfakes and misinformation. in poll According to YouGov, 85% of Americans say they are very concerned or somewhat concerned about the prevalence of misleading video and audio deepfakes.another investigation From the Associated Press – Nearly 60% of adults believe AI tools will increase the amount of false and misleading information during the 2024 U.S. election cycle, according to a NORC Center for Public Affairs Research survey.

Businesses — the big names Google and its rivals hope to lure with GenAI innovations — are also wary of the technology's shortcomings and its implications.

Intel subsidiary Cnvrg.io recently conducted a survey of companies piloting or deploying GenAI apps. As a result, nearly a quarter of respondents are concerned about GenAI's compliance and privacy, reliability, high implementation costs, and lack of technical skills needed to get the most out of the tool. I understand.

In another poll from Riskonnect, a risk management software provider, more than half of executives said they were concerned about employees making decisions based on inaccurate information from GenAI apps.

Their concerns are not unreasonable. Last week, the Wall Street Journal reported that Microsoft's Copilot suite, which is powered by a GenAI model that is architecturally similar to Gemini, often makes mistakes in meeting summaries and spreadsheet formulas. The cause is hallucinations (a general term for GenAI's tendency to fabricate), and many experts believe that hallucinations cannot be completely resolved.

Dragan recognizes that the challenge of AI safety is intractable, does not promise perfect models, and said that DeepMind will invest more resources in this area in the future and will improve the safety of GenAI models. He said only that he intended to work on a framework for assessing sexual risks “in the near future.”

“The key is… [account] For human cognitive biases that remain in the data used for training, we've added better uncertainty estimation to know where the gaps are, inference time monitoring that can catch failures, and consequential decisions. A confirmation dialog and tracking will occur. [a] “The model's ability is to engage in potentially dangerous behavior,” she said. “However, there remains an open question, which is difficult to find empirically, how to be sure that the model will not malfunction a small portion of the time, which can occur during deployment. ”

I don't think customers, the general public, or regulators will be that understanding. I think it probably depends on how egregious those wrongdoings are, and who exactly is harmed by them.

“Our users should be able to experience an increasingly convenient and secure model over time,” Dragan said. surely.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

OpenAI seeks to extend human lifespans with the help of longevity startups

January 17, 2025

Farewell to the $200 million woolly mammoth and TikTok

January 17, 2025

Nord Security founder launches Nexos.ai to help enterprises move AI projects from pilot to production

January 17, 2025

Data proves it remains difficult for startups to raise capital, even though VCs invested $75 billion in the fourth quarter

January 16, 2025

Apple suspends AI notification summaries for news after generating false alerts

January 16, 2025

Nvidia releases more tools and guardrails to help enterprises adopt AI agents

January 16, 2025

Leave A Reply Cancel Reply

Top Reviews
Editors Picks

7 days left: Founders and VCs save over $300 on all stage passes

March 24, 2025

AI chip startup Furiosaai reportedly rejecting $800 million acquisition offer from Meta

March 24, 2025

20 Hottest Open Source Startups of 2024

March 22, 2025

Andrill may build a weapons factory in the UK

March 21, 2025
About Us
About Us

Welcome to Tech Brunch, your go-to destination for cutting-edge insights, news, and analysis in the fields of Artificial Intelligence (AI), Cryptocurrency, Technology, and Startups. At Tech Brunch, we are passionate about exploring the latest trends, innovations, and developments shaping the future of these dynamic industries.

Our Picks

The court denied requests to suspend awards regarding Apple's App Store payment fees

June 6, 2025

Circle IPOs are giving hope to more startups waiting to be published to more startups

June 5, 2025

Perplexity received 780 million questions last month, the CEO says

June 5, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

© 2025 TechBrunch. Designed by TechBrunch.
  • Home
  • About Tech Brunch
  • Advertise with Tech Brunch
  • Contact us
  • DMCA Notice
  • Privacy Policy
  • Terms of Use

Type above and press Enter to search. Press Esc to cancel.