TechBrunch

YouTubers file class action lawsuit over OpenAI scraping creator transcripts

By TechBrunch | August 5, 2024 | 4 Mins Read

YouTube creators are filing a class action lawsuit against OpenAI, alleging that the company used millions of transcripts of YouTube videos to train a generative AI model without notifying or compensating video owners.

In a complaint filed last Friday in the U.S. District Court for the Northern District of California, lawyers for Massachusetts-based YouTube user David Millett allege that OpenAI secretly transcribed Millett's and other creators' videos to train the models that power ChatGPT, its AI-powered chatbot platform, and other generative AI tools and products. By collecting this data, OpenAI “significantly profited” from the creators' work while violating copyright law and YouTube's terms of service, which prohibit the use of videos in applications unrelated to YouTube's service, according to the complaint.

“As [OpenAI's] AI products become more sophisticated through the use of training datasets, they become more valuable to potential and current users who purchase subscriptions to access [OpenAI's] AI products,” the complaint reads, “but much of the material contained in OpenAI's training datasets comes from works that OpenAI copied without consent, credit, or compensation.”

Millett, who is represented by the law firm Bursor & Fisher, is seeking a jury trial and more than $5 million in damages on behalf of all YouTube users whose data may have been collected to train OpenAI's models.

Generative AI models like OpenAI's have no actual intelligence: you feed them a huge number of examples (movies, audio recordings, essays, etc.) and the model “learns” the likelihood of the data occurring based on patterns (including the context of the surrounding data).
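To make the idea of “learning the likelihood of the data” concrete, here is a minimal sketch of a toy bigram model in Python. It is only an illustration and not how OpenAI's models work: production systems use large neural networks, whereas this example simply counts which word follows which. The function names and the sample sentence are made up for this example.

```python
from collections import Counter, defaultdict


def train_bigram_model(tokens):
    """Count how often each token follows each preceding token."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def next_token_probability(counts, prev, nxt):
    """Estimate P(nxt | prev) from the observed counts."""
    total = sum(counts[prev].values())
    if total == 0:
        # The context word never appeared in the training data.
        return 0.0
    return counts[prev][nxt] / total


# Hypothetical toy "training data"; real models see billions of examples.
corpus = "the cat sat on the mat and the cat slept".split()
model = train_bigram_model(corpus)

# "the" is followed by "cat" twice and "mat" once in the corpus.
print(next_token_probability(model, "the", "cat"))
```

The more examples such a model sees, the better its estimates become, which is why large volumes of text such as video transcripts are valuable as training data.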

Most models are trained with data taken from public websites or web datasets. Companies argue that fair use protects their efforts to indiscriminately collect data and use it to train commercial models. But many copyright holders disagree and have filed lawsuits to block the practice.

Video transcripts have become an important component of training data as other data sources become scarce.

According to data from Originality.AI, more than 35% of the world's top 1,000 websites currently block OpenAI's web crawlers. And a study by MIT's Data Provenance Initiative found that roughly 25% of “high-quality” sources of data are restricted from major datasets used to train AI models. If current trends in access blocking continue, research group Epoch AI predicts that developers will run out of data to train generative AI models between 2026 and 2032.
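The article does not say how sites block crawlers. One common mechanism, assumed here, is a robots.txt rule aimed at a crawler's user agent. The sketch below uses Python's standard `urllib.robotparser` to check such a rule. The robots.txt content and the example.com URL are hypothetical. `GPTBot` is the user-agent name OpenAI has published for its crawler, and `SomeOtherBot` is a made-up name for comparison.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt that blocks one crawler and allows everyone else.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if the robots.txt rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


page = "https://example.com/articles/some-page"
print(is_allowed(ROBOTS_TXT, "GPTBot", page))
print(is_allowed(ROBOTS_TXT, "SomeOtherBot", page))
```

Note that robots.txt is advisory: it only has an effect if the crawler chooses to honor it.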

The New York Times reported in April that OpenAI created its first speech recognition model, Whisper, to transcribe video audio and gather additional training data. According to the Times, the OpenAI team, including OpenAI president Greg Brockman, used Whisper to transcribe more than 1 million hours of video from YouTube and then used the transcripts to train OpenAI's text generation and analysis model, GPT-4.

According to the Times, some OpenAI staffers discussed how such a move could violate YouTube's rules.

In July, Proof News reported that companies including Anthropic, Apple, Salesforce, and Nvidia had trained generative AI models using a dataset called The Pile, which contains subtitles from hundreds of thousands of YouTube videos. Many YouTube creators whose subtitles were included in The Pile were unaware of this and had not consented to it. Apple later issued a statement saying it had no intention of using these models for AI features in its products.

YouTube's parent company, Google, is also looking to use transcripts to train models.

Last year, Google expanded its Terms of Service (ToS) to allow it to use more user data to train its generative AI models. The old ToS left it unclear whether Google could use YouTube data to build products outside of its video platform. The new ToS is considerably more permissive.

We have reached out to OpenAI and Google for comment on the class action lawsuit and will update this article if we hear back.

It's been a rough start to the month for OpenAI.

Tesla and X CEO Elon Musk filed a new lawsuit against OpenAI and CEO Sam Altman on Monday, accusing the company of abandoning its non-profit mission by reserving some of its most sophisticated technology for commercial customers. Musk made the same claims in a lawsuit he filed against OpenAI in February, and the new suit additionally alleges that OpenAI has engaged in fraudulent practices.
