TechBrunch

YouTubers file class action lawsuit over OpenAI scraping creator transcripts

By TechBrunch | August 5, 2024 | 4 Mins Read


YouTube creators are suing OpenAI in a class action, alleging that the company trained its generative AI models on millions of YouTube video transcripts without notifying or compensating the videos' owners.

In a complaint filed last Friday in the U.S. District Court for the Northern District of California, lawyers for Massachusetts-based YouTube user David Millett allege that OpenAI secretly transcribed videos by Millett and other creators to train the models that power ChatGPT, its AI chatbot platform, and its other generative AI tools and products. According to the complaint, OpenAI “significantly profited” from the creators' work by collecting this data, while violating copyright law and YouTube's terms of service, which prohibit the use of videos in apps unrelated to YouTube's service.

“As [OpenAI’s] AI products become more sophisticated through the use of training datasets, they become more valuable to potential and current users who purchase subscriptions to access [OpenAI’s] AI products,” the complaint reads, “but much of the material contained in OpenAI's training datasets comes from works that OpenAI copied without consent, credit, or compensation.”

Millett, who is represented by the law firm Bursor & Fisher, is seeking a jury trial and more than $5 million in damages on behalf of all YouTube users whose data may have been collected to train OpenAI's models.

Generative AI models like OpenAI's have no actual intelligence. Fed a huge number of examples (movies, audio recordings, essays, and so on), a model “learns” how likely data is to occur based on patterns, including the context of any surrounding data.

Most models are trained with data taken from public websites or web datasets. Companies argue that fair use protects their efforts to indiscriminately collect data and use it to train commercial models. But many copyright holders disagree and have filed lawsuits to block the practice.

Video transcripts appear to have become an important source of training data as other data sources become scarce.

According to data from Originality.AI, more than 35% of the world's top 1,000 websites currently block OpenAI's web crawlers. And a study by MIT's Data Provenance Initiative found that roughly 25% of “high-quality” sources of data are restricted from major datasets used to train AI models. If current trends in access blocking continue, research group Epoch AI predicts that developers will run out of data to train generative AI models between 2026 and 2032.

The New York Times reported in April that OpenAI created its first speech recognition model, Whisper, to transcribe video audio and gather additional training data. According to the Times, the OpenAI team, including OpenAI president Greg Brockman, used Whisper to transcribe more than 1 million hours of video from YouTube and then used the transcripts to train OpenAI's text generation and analysis model, GPT-4.

According to the Times, some OpenAI staffers discussed how such a move could violate YouTube's rules.

In July, Proof News reported that companies including Anthropic, Apple, Salesforce, and Nvidia had trained generative AI models using a dataset called The Pile, which contains subtitles from hundreds of thousands of YouTube videos. Many YouTube creators whose subtitles were included in The Pile were unaware of this and had not consented to it. Apple later issued a statement saying it had no intention of using these models for AI features in its products.

YouTube's parent company, Google, is also looking to use transcripts to train models.

Last year, Google expanded its Terms of Service (ToS) to allow it to use more user data to train its generative AI models. The old ToS left it unclear whether Google could use YouTube data to build products outside of its video platform. The new ToS is considerably more permissive.

We have reached out to OpenAI and Google for comment on the class action lawsuit and will update this article if we hear back.

It's been a rough start to the month for OpenAI.

Tesla and X CEO Elon Musk filed a new lawsuit against OpenAI and CEO Sam Altman on Monday, accusing the company of abandoning its non-profit mission by reserving some of its most sophisticated technology for commercial customers. Musk made the same claims in a lawsuit he filed against OpenAI in February, but the new suit also alleges that OpenAI has engaged in fraudulent practices.



© 2025 TechBrunch. Designed by TechBrunch.