Close Menu
Technology Mag

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Anthropic and OpenAI make moves against popular AI apps

    June 6, 2025

    Nintendo Switch 2 webcam compatibility: it’s a wild west

    June 6, 2025

    I Sampled All the Best Mushroom Gummies—Here’s What I Found

    June 6, 2025
    Facebook X (Twitter) Instagram
    Subscribe
    Technology Mag
    Facebook X (Twitter) Instagram YouTube
    • Home
    • News
    • Business
    • Games
    • Gear
    • Reviews
    • Science
    • Security
    • Trending
    • Press Release
    Technology Mag
    Home » Apple, Nvidia, Anthropic Used Thousands of Swiped YouTube Videos to Train AI
    Business

    Apple, Nvidia, Anthropic Used Thousands of Swiped YouTube Videos to Train AI

    News RoomBy News RoomJuly 17, 20244 Mins Read
    Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Email

    In response to the suits, defendants such as Meta, OpenAI, and Bloomberg have argued that their actions constitute fair use. A case against EleutherAI, which originally scraped the books and made them public, was voluntarily dismissed by the plaintiffs.

    Litigation in remaining cases remains in the early stages, leaving the questions surrounding permission and payment unresolved. The Pile has since been removed from its official download site, but it’s still available on file-sharing services.

    “Technology companies have run roughshod,” said Amy Keller, a consumer protection attorney and partner at the firm DiCello Levitt who has brought lawsuits on behalf of creatives whose work was allegedly scooped up by AI firms without their consent.

    “People are concerned about the fact that they didn’t have a choice in the matter,” Keller said. “I think that’s what’s really problematic.”

    Parroting a Parrot

    Many creators feel uncertain about the path ahead.

    Full-time YouTubers patrol for unauthorized use of their work, regularly filing takedown notices, and some worry it’s only a matter of time before AI can generate content similar to what they make—if not produce outright copycats.

    Pakman, the creator of The David Pakman Show, saw the power of AI recently while scrolling on TikTok. He came across a video that was labeled as a Tucker Carlson clip, but when Pakman watched it, he was taken aback. It sounded like Carlson but was, word for word, what Pakman had said on his YouTube show, down to the cadence. He was equally alarmed that only one of the video’s commenters seemed to recognize that it was fake—a voice clone of Carlson reading Pakman’s script.

    “This is going to be a problem,” Pakman said in a YouTube video he made about the fake. “You can do this essentially with anybody.”

    EleutherAI cofounder Sid Black wrote on GitHub that he created YouTube Subtitles by using a script. That script downloads the subtitles from YouTube’s API in the same way a YouTube viewer’s browser downloads them when watching a video. According to documentation on GitHub, Black used 495 search terms to cull videos, including “funny vloggers,” “Einstein,” “black protestant,” “Protective Social Services,” “infowars,” “quantum chromodynamics,” “Ben Shapiro,” “Uighurs,” “fruitarian,” “cake recipe,” ”Nazca lines,” and “flat earth.”

    Though YouTube’s terms of service prohibit accessing its videos by “automated means,” more than 2,000 GitHub users have bookmarked or endorsed the code.

    “There are many ways in which YouTube could prevent this module from working if that was what they are after,” wrote machine learning engineer Jonas Depoix in a discussion on GitHub, where he published the code Black used to access YouTube subtitles. “This hasn’t happened so far.”

    In an email to Proof News, Depoix said he hasn’t used the code since he wrote it as a university student for a project several years ago and was surprised people found it useful. He declined to answer questions about YouTube’s rules.

    Google spokesperson Jack Malon said in an email response to a request for comment that the company has taken “action over the years to prevent abusive, unauthorized scraping.” He did not respond to questions about other companies’ use of the material as training data.

    Among the videos used by AI companies are 146 from Einstein Parrot, a channel with nearly 150,000 subscribers. The African grey’s caretaker, Marcia, who didn’t want to use her last name for fear of endangering the famous bird’s safety, said at first she thought it was funny to learn AI models had ingested words of a mimicking parrot.

    “Who would want to use a parrot’s voice?” Marcia said. “But then, I know that he speaks very well. He speaks in my voice. So he’s parroting me, and then AI is parroting the parrot.”

    Once ingested by AI, data cannot be unlearned. Marcia was troubled by all the unknown ways in which her bird’s information could be used, including creating a digital duplicate parrot and, she worried, making it curse.

    “We’re treading on uncharted territory,” Marcia said.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Email
    Previous ArticleCool It With The Prime Day Air Conditioners and Fans
    Next Article Mercedes-Benz’s 400kW EV chargers are coming to Starbucks

    Related Posts

    Elon Musk’s Feud With President Trump Wipes $152 Billion Off Tesla’s Market Cap

    June 6, 2025

    Palantir Is Going on Defense

    June 6, 2025

    At Bitcoin 2025, Crypto Purists and the MAGA Faithful Collide

    June 5, 2025

    Trumpworld Is Fighting Over ‘Official’ Crypto Wallet

    June 5, 2025

    Perplexity’s CEO Sees AI Agents as the Next Web Battleground

    June 5, 2025

    Facing a Changing Industry, AI Activists Rethink Their Strategy

    June 5, 2025
    Our Picks

    Nintendo Switch 2 webcam compatibility: it’s a wild west

    June 6, 2025

    I Sampled All the Best Mushroom Gummies—Here’s What I Found

    June 6, 2025

    The Ray-Ban Meta smart glasses are on sale for their best price to date

    June 6, 2025

    Google Gemini can now handle scheduled tasks like an assistant

    June 6, 2025
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss
    Business

    Elon Musk’s Feud With President Trump Wipes $152 Billion Off Tesla’s Market Cap

    By News RoomJune 6, 2025

    It took only a few hours to wipe $152 billion of value from Tesla’s market…

    iFixit says the Switch 2 is even harder to repair than the original

    June 6, 2025

    Here are the biggest Nintendo Switch 2 launch games you can buy

    June 6, 2025

    Apple could show off revamped Phone, Safari, and Camera apps next week

    June 6, 2025
    Facebook X (Twitter) Instagram Pinterest
    • Privacy Policy
    • Terms of use
    • Advertise
    • Contact
    © 2025 Technology Mag. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.