Technology Mag

    Wikipedia is giving AI developers its data to fend off bot scrapers

    By News Room, April 17, 2025 (1 Min Read)

    Wikimedia says the dataset hosted by Kaggle has been “designed with machine learning workflows in mind,” making it easier for AI developers to access machine-readable article data for modeling, fine-tuning, benchmarking, alignment, and analysis. The content in the dataset is openly licensed and, as of April 15th, includes research summaries, short descriptions, image links, infobox data, and article sections. It excludes references and non-written elements such as audio files.

    “As the place the machine learning community comes for tools and tests, Kaggle is extremely excited to be the host for the Wikimedia Foundation’s data,” said Kaggle partnerships lead Brenda Flynn. “Kaggle is excited to play a role in keeping this data accessible, available, and useful.”
