Close Menu
Technology Mag

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot
    Mozilla announces an AI ‘window’ for Firefox

    Mozilla announces an AI ‘window’ for Firefox

    November 13, 2025
    Who is buying VR and XR headsets anyway?

    Who is buying VR and XR headsets anyway?

    November 13, 2025
    Starlink rival ‘Project Kuiper’ rebrands to Amazon Leo

    Starlink rival ‘Project Kuiper’ rebrands to Amazon Leo

    November 13, 2025
    Facebook X (Twitter) Instagram
    Subscribe
    Technology Mag
    Facebook X (Twitter) Instagram YouTube
    • Home
    • News
    • Business
    • Games
    • Gear
    • Reviews
    • Science
    • Security
    • Trending
    • Press Release
    Technology Mag
    Home » The Words That Give Away Generative AI Text
    Business

    The Words That Give Away Generative AI Text

    News RoomBy News RoomJuly 10, 20244 Mins Read
    Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Email
    The Words That Give Away Generative AI Text

    Thus far, even AI companies have had trouble coming up with tools that can reliably detect when a piece of writing was generated using a large language model. Now, a group of researchers has established a novel method for estimating LLM usage across a large set of scientific writing by measuring which “excess words” started showing up much more frequently during the LLM era (i.e., 2023 and 2024). The results “suggest that at least 10 percent of 2024 abstracts were processed with LLMs,” according to the researchers.

    In a preprint paper posted earlier this month, four researchers from Germany’s University of Tübingen and Northwestern University said they were inspired by studies that measured the impact of the Covid-19 pandemic by looking at excess deaths compared to the recent past. By taking a similar look at “excess word usage” after LLM writing tools became widely available in late 2022, the researchers found that “the appearance of LLMs led to an abrupt increase in the frequency of certain style words” that was “unprecedented in both quality and quantity.”

    Delving In

    To measure these vocabulary changes, the researchers analyzed 14 million paper abstracts published on PubMed between 2010 and 2024, tracking the relative frequency of each word as it appeared across each year. They then compared the expected frequency of those words (based on the pre-2023 trend line) to the actual frequency of those words in abstracts from 2023 and 2024, when LLMs were in widespread use.

    The results found a number of words that were extremely uncommon in these scientific abstracts before 2023 that suddenly surged in popularity after LLMs were introduced. The word “delves,” for instance, shows up in 25 times as many 2024 papers as the pre-LLM trend would expect; words like “showcasing” and “underscores” increased in usage by nine times as well. Other previously common words became notably more common in post-LLM abstracts: The frequency of “potential” increased by 4.1 percentage points, “findings” by 2.7 percentage points, and “crucial” by 2.6 percentage points, for instance.

    These kinds of changes in word use could happen independently of LLM usage, of course—the natural evolution of language means words sometimes go in and out of style. However, the researchers found that, in the pre-LLM era, such massive and sudden year-over-year increases were only seen for words related to major world health events: “ebola” in 2015; “zika” in 2017; and words like “coronavirus,” “lockdown,” and “pandemic” in the 2020 to 2022 period.

    In the post-LLM period, though, the researchers found hundreds of words with sudden, pronounced increases in scientific usage that had no common link to world events. In fact, while the excess words during the Covid pandemic were overwhelmingly nouns, the researchers found that the words with a post-LLM frequency bump were overwhelmingly “style words” like verbs, adjectives, and adverbs (a small sampling: “across, additionally, comprehensive, crucial, enhancing, exhibited, insights, notably, particularly, within”).

    This isn’t a completely new finding—the increased prevalence of “delve” in scientific papers has been widely noted in the recent past, for instance. But previous studies generally relied on comparisons with “ground truth” human writing samples or lists of predefined LLM markers obtained from outside the study. Here, the pre-2023 set of abstracts acts as its own effective control group to show how vocabulary choice has changed overall in the post-LLM era.

    An Intricate Interplay

    By highlighting hundreds of so-called “marker words” that became significantly more common in the post-LLM era, the telltale signs of LLM use can sometimes be easy to pick out. Take this example abstract line called out by the researchers, with the marker words highlighted: “A comprehensive grasp of the intricate interplay between […] and […] is pivotal for effective therapeutic strategies.”

    After doing some statistical measures of marker word appearance across individual papers, the researchers estimate that at least 10 percent of the post-2022 papers in the PubMed corpus were written with at least some LLM assistance. The number could be even higher, the researchers say, because their set could be missing LLM-assisted abstracts that don’t include any of the marker words they identified.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Email
    Previous ArticleIt’s Hard Not to Like Motorola’s Flip-Folding Razr+
    Next Article Plastic bins: better than boxes

    Related Posts

    Meet the Chinese Startup Using AI—and a Team of Human Workers—to Train Robots

    Meet the Chinese Startup Using AI—and a Team of Human Workers—to Train Robots

    November 13, 2025
    OpenAI Signs  Billion Deal With Amazon

    OpenAI Signs $38 Billion Deal With Amazon

    November 12, 2025
    TikTok Shop Is Now the Size of eBay

    TikTok Shop Is Now the Size of eBay

    November 10, 2025
    WIRED Roundup: Alpha School, Grokipedia, and Real Estate AI Videos

    WIRED Roundup: Alpha School, Grokipedia, and Real Estate AI Videos

    November 6, 2025
    WIRED Roundup: AI Psychosis, Missing FTC Files, and Google Bedbugs

    WIRED Roundup: AI Psychosis, Missing FTC Files, and Google Bedbugs

    November 6, 2025
    AI Agents Are Terrible Freelance Workers

    AI Agents Are Terrible Freelance Workers

    November 5, 2025
    Our Picks
    Who is buying VR and XR headsets anyway?

    Who is buying VR and XR headsets anyway?

    November 13, 2025
    Starlink rival ‘Project Kuiper’ rebrands to Amazon Leo

    Starlink rival ‘Project Kuiper’ rebrands to Amazon Leo

    November 13, 2025
    Apple TV is getting MLS games at no extra cost

    Apple TV is getting MLS games at no extra cost

    November 13, 2025
    Hackers use Anthropic’s AI model Claude once again

    Hackers use Anthropic’s AI model Claude once again

    November 13, 2025
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss
    Valve wants Half-Life: Alyx to work well standalone on Steam Frame News

    Valve wants Half-Life: Alyx to work well standalone on Steam Frame

    By News RoomNovember 13, 2025

    When I tried Half-Life: Alyx streaming from a PC to Valve’s new Steam Frame VR…

    Apple will take a mini commission from mini app developers

    Apple will take a mini commission from mini app developers

    November 13, 2025
    The last-gen Apple Watch Series 10 has returned to its lowest price to date

    The last-gen Apple Watch Series 10 has returned to its lowest price to date

    November 13, 2025
    The Fire TV Stick 4K Max is back down to , its best price in a year

    The Fire TV Stick 4K Max is back down to $35, its best price in a year

    November 13, 2025
    Facebook X (Twitter) Instagram Pinterest
    • Privacy Policy
    • Terms of use
    • Advertise
    • Contact
    © 2025 Technology Mag. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.