Close Menu
Technology Mag

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Israel-Tied Predatory Sparrow Hackers Are Waging Cyberwar on Iran’s Financial System

    June 21, 2025

    Samsung’s Galaxy Watch 7 has returned to its lowest-ever price

    June 21, 2025

    The Verge’s guide to Amazon Prime Day 2025

    June 21, 2025
    Facebook X (Twitter) Instagram
    Subscribe
    Technology Mag
    Facebook X (Twitter) Instagram YouTube
    • Home
    • News
    • Business
    • Games
    • Gear
    • Reviews
    • Science
    • Security
    • Trending
    • Press Release
    Technology Mag
    Home » OpenAI Threatens to Ban Users Who Probe Its ‘Strawberry’ AI Models
    Business

    OpenAI Threatens to Ban Users Who Probe Its ‘Strawberry’ AI Models

    News RoomBy News RoomSeptember 18, 20243 Mins Read
    Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Email

    OpenAI truly does not want you to know what its latest AI model is “thinking.” Since the company launched its “Strawberry” AI model family last week, touting so-called reasoning abilities with o1-preview and o1-mini, OpenAI has been sending out warning emails and threats of bans to any user who tries to probe how the model works.

    Unlike previous AI models from OpenAI, such as GPT-4o, the company trained o1 specifically to work through a step-by-step problem-solving process before generating an answer. When users ask an “o1” model a question in ChatGPT, users have the option of seeing this chain-of-thought process written out in the ChatGPT interface. However, by design, OpenAI hides the raw chain of thought from users, instead presenting a filtered interpretation created by a second AI model.

    Nothing is more enticing to enthusiasts than information obscured, so the race has been on among hackers and red-teamers to try to uncover o1’s raw chain of thought using jailbreaking or prompt injection techniques that attempt to trick the model into spilling its secrets. There have been early reports of some successes, but nothing has yet been strongly confirmed.

    Along the way, OpenAI is watching through the ChatGPT interface, and the company is reportedly coming down hard on any attempts to probe o1’s reasoning, even among the merely curious.

    One X user reported (confirmed by others, including Scale AI prompt engineer Riley Goodside) that they received a warning email if they used the term “reasoning trace” in conversation with o1. Others say the warning is triggered simply by asking ChatGPT about the model’s “reasoning” at all.

    The warning email from OpenAI states that specific user requests have been flagged for violating policies against circumventing safeguards or safety measures. “Please halt this activity and ensure you are using ChatGPT in accordance with our Terms of Use and our Usage Policies,” it reads. “Additional violations of this policy may result in loss of access to GPT-4o with Reasoning,” referring to an internal name for the o1 model.

    Marco Figueroa, who manages Mozilla’s GenAI bug bounty programs, was one of the first to post about the OpenAI warning email on X last Friday, complaining that it hinders his ability to do positive red-teaming safety research on the model. “I was too lost focusing on #AIRedTeaming to realized that I received this email from @OpenAI yesterday after all my jailbreaks,” he wrote. “I’m now on the get banned list!!!”

    Hidden Chains of Thought

    In a post titled “Learning to Reason With LLMs” on OpenAI’s blog, the company says that hidden chains of thought in AI models offer a unique monitoring opportunity, allowing them to “read the mind” of the model and understand its so-called thought process. Those processes are most useful to the company if they are left raw and uncensored, but that might not align with the company’s best commercial interests for several reasons.

    “For example, in the future we may wish to monitor the chain of thought for signs of manipulating the user,” the company writes. “However, for this to work the model must have freedom to express its thoughts in unaltered form, so we cannot train any policy compliance or user preferences onto the chain of thought. We also do not want to make an unaligned chain of thought directly visible to users.”

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Email
    Previous ArticleSubstack is trying to turn its writers into streamers
    Next Article 14 people have been killed by a second day of device explosions in Lebanon

    Related Posts

    A False Start on the Road to an All-American Bitcoin

    June 20, 2025

    A Deep Learning Alternative Can Help AI Agents Gameplay the Real World

    June 20, 2025

    This AI Model Never Stops Learning

    June 20, 2025

    Those Creatine Gummies You Bought Online Might Not Contain Any Creatine

    June 20, 2025

    How Private Equity Killed the American Dream

    June 20, 2025

    eBay and Vestiaire Collective Want an Exemption from Trump’s Tariffs

    June 18, 2025
    Our Picks

    Samsung’s Galaxy Watch 7 has returned to its lowest-ever price

    June 21, 2025

    The Verge’s guide to Amazon Prime Day 2025

    June 21, 2025

    Most Cheap Laptops Only Last a Few Years. The Framework Laptop 12 Could Last a Decade

    June 21, 2025

    Final Fantasy fans, now is the time to get into Magic: The Gathering

    June 21, 2025
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss
    Gear

    Gear News This Week: Adobe Wants to Make iPhone Photos Better, and TCL Brings Flexibility to Atmos

    By News RoomJune 21, 2025

    The larger JBuds Party ($70) offers 30 watts of power to make it “one of…

    The Mysterious Inner Workings of Io, Jupiter’s Volcanic Moon

    June 21, 2025

    The music industry is building the tech to hunt down AI songs

    June 21, 2025

    Meta’s Oakley Smart Glasses Have 3K Video—Watch Out, Ray-Ban

    June 21, 2025
    Facebook X (Twitter) Instagram Pinterest
    • Privacy Policy
    • Terms of use
    • Advertise
    • Contact
    © 2025 Technology Mag. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.