Close Menu
Technology Mag

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot
    The rugged Bose Soundlink Flex is 25 percent off right now

    The rugged Bose Soundlink Flex is 25 percent off right now

    April 7, 2026
    Satechi’s 3-in-1 travel stand now wirelessly charges your phone at 25W

    Satechi’s 3-in-1 travel stand now wirelessly charges your phone at 25W

    April 7, 2026
    The case for banning cookie banners

    The case for banning cookie banners

    April 7, 2026
    Facebook X (Twitter) Instagram
    Subscribe
    Technology Mag
    Facebook X (Twitter) Instagram YouTube
    • Home
    • News
    • Business
    • Games
    • Gear
    • Reviews
    • Science
    • Security
    • Trending
    • Press Release
    Technology Mag
    Home » These Clues Hint at the True Nature of OpenAI’s Shadowy Q* Project
    Business

    These Clues Hint at the True Nature of OpenAI’s Shadowy Q* Project

    News RoomBy News RoomDecember 1, 20233 Mins Read
    Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Email
    These Clues Hint at the True Nature of OpenAI’s Shadowy Q* Project

    There are other clues to what Q* could be. The name may be an allusion to Q-learning, a form of reinforcement learning that involves an algorithm learning to solve a problem through positive or negative feedback, which has been used to create game-playing bots and to tune ChatGPT to be more helpful. Some have suggested that the name may also be related to the A* search algorithm, widely used to have a program find the optimal path to a goal.

    The Information throws another clue into the mix: “Sutskever’s breakthrough allowed OpenAI to overcome limitations on obtaining enough high-quality data to train new models,” its story says. “The research involved using computer-generated [data], rather than real-world data like text or images pulled from the internet, to train new models.” That appears to be a reference to the idea of training algorithms with so-called synthetic training data, which has emerged as a way to train more powerful AI models.

    Subbarao Kambhampati, a professor at Arizona State University who is researching the reasoning limitations of LLMs, thinks that Q* may involve using huge amounts of synthetic data, combined with reinforcement learning, to train LLMs to specific tasks such as simple arithmetic. Kambhampati notes that there is no guarantee that the approach will generalize into something that can figure out how to solve any possible math problem.

    For more speculation on what Q* might be, read this post by a machine-learning scientist who pulls together the context and clues in impressive and logical detail. The TLDR version is that Q* could be an effort to use reinforcement learning and a few other techniques to improve a large language model’s ability to solve tasks by reasoning through steps along the way. Although that might make ChatGPT better at math conundrums, it’s unclear whether it would automatically suggest AI systems could evade human control.

    That OpenAI would try to use reinforcement learning to improve LLMs seems plausible because many of the company’s early projects, like video-game-playing bots, were centered on the technique. Reinforcement learning was also central to the creation of ChatGPT, because it can be used to make LLMs produce more coherent answers by asking humans to provide feedback as they converse with a chatbot. When WIRED spoke with Demis Hassabis, the CEO of Google DeepMind, earlier this year, he hinted that the company was trying to combine ideas from reinforcement learning with advances seen in large language models.

    Rounding up the available clues about Q*, it hardly sounds like a reason to panic. But then, it all depends on your personal P(doom) value—the probability you ascribe to the possibility that AI destroys humankind. Long before ChatGPT, OpenAI’s scientists and leaders were initially so freaked out by the development of GPT-2, a 2019 text generator that now seems laughably puny, that they said it could not be released publicly. Now the company offers free access to much more powerful systems.

    OpenAI refused to comment on Q*. Perhaps we will get more details when the company decides it’s time to share more results from its efforts to make ChatGPT not just good at talking but good at reasoning too.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Email
    Previous ArticleXreal’s Latest Augmented Reality Glasses Aren’t Worth the Upgrade
    Next Article Analogue is shipping its TurboGrafx console and restocking Pockets

    Related Posts

    What Happens When Your Coworkers Are AI Agents

    What Happens When Your Coworkers Are AI Agents

    December 9, 2025
    San Francisco Mayor Daniel Lurie: ‘We Are a City on the Rise’

    San Francisco Mayor Daniel Lurie: ‘We Are a City on the Rise’

    December 9, 2025
    An AI Dark Horse Is Rewriting the Rules of Game Design

    An AI Dark Horse Is Rewriting the Rules of Game Design

    December 9, 2025
    Watch the Highlights From WIRED’s Big Interview Event Right Here

    Watch the Highlights From WIRED’s Big Interview Event Right Here

    December 9, 2025
    Amazon Has New Frontier AI Models—and a Way for Customers to Build Their Own

    Amazon Has New Frontier AI Models—and a Way for Customers to Build Their Own

    December 4, 2025
    AWS CEO Matt Garman Wants to Reassert Amazon’s Cloud Dominance in the AI Era

    AWS CEO Matt Garman Wants to Reassert Amazon’s Cloud Dominance in the AI Era

    December 4, 2025
    Our Picks
    Satechi’s 3-in-1 travel stand now wirelessly charges your phone at 25W

    Satechi’s 3-in-1 travel stand now wirelessly charges your phone at 25W

    April 7, 2026
    The case for banning cookie banners

    The case for banning cookie banners

    April 7, 2026
    Asus’ lightweight 16-inch laptop is a formidable MacBook Air alternative

    Asus’ lightweight 16-inch laptop is a formidable MacBook Air alternative

    April 7, 2026
    Apple’s sci-fi thriller Dark Matter is back in August

    Apple’s sci-fi thriller Dark Matter is back in August

    April 7, 2026
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss
    Sorry kid, drones are for war now News

    Sorry kid, drones are for war now

    By News RoomApril 7, 2026

    What happens when DJI, the world’s leading maker of drones, is no longer welcome in…

    A wild, wide foldable iPhone dummy emerges amid rumors of a delay

    A wild, wide foldable iPhone dummy emerges amid rumors of a delay

    April 7, 2026
    DJI’s Mic Mini records clear audio on the go, and it’s on sale for

    DJI’s Mic Mini records clear audio on the go, and it’s on sale for $60

    April 6, 2026
    Cisco CEO Chuck Robbins’ plan for the AI era

    Cisco CEO Chuck Robbins’ plan for the AI era

    April 6, 2026
    Facebook X (Twitter) Instagram Pinterest
    • Privacy Policy
    • Terms of use
    • Advertise
    • Contact
    © 2026 Technology Mag. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.