Close Menu
Technology Mag

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    No, You Probably Don’t Need a MacBook Pro

    July 1, 2025

    Sharp pencils for hard times

    July 1, 2025

    How Do Pimple Patches Work? Here’s Everything You Need to Know

    July 1, 2025
    Facebook X (Twitter) Instagram
    Subscribe
    Technology Mag
    Facebook X (Twitter) Instagram YouTube
    • Home
    • News
    • Business
    • Games
    • Gear
    • Reviews
    • Science
    • Security
    • Trending
    • Press Release
    Technology Mag
    Home » OpenAI Upgrades Its Smartest AI Model With Improved Reasoning Skills
    Business

    OpenAI Upgrades Its Smartest AI Model With Improved Reasoning Skills

    News RoomBy News RoomDecember 23, 20242 Mins Read
    Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Email

    OpenAI today announced an improved version of its most capable artificial intelligence model to date—one that takes even more time to deliberate over questions—just a day after Google announced its first model of this type.

    OpenAI’s new model, called o3, replaces o1, which the company introduced in September. Like o1, the new model spends time ruminating over a problem in order to deliver better answers to questions that require step-by-step logical reasoning. (OpenAI chose to skip the “o2” moniker because it’s already the name of a mobile carrier in the UK.)

    “We view this as the beginning of the next phase of AI,” said OpenAI CEO Sam Altman on a livestream Friday. “Where you can use these models to do increasingly complex tasks that require a lot of reasoning.”

    The o3 model scores much higher on several measures than its predecessor, OpenAI says, including ones that measure complex coding-related skills and advanced math and science competency. It is three times better than o1 at answering questions posed by ARC-AGI, a benchmark designed to test an AI models’ ability to reason over extremely difficult mathematical and logic problems they’re encountering for the first time.

    Google is pursuing a similar line of research. Noam Shazeer, a Google researcher, yesterday revealed in a post on X that the company has developed its own reasoning model, called Gemini 2.0 Flash Thinking. Google’s CEO, Sundar Pichai, called it “our most thoughtful model yet” in his own post. Google’s new model achieved a high score on SWE-Bench, a test that measures a models’ agentic abilities.

    However, OpenAI’s new o3 model is 20 percent better than o1. “o3 blew it out of the water,” says Ofir Press, a post-doctoral researcher at Princeton University who helped develop SWE-Bench. “Very surprising increase, not sure how they did it.”

    The two dueling models show competition between OpenAI and Google to be fiercer than ever. It is crucial for OpenAI to demonstrate that it can keep making advances as it seeks to attract more investment and build a profitable business. Google is meanwhile desperate to show that it remains at the forefront of AI research.

    The new models also show how AI companies are increasingly looking beyond simply scaling up AI models in order to wring greater intelligence out of them.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Email
    Previous ArticleAll of Canoo’s employees are reportedly on a ‘mandatory unpaid break’
    Next Article Congress Again Fails to Limit Scope of Spy Powers in New Defense Bill

    Related Posts

    Here Is Everyone Mark Zuckerberg Has Hired So Far for Meta’s ‘Superintelligence’ Team

    July 1, 2025

    Microsoft Says Its New AI System Diagnosed Patients 4 Times More Accurately Than Human Doctors

    July 1, 2025

    OpenAI Leadership Responds to Meta Offers: ‘Someone Has Broken Into Our Home’

    June 30, 2025

    OpenAI Loses 4 Key Researchers to Meta

    June 30, 2025

    OpenAI’s Unreleased AGI Paper Could Complicate Microsoft Negotiations

    June 30, 2025

    Substack Is Having a Moment—Again. But Time Is Running Out

    June 29, 2025
    Our Picks

    Sharp pencils for hard times

    July 1, 2025

    How Do Pimple Patches Work? Here’s Everything You Need to Know

    July 1, 2025

    Newark’s air traffic outages were just the tip of the iceberg

    July 1, 2025

    Here Is Everyone Mark Zuckerberg Has Hired So Far for Meta’s ‘Superintelligence’ Team

    July 1, 2025
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss
    Games

    These are 10 Best Nintendo Switch 2 Accessories We’ve Tried

    By News RoomJuly 1, 2025

    The Switch 2 isn’t like other consoles. If you were meant to take your Xbox…

    Google kills its Keep app on Apple Watch

    July 1, 2025

    How to Use Markdown

    July 1, 2025

    The Lenovo Chromebook Plus 14 Is the Best Chromebook Ever Made

    July 1, 2025
    Facebook X (Twitter) Instagram Pinterest
    • Privacy Policy
    • Terms of use
    • Advertise
    • Contact
    © 2025 Technology Mag. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.