Close Menu
Technology Mag

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    The Steve Jobs Archive shares stories, videos, and notes of his famous commencement speech

    June 12, 2025

    It could be 2026 before all your Thread border routers work together

    June 12, 2025

    Trump’s protest threats raise surveillance alarms around his military parade

    June 12, 2025
    Facebook X (Twitter) Instagram
    Subscribe
    Technology Mag
    Facebook X (Twitter) Instagram YouTube
    • Home
    • News
    • Business
    • Games
    • Gear
    • Reviews
    • Science
    • Security
    • Trending
    • Press Release
    Technology Mag
    Home » OpenAI Upgrades Its Smartest AI Model With Improved Reasoning Skills
    Business

    OpenAI Upgrades Its Smartest AI Model With Improved Reasoning Skills

    News RoomBy News RoomDecember 23, 20242 Mins Read
    Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Email

    OpenAI today announced an improved version of its most capable artificial intelligence model to date—one that takes even more time to deliberate over questions—just a day after Google announced its first model of this type.

    OpenAI’s new model, called o3, replaces o1, which the company introduced in September. Like o1, the new model spends time ruminating over a problem in order to deliver better answers to questions that require step-by-step logical reasoning. (OpenAI chose to skip the “o2” moniker because it’s already the name of a mobile carrier in the UK.)

    “We view this as the beginning of the next phase of AI,” said OpenAI CEO Sam Altman on a livestream Friday. “Where you can use these models to do increasingly complex tasks that require a lot of reasoning.”

    The o3 model scores much higher on several measures than its predecessor, OpenAI says, including ones that measure complex coding-related skills and advanced math and science competency. It is three times better than o1 at answering questions posed by ARC-AGI, a benchmark designed to test an AI models’ ability to reason over extremely difficult mathematical and logic problems they’re encountering for the first time.

    Google is pursuing a similar line of research. Noam Shazeer, a Google researcher, yesterday revealed in a post on X that the company has developed its own reasoning model, called Gemini 2.0 Flash Thinking. Google’s CEO, Sundar Pichai, called it “our most thoughtful model yet” in his own post. Google’s new model achieved a high score on SWE-Bench, a test that measures a models’ agentic abilities.

    However, OpenAI’s new o3 model is 20 percent better than o1. “o3 blew it out of the water,” says Ofir Press, a post-doctoral researcher at Princeton University who helped develop SWE-Bench. “Very surprising increase, not sure how they did it.”

    The two dueling models show competition between OpenAI and Google to be fiercer than ever. It is crucial for OpenAI to demonstrate that it can keep making advances as it seeks to attract more investment and build a profitable business. Google is meanwhile desperate to show that it remains at the forefront of AI research.

    The new models also show how AI companies are increasingly looking beyond simply scaling up AI models in order to wring greater intelligence out of them.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Email
    Previous ArticleAll of Canoo’s employees are reportedly on a ‘mandatory unpaid break’
    Next Article Congress Again Fails to Limit Scope of Spy Powers in New Defense Bill

    Related Posts

    Cheap AI Tools May Come at a Bigger Long-Term Cost

    June 12, 2025

    Vibe Coding Is Coming for Engineering Jobs

    June 12, 2025

    How Steve Jobs Wrote the Greatest Commencement Speech Ever

    June 12, 2025

    How Waymo Handles Footage From Events Like the LA Immigration Protests

    June 12, 2025

    Disney and Universal Sue AI Company Midjourney for Copyright Infringement

    June 12, 2025

    A Google Shareholder Is Suing the Company Over the TikTok Ban

    June 11, 2025
    Our Picks

    It could be 2026 before all your Thread border routers work together

    June 12, 2025

    Trump’s protest threats raise surveillance alarms around his military parade

    June 12, 2025

    Cheap AI Tools May Come at a Bigger Long-Term Cost

    June 12, 2025

    This is what it looks like to be colorblind

    June 12, 2025
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss
    News

    Bose upgraded the adaptive ANC on its new QuietComfort Ultra earbuds

    By News RoomJune 12, 2025

    Bose has announced upgraded QuietComfort Ultra wireless earbuds with better adaptive ANC that can more…

    A massive internet outage is messing up Google Home, Spotify, and other services

    June 12, 2025

    Vibe Coding Is Coming for Engineering Jobs

    June 12, 2025

    Garmin’s Venu X1 smartwatch has its biggest display and smallest case yet

    June 12, 2025
    Facebook X (Twitter) Instagram Pinterest
    • Privacy Policy
    • Terms of use
    • Advertise
    • Contact
    © 2025 Technology Mag. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.