Close Menu
Technology Mag

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    KitchenAid Promo Codes and Coupons for May 2025

    May 13, 2025

    Philips will let you fix your trimmer with 3D printable parts and accessories

    May 12, 2025

    Fallout’s second season premieres in December and will be followed by a third

    May 12, 2025
    Facebook X (Twitter) Instagram
    Subscribe
    Technology Mag
    Facebook X (Twitter) Instagram YouTube
    • Home
    • News
    • Business
    • Games
    • Gear
    • Reviews
    • Science
    • Security
    • Trending
    • Press Release
    Technology Mag
    Home » OpenAI Upgrades Its Smartest AI Model With Improved Reasoning Skills
    Business

    OpenAI Upgrades Its Smartest AI Model With Improved Reasoning Skills

    News RoomBy News RoomDecember 23, 20242 Mins Read
    Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Email

    OpenAI today announced an improved version of its most capable artificial intelligence model to date—one that takes even more time to deliberate over questions—just a day after Google announced its first model of this type.

    OpenAI’s new model, called o3, replaces o1, which the company introduced in September. Like o1, the new model spends time ruminating over a problem in order to deliver better answers to questions that require step-by-step logical reasoning. (OpenAI chose to skip the “o2” moniker because it’s already the name of a mobile carrier in the UK.)

    “We view this as the beginning of the next phase of AI,” said OpenAI CEO Sam Altman on a livestream Friday. “Where you can use these models to do increasingly complex tasks that require a lot of reasoning.”

    The o3 model scores much higher on several measures than its predecessor, OpenAI says, including ones that measure complex coding-related skills and advanced math and science competency. It is three times better than o1 at answering questions posed by ARC-AGI, a benchmark designed to test an AI models’ ability to reason over extremely difficult mathematical and logic problems they’re encountering for the first time.

    Google is pursuing a similar line of research. Noam Shazeer, a Google researcher, yesterday revealed in a post on X that the company has developed its own reasoning model, called Gemini 2.0 Flash Thinking. Google’s CEO, Sundar Pichai, called it “our most thoughtful model yet” in his own post. Google’s new model achieved a high score on SWE-Bench, a test that measures a models’ agentic abilities.

    However, OpenAI’s new o3 model is 20 percent better than o1. “o3 blew it out of the water,” says Ofir Press, a post-doctoral researcher at Princeton University who helped develop SWE-Bench. “Very surprising increase, not sure how they did it.”

    The two dueling models show competition between OpenAI and Google to be fiercer than ever. It is crucial for OpenAI to demonstrate that it can keep making advances as it seeks to attract more investment and build a profitable business. Google is meanwhile desperate to show that it remains at the forefront of AI research.

    The new models also show how AI companies are increasingly looking beyond simply scaling up AI models in order to wring greater intelligence out of them.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Email
    Previous ArticleAll of Canoo’s employees are reportedly on a ‘mandatory unpaid break’
    Next Article Congress Again Fails to Limit Scope of Spy Powers in New Defense Bill

    Related Posts

    My X Account Was Hijacked to Sell a Fake WIRED Memecoin. Then Came the Backlash

    May 12, 2025

    Buy Now or Pay More Later? ‘Macroeconomic Uncertainty’ Has Shoppers Anxious

    May 12, 2025

    Donald Trump’s UK Trade Deal Could Secure Jaguar’s Resurrection

    May 9, 2025

    Singapore’s Vision for AI Safety Bridges the US-China Divide

    May 9, 2025

    A ‘Trump Card Visa’ Is Already Showing Up in Immigration Forms

    May 8, 2025

    OpenAI and the FDA Are Holding Talks About Using AI In Drug Evaluation

    May 8, 2025
    Our Picks

    Philips will let you fix your trimmer with 3D printable parts and accessories

    May 12, 2025

    Fallout’s second season premieres in December and will be followed by a third

    May 12, 2025

    Samsung’s S25 Edge is a thin phone with two big tradeoffs

    May 12, 2025

    Trump administration announces ‘illegal’ rollback of energy and water efficiency standards

    May 12, 2025
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss
    News

    Duolingo is replacing hearts with energy

    By News RoomMay 12, 2025

    Duolingo is making a big change: it’s moving on from hearts in favor of a…

    Apple could use AI to help your iPhone save battery

    May 12, 2025

    My X Account Was Hijacked to Sell a Fake WIRED Memecoin. Then Came the Backlash

    May 12, 2025

    Peacock’s NBA coverage will add an overlay with live shot stats

    May 12, 2025
    Facebook X (Twitter) Instagram Pinterest
    • Privacy Policy
    • Terms of use
    • Advertise
    • Contact
    © 2025 Technology Mag. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.