Close Menu
Technology Mag

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    ‘Hades II’ Is Coming to Nintendo Switch This Month

    September 15, 2025

    Google thinks it can have AI summaries and a healthy web, too

    September 15, 2025

    A New Platform Offers Privacy Tools to Millions of Public Servants

    September 15, 2025
    Facebook X (Twitter) Instagram
    Subscribe
    Technology Mag
    Facebook X (Twitter) Instagram YouTube
    • Home
    • News
    • Business
    • Games
    • Gear
    • Reviews
    • Science
    • Security
    • Trending
    • Press Release
    Technology Mag
    Home » OpenAI teases new reasoning model—but don’t expect to try it soon
    News

    OpenAI teases new reasoning model—but don’t expect to try it soon

    News RoomBy News RoomDecember 20, 20242 Mins Read
    Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Email

    For the last day of ship-mas, OpenAI previewed a new set of frontier “reasoning” models dubbed o3 and o3-mini. The Verge first reported that a new reasoning model would be coming during this event.

    The company isn’t releasing these models today (and admits final results may evolve with more post-training). However, OpenAI is accepting applications from the research community to test these systems ahead of public release (which it has yet to set a date for). OpenAI launched o1 (codenamed Strawberry) in September and is jumping straight to o3, skipping o2 to avoid confusion (or trademark conflicts) with the British telecom company called O2.

    The term reasoning has become a common buzzword in the AI industry lately, but it basically means the machine breaks down instructions into smaller tasks that can produce stronger outcomes. These models often show the work for how it got to an answer, rather than just giving a final answer without explanation.

    According to the company, o3 surpasses previous performance records across the board. It beats its predecessor in coding tests (called SWE-Bench Verified) by 22.8 percent and outscores OpenAI’s Chief Scientist in competitive programming. The model nearly aced one of the hardest math competitions (called AIME 2024), missing one question, and achieved 87.7 percent on a benchmark for expert-level science problems (called GPQA Diamond). On the toughest math and reasoning challenges that usually stump AI, o3 solved 25.2 percent of problems (where no other model exceeds 2 percent).

    OpenAI claims o3 performs better than its other reasoning models in coding benchmarks.
    OpenAI

    The company also announced new research on deliberative alignment, which requires the AI model to process safety decisions step-by-step. So, instead of just giving yes/no rules to the AI model, this paradigm requires it to actively reason about whether a user’s request fits OpenAI’s safety policies. The company claims that when it tested this on o1, it was much better at following safety guidelines than previous models, including GPT-4.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Email
    Previous ArticleWe rounded up 40 last-minute gifts you can still grab in time for the holidays
    Next Article Josh King’s viral slide-out MagSafe gamepad found a home at OhSnap and looks amazing

    Related Posts

    Google thinks it can have AI summaries and a healthy web, too

    September 15, 2025

    Microsoft’s Office apps now have free Copilot Chat features

    September 15, 2025

    Elon Musk responds to Tesla pay proposal by buying $1 billion worth of stock

    September 15, 2025

    Samsung’s 2TB 990 Evo Plus SSD is $100 for a limited time

    September 15, 2025

    The US and China might finally have a TikTok deal

    September 15, 2025

    Amazon announces fall hardware event

    September 15, 2025
    Our Picks

    Google thinks it can have AI summaries and a healthy web, too

    September 15, 2025

    A New Platform Offers Privacy Tools to Millions of Public Servants

    September 15, 2025

    How China’s Propaganda and Surveillance Systems Really Operate

    September 15, 2025

    I’ve been using macOS Tahoe 26 since June and here are the eight best things about it

    September 15, 2025
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss
    News

    Microsoft’s Office apps now have free Copilot Chat features

    By News RoomSeptember 15, 2025

    Microsoft is adding the free Microsoft 365 Copilot Chat and agents to Office apps for…

    I Wasn’t Sure I Wanted Anthropic to Pay Me for My Books—I Do Now

    September 15, 2025

    Elon Musk responds to Tesla pay proposal by buying $1 billion worth of stock

    September 15, 2025

    Samsung’s 2TB 990 Evo Plus SSD is $100 for a limited time

    September 15, 2025
    Facebook X (Twitter) Instagram Pinterest
    • Privacy Policy
    • Terms of use
    • Advertise
    • Contact
    © 2025 Technology Mag. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.