Close Menu
Technology Mag

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot
    Android’s Find Hub adds iPhone-like luggage tracking links

    Android’s Find Hub adds iPhone-like luggage tracking links

    March 3, 2026
    Google’s latest Pixel drop allows Gemini to order groceries for you and more

    Google’s latest Pixel drop allows Gemini to order groceries for you and more

    March 3, 2026
    Another Oracle outage is messing up US TikTok

    Another Oracle outage is messing up US TikTok

    March 3, 2026
    Facebook X (Twitter) Instagram
    Subscribe
    Technology Mag
    Facebook X (Twitter) Instagram YouTube
    • Home
    • News
    • Business
    • Games
    • Gear
    • Reviews
    • Science
    • Security
    • Trending
    • Press Release
    Technology Mag
    Home » OpenAI Finally Launched GPT-5. Here’s Everything You Need to Know
    Business

    OpenAI Finally Launched GPT-5. Here’s Everything You Need to Know

    News RoomBy News RoomAugust 8, 20253 Mins Read
    Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Email
    OpenAI Finally Launched GPT-5. Here’s Everything You Need to Know

    OpenAI’s blog post claims that GPT-5 beats its previous models on several coding benchmarks, including SWE-Bench Verified (scoring 74.9 percent), SWE-Lancer (GPT-5-thinking scored 55 percent), and Aider Polyglot (scored 88 percent), which test the model’s ability to fix bugs, complete freelance-style coding tasks, and work across multiple programming languages.

    During the press briefing on Wednesday, OpenAI post-training lead Yann Dubois prompted GPT-5 to “create a beautiful, highly interactive web app for my partner, an English speaker, to learn French.” He tasked the AI to include features like daily progress, a variety of activities like flashcards and quizzes, and noted that he wanted the app wrapped up in a “highly engaging theme.” After a minute or so, the AI-generated app popped up. While it was just one on-rails demo, the result was a sleek site that delivered exactly what Dubois asked for.

    “It’s a great coding collaborator, and also excels at agentic tasks,” Michelle Pokrass, a post-training lead, says. “It executes long chains and tool calls effectively [which means it better understands when and how to use functions like web browsers or external APIs], follows detailed instructions, and provides upfront explanations of its actions.”

    OpenAI also says in its blog post that GPT-5 is “our best model yet for health-related questions.” In three OpenAI health-related LLM benchmarks—HealthBench, HealthBench Hard, and HealthBench Consensus—the system card (a document that describes the product’s technical capabilities and other research findings) states that GPT-5-thinking outperforms previous models “by a substantial margin.” The thinking version of GPT-5 scored 25.5 percent on HealthBench Hard, up from o3’s 31.6 percent score. These scores are validated by two or more physicians, according to the system card.

    The model also allegedly hallucinates less, according to Pokrass, a common issue for AI where it provides false information. OpenAI’s safety research lead Alex Beutel adds that they’ve “significantly decreased the rates of deception in GPT-5.”

    “We’ve taken steps to reduce GPT-5-thinking’s propensity to deceive, cheat, or hack problems, though our mitigations are not perfect and more research is needed,” the system card says. “In particular, we’ve trained the model to fail gracefully when posed with tasks that it cannot solve.”

    The company’s system card says that after testing GPT-5 models without access to web browsing, researchers found its hallucination rate (which they defined as “percentage of factual claims that contain minor or major errors”) 26 percent less common than the GPT-4o model. GPT-5-thinking has a 65 percent reduced hallucination rate compared to o3.

    For prompts that could be dual-use (potentially harmful or benign), Beutel says GPT-5 uses “safe completions,” which prompts the model to “give as helpful an answer as possible, but within the constraints of remaining safe.” OpenAI did over 5,000 hours of red teaming, according to Beutel, and testing with external organizations to make sure the system was robust.

    OpenAI says it now boasts nearly 700 million weekly active users of ChatGPT, 5 million paying business users, and 4 million developers utilizing the API.

    “The vibes of this model are really good, and I think that people are really going to feel that,” head of ChatGPT Nick Turley says. “Especially average people who haven’t been spending their time thinking about models.”

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Email
    Previous ArticleHoto’s SnapBloq Power Tools Are Inviting, Even If They’re Not the Best
    Next Article A decade later, Windows is still bringing Control Panel features to the Settings app

    Related Posts

    What Happens When Your Coworkers Are AI Agents

    What Happens When Your Coworkers Are AI Agents

    December 9, 2025
    San Francisco Mayor Daniel Lurie: ‘We Are a City on the Rise’

    San Francisco Mayor Daniel Lurie: ‘We Are a City on the Rise’

    December 9, 2025
    An AI Dark Horse Is Rewriting the Rules of Game Design

    An AI Dark Horse Is Rewriting the Rules of Game Design

    December 9, 2025
    Watch the Highlights From WIRED’s Big Interview Event Right Here

    Watch the Highlights From WIRED’s Big Interview Event Right Here

    December 9, 2025
    Amazon Has New Frontier AI Models—and a Way for Customers to Build Their Own

    Amazon Has New Frontier AI Models—and a Way for Customers to Build Their Own

    December 4, 2025
    AWS CEO Matt Garman Wants to Reassert Amazon’s Cloud Dominance in the AI Era

    AWS CEO Matt Garman Wants to Reassert Amazon’s Cloud Dominance in the AI Era

    December 4, 2025
    Our Picks
    Google’s latest Pixel drop allows Gemini to order groceries for you and more

    Google’s latest Pixel drop allows Gemini to order groceries for you and more

    March 3, 2026
    Another Oracle outage is messing up US TikTok

    Another Oracle outage is messing up US TikTok

    March 3, 2026
    Shark’s latest robot vacuum hunts stains with UV light

    Shark’s latest robot vacuum hunts stains with UV light

    March 3, 2026
    Google brings Android’s desktop mode to Pixel devices

    Google brings Android’s desktop mode to Pixel devices

    March 3, 2026
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss
    The Pixel Watch now lets you tap to pay without opening the Wallet app News

    The Pixel Watch now lets you tap to pay without opening the Wallet app

    By News RoomMarch 3, 2026

    Google has a big update in store for the Pixel Watch, which will now allow…

    New MacBooks, the iPhone 17E, and more: Everything we know about Apple’s March 2026 announcements

    New MacBooks, the iPhone 17E, and more: Everything we know about Apple’s March 2026 announcements

    March 3, 2026
    Microsoft’s big developer conference returns to San Francisco in June

    Microsoft’s big developer conference returns to San Francisco in June

    March 3, 2026
    Apple’s website leaks MacBook ‘Neo,’ which could be its new cheaper laptop

    Apple’s website leaks MacBook ‘Neo,’ which could be its new cheaper laptop

    March 3, 2026
    Facebook X (Twitter) Instagram Pinterest
    • Privacy Policy
    • Terms of use
    • Advertise
    • Contact
    © 2026 Technology Mag. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.