Technology Mag

    Who’s to Blame When AI Agents Screw Up?

    By News Room | May 22, 2025 | 4 Mins Read

    Over the past year, veteran software engineer Jay Prakash Thakur has spent his nights and weekends prototyping AI agents that could, in the near future, order meals and engineer mobile apps almost entirely on their own. His agents, while surprisingly capable, have also exposed new legal questions that await companies trying to capitalize on Silicon Valley’s hottest new technology.

    Agents are AI programs that can act mostly independently, allowing companies to automate tasks such as answering customer questions or paying invoices. While ChatGPT and similar chatbots can draft emails or analyze bills upon request, Microsoft and other tech giants expect that agents will tackle more complex functions—and most importantly, do it with little human oversight.

    The tech industry’s most ambitious plans involve multi-agent systems, with dozens of agents someday teaming up to replace entire workforces. For companies, the benefit is clear: saving on time and labor costs. Already, demand for the technology is rising. Tech market researcher Gartner estimates that agentic AI will resolve 80 percent of common customer service queries by 2029. Fiverr, a service where businesses can book freelance coders, reports that searches for “ai agent” have surged 18,347 percent in recent months.

    Thakur, a mostly self-taught coder living in California, wanted to be at the forefront of the emerging field. His day job at Microsoft isn’t related to agents, but he has been tinkering with AutoGen, Microsoft’s open source software for building agents, since he worked at Amazon back in 2024. Thakur says he has developed multi-agent prototypes using AutoGen with just a dash of programming. Last week, Amazon rolled out a similar agent development tool called Strands; Google offers what it calls an Agent Development Kit.

    Because agents are meant to act autonomously, the question of who bears responsibility when their errors cause financial damage has been Thakur’s biggest concern. Assigning blame when agents from different companies miscommunicate within a single, large system could become contentious, he believes. He compared the challenge of reviewing error logs from various agents to reconstructing a conversation based on different people’s notes. “It’s often impossible to pinpoint responsibility,” Thakur says.

    Joseph Fireman, senior legal counsel at OpenAI, said on stage at a recent legal conference hosted by the Media Law Resource Center in San Francisco that aggrieved parties tend to go after those with the deepest pockets. That means companies like his will need to be prepared to take some responsibility when agents cause harm, even when a kid messing around with an agent might be to blame. (If that person were at fault, they likely wouldn’t be a worthwhile target moneywise, the thinking goes.) “I don’t think anybody is hoping to get through to the consumer sitting in their mom’s basement on the computer,” Fireman said. The insurance industry has begun rolling out coverage for AI chatbot issues to help companies cover the costs of mishaps.

    Onion Rings

    Thakur’s experiments have involved him stringing together agents in systems that require as little human intervention as possible. One project he pursued was replacing fellow software developers with two agents. One was trained to search for specialized tools needed for making apps, and the other summarized their usage policies. In the future, a third agent could use the identified tools and follow the summarized policies to develop an entirely new app, Thakur says.

    When Thakur put his prototype to the test, a search agent found a tool that, according to the website, “supports unlimited requests per minute for enterprise users” (meaning high-paying clients can rely on it as much as they want). But in trying to distill the key information, the summarization agent dropped the crucial qualification of “per minute for enterprise users.” It erroneously told the coding agent, which did not qualify as an enterprise user, that it could write a program that made unlimited requests to the outside service. Because this was a test, there was no harm done. If it had happened in real life, the truncated guidance could have led to the entire system unexpectedly breaking down.
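
The kind of truncation described above could, in principle, be caught by a check that sits between the summarization agent and whatever consumes its output. The following is a minimal sketch under stated assumptions, not a description of Thakur's prototype: the function name `dropped_qualifiers` and the short pattern list are hypothetical, and a real system would need far broader coverage than three regular expressions.

```python
import re

# Hypothetical patterns for phrases that narrow a claim (assumed for illustration only).
QUALIFIER_PATTERNS = [
    r"per (?:second|minute|hour|day|month)",
    r"for [\w-]+ (?:users|customers|plans|accounts)",
    r"up to [\d,]+",
]


def dropped_qualifiers(source: str, summary: str) -> list[str]:
    """Return qualifying phrases that appear in the source but not in the summary."""
    source_lower = source.lower()
    summary_lower = summary.lower()
    missing = []
    for pattern in QUALIFIER_PATTERNS:
        for match in re.finditer(pattern, source_lower):
            phrase = match.group(0)
            # Record each dropped phrase once, in the order it is found.
            if phrase not in summary_lower and phrase not in missing:
                missing.append(phrase)
    return missing


source = "Supports unlimited requests per minute for enterprise users."
summary = "Supports unlimited requests."
print(dropped_qualifiers(source, summary))
# prints ['per minute', 'for enterprise users']
```

A non-empty result could be used to hold the summary for human review rather than passing it on to a downstream agent. A check like this only flags wording that disappeared; it cannot tell whether the downstream agent actually qualifies as an enterprise user.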
