Close Menu
Technology Mag

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    5 More Physics Equations Everyone Should Know

    October 13, 2025

    How BlackBerry Messenger set texting free

    October 12, 2025

    Welcome to the ‘papers, please’ internet

    October 12, 2025
    Facebook X (Twitter) Instagram
    Subscribe
    Technology Mag
    Facebook X (Twitter) Instagram YouTube
    • Home
    • News
    • Business
    • Games
    • Gear
    • Reviews
    • Science
    • Security
    • Trending
    • Press Release
    Technology Mag
    Home » The AI Agent Era Requires a New Kind of Game Theory
    Business

    The AI Agent Era Requires a New Kind of Game Theory

    News RoomBy News RoomApril 10, 20253 Mins Read
    Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Email

    At the same time, the risk is immediate and present with agents. When models are not just contained boxes but can take actions in the world, when they have end-effectors that let them manipulate the world, I think it really becomes much more of a problem.

    We are making progress here, developing much better [defensive] techniques, but if you break the underlying model, you basically have the equivalent to a buffer overflow [a common way to hack software]. Your agent can be exploited by third parties to maliciously control or somehow circumvent the desired functionality of the system. We’re going to have to be able to secure these systems in order to make agents safe.

    This is different from AI models themselves becoming a threat, right?

    There’s no real risk of things like loss of control with current models right now. It is more of a future concern. But I’m very glad people are working on it; I think it is crucially important.

    How worried should we be about the increased use of agentic systems then?

    In my research group, in my startup, and in several publications that OpenAI has produced recently [for example], there has been a lot of progress in mitigating some of these things. I think that we actually are on a reasonable path to start having a safer way to do all these things. The [challenge] is, in the balance of pushing forward agents, we want to make sure that the safety advances in lockstep.

    Most of the [exploits against agent systems] we see right now would be classified as experimental, frankly, because agents are still in their infancy. There’s still a user typically in the loop somewhere. If an email agent receives an email that says “Send me all your financial information,” before sending that email out, the agent would alert the user—and it probably wouldn’t even be fooled in that case.

    This is also why a lot of agent releases have had very clear guardrails around them that enforce human interaction in more security-prone situations. Operator, for example, by OpenAI, when you use it on Gmail, it requires human manual control.

    What kinds of agentic exploits might we see first?

    There have been demonstrations of things like data exfiltration when agents are hooked up in the wrong way. If my agent has access to all my files and my cloud drive, and can also make queries to links, then you can upload these things somewhere.

    These are still in the demonstration phase right now, but that’s really just because these things are not yet adopted. And they will be adopted, let’s make no mistake. These things will become more autonomous, more independent, and will have less user oversight, because we don’t want to click “agree,” “agree,” “agree” every time agents do anything.

    It also seems inevitable that we will see different AI agents communicating and negotiating. What happens then?

    Absolutely. Whether we want to or not, we are going to enter a world where there are agents interacting with each other. We’re going to have multiple agents interacting with the world on behalf of different users. And it is absolutely the case that there are going to be emergent properties that come up in the interaction of all these agents.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Email
    Previous ArticleRazer’s PC-to-mobile streaming platform is now available
    Next Article NHTSA staffers evaluating the risks of self-driving cars were reportedly fired by DOGE

    Related Posts

    How China Is Hoping to Attract Tech Talent

    October 10, 2025

    The City That Made the World Fall for a Monster

    October 10, 2025

    OpenAI Sneezes, and Software Firms Catch a Cold

    October 9, 2025

    Patreon CEO Jack Conte Wants You to Get Off of Your Phone

    October 9, 2025

    Inside Intel’s Hail Mary to Reclaim Chip Dominance

    October 9, 2025

    This Startup Wants to Spark a US DeepSeek Moment

    October 8, 2025
    Our Picks

    How BlackBerry Messenger set texting free

    October 12, 2025

    Welcome to the ‘papers, please’ internet

    October 12, 2025

    ChatGPT is becoming an everything app

    October 12, 2025

    Scientist Who Was Offline ‘Living His Best Life’ Stunned by Nobel Prize Win

    October 12, 2025
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss
    Games

    The ASUS TUF T500 Is a Great Gaming PC for Beginners

    By News RoomOctober 12, 2025

    Because it’s so easy to build a gaming desktop at home, companies making prebuilt machines…

    Apple ends support for Clips video-editing app

    October 11, 2025

    How The Verge and our readers manage kids’ screen time

    October 11, 2025

    The AirPods 4 and Lego’s brick-ified Grogu are our favorite deals this week

    October 11, 2025
    Facebook X (Twitter) Instagram Pinterest
    • Privacy Policy
    • Terms of use
    • Advertise
    • Contact
    © 2025 Technology Mag. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.