Close Menu
Technology Mag

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    This week’s best deal is a ‘kids’ Kindle Paperwhite that’s better than the adult version

    October 4, 2025

    Breaking up (Google) is hard to do

    October 4, 2025

    Ecovacs’ Deebot X8 and X9 Pro Omni robovacs have hit a new low price

    October 4, 2025
    Facebook X (Twitter) Instagram
    Subscribe
    Technology Mag
    Facebook X (Twitter) Instagram YouTube
    • Home
    • News
    • Business
    • Games
    • Gear
    • Reviews
    • Science
    • Security
    • Trending
    • Press Release
    Technology Mag
    Home » Apple’s MM1 AI Model Shows a Sleeping Giant Is Waking Up
    Business

    Apple’s MM1 AI Model Shows a Sleeping Giant Is Waking Up

    News RoomBy News RoomMarch 20, 20244 Mins Read
    Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Email

    While the tech industry went gaga for generative artificial intelligence, one giant has held back: Apple. The company has yet to introduce so much as an AI-generated emoji, and according to a New York Times report today and earlier reporting from Bloomberg, it is in preliminary talks with Google about adding the search company’s Gemini AI model to iPhones.

    Yet a research paper quietly posted online last Friday by Apple engineers suggests that the company is making significant new investments into AI that are already bearing fruit. It details the development of a new generative AI model called MM1 capable of working with text and images. The researchers show it answering questions about photos and displaying the kind of general knowledge skills shown by chatbots like ChatGPT. The model’s name is not explained but could stand for MultiModal 1.

    MM1 appears to be similar in design and sophistication to a variety of recent AI models from other tech giants, including Meta’s open source Llama 2 and Google’s Gemini. Work by Apple’s rivals and academics shows that models of this type can be used to power capable chatbots or build “agents” that can solve tasks by writing code and taking actions such as using computer interfaces or websites. That suggests MM1 could yet find its way into Apple’s products.

    “The fact that they’re doing this, it shows they have the ability to understand how to train and how to build these models,” says Ruslan Salakhutdinov, a professor at Carnegie Mellon who led AI research at Apple several years ago. “It requires a certain amount of expertise.”

    MM1 is a multimodal large language model, or MLLM, meaning it is trained on images as well as text. This allows the model to respond to text prompts and also answer complex questions about particular images.

    One example in the Apple research paper shows what happened when MM1 was provided with a photo of a sun-dappled restaurant table with a couple of beer bottles and also an image of the menu. When asked how much someone would expect to pay for “all the beer on the table,” the model correctly reads off the correct price and tallies up the cost.

    “This is just the beginning. The team is already hard at work on the next generation of models.”

    Brandon McKinzie, Apple researcher

    When ChatGPT launched in November 2022, it could only ingest and generate text, but more recently its creator OpenAI and others have worked to expand the underlying large language model technology to work with other kinds of data. When Google launched Gemini (the model that now powers its answer to ChatGPT) last December, the company touted its multimodal nature as beginning an important new direction in AI. “After the rise of LLMs, MLLMs are emerging as the next frontier in foundation models,” Apple’s paper says.

    MM1 is a relatively small model as measured by its number of “parameters,” or the internal variables that get adjusted as a model is trained. Kate Saenko, a professor at Boston University who specializes in computer vision and machine learning, says this could make it easier for Apple’s engineers to experiment with different training methods and refinements before scaling up when they hit on something promising.

    Saenko says the MM1 paper provides a surprising amount of detail on how the model was trained for a corporate publication. For instance, the engineers behind MM1 describe tricks for improving the performance of the model including increasing the resolution of images and mixing text and image data. Apple is famed for its secrecy, but it has previously shown unusual openness about AI research as it has sought to lure the talent needed to compete in the crucial technology.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Email
    Previous ArticleThe Global Danger of Boring Buildings
    Next Article Apex Legends devs confirm esports hacking incident, respond with ‘layered’ updates

    Related Posts

    China Rolls Out Its First Talent Visa as the US Retreats on H-1Bs

    October 3, 2025

    OpenAI’s New Sora App Lets You Deepfake Yourself for Entertainment

    October 3, 2025

    This AI-Powered Robot Keeps Going Even if You Attack It With a Chainsaw

    October 3, 2025

    Chatbots Play With Your Emotions to Avoid Saying Goodbye

    October 3, 2025

    Exclusive: Mira Murati’s Stealth AI Lab Launches Its First Product

    October 2, 2025

    Why One VC Thinks Quantum Is a Bigger Unlock Than AGI

    October 1, 2025
    Our Picks

    Breaking up (Google) is hard to do

    October 4, 2025

    Ecovacs’ Deebot X8 and X9 Pro Omni robovacs have hit a new low price

    October 4, 2025

    Instagram wants me to make content — I just want to post a photo

    October 4, 2025

    Acer’s latest Spin 514 is so close to Chromebook greatness

    October 4, 2025
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss
    Science

    Coke Designed a Plastic Bottle to Sell the World More Soda

    By News RoomOctober 4, 2025

    Ivester was taking a gamble. Coca-Cola was ploughing tens of millions of dollars into modifying…

    Where Do Your Passwords Go When You Die?

    October 4, 2025

    Discord customer service data breach leaks user info and scanned photo IDs

    October 3, 2025

    Oregon’s National Guard lawsuit hinges on Trump’s Truth Social posts

    October 3, 2025
    Facebook X (Twitter) Instagram Pinterest
    • Privacy Policy
    • Terms of use
    • Advertise
    • Contact
    © 2025 Technology Mag. All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.