Apple Publishes Details About New 'MM1' AI Model
Apple researchers have developed a new method for training large language models (LLMs) that seamlessly integrates both text and visual information.

The company's findings, detailed in a research paper titled "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training," showcase a new approach to creating more intelligent and flexible AI systems. By utilizing a diverse dataset comprising image-caption pairs, interleaved image-text documents, and text-only data, Apple's claims that the MM1 model sets a new standard in AI's ability to perform tasks such as image captioning, visual question answering, and natural language inference with a high level of accuracy.
Apple's research focuses on the combination of different types of training data and model architectures, which enables the AI to understand and generate language based on a mix of visual and linguistic cues. This capability is vital for tasks that require a nuanced comprehension of the world, such as interpreting complex images or answering questions that involve visual elements.
The paper also highlights the MM1 model's exceptional in-context learning abilities, particularly in the largest 30 billion parameter configuration of the model. This version apparently exhibits remarkable capabilities for multi-step reasoning over multiple images using few-shot "chain-of-thought" prompting, a technique that allows the AI to perform complex, open-ended problem solving based on minimal examples.
This research emerges as part of Apple's broader initiative to enhance its AI capabilities amid growing competition. Earlier today, Bloomberg's Mark Gurman reported that Apple is in discussions with Google to license Google's Gemini generative large-language models to power new features coming to the iPhone as part of iOS 18.
Popular Stories
Bloomberg's Mark Gurman has high expectations for Apple's first foldable iPhone.
In his Power On newsletter today, he said the foldable iPhone will be "the most significant overhaul in the iPhone's history."
"iPhone 4, iPhone 6 and iPhone X were clearly a big deal, but this is a whole new design," he said.
Like Samsung's Galaxy Z Fold 7, the foldable iPhone will reportedly open up like ...
iOS 26.5 is now available for developers, and while it doesn't include any new Siri capabilities, there are some major changes for the European Union, and smaller tweaks for features available worldwide.
Suggested Places
In the Maps app, there's a new "Suggested Places" feature that recommends locations to visit based on trending places nearby and recent searches. When Apple launches ads in ...
Apple has been celebrating its upcoming 50th anniversary by hosting surprise performances and other events around the world over the past few weeks, and now Bloomberg's Mark Gurman has revealed details about the company's grand finale.
In a social media post, Gurman said Apple's celebrations will conclude this week with a finale at its Apple Park headquarters for employees.
A special...
Popular Stories
Apple has quietly blocked AI "vibe coding" apps, such as Replit and Vibecode, from releasing App Store updates unless they make changes, The Information reports.
"Vibe coding" tools allow users with little to no programming experience to build apps or websites using natural language prompts. Their accessibility has driven rapid adoption among both developers and non-technical users.
Apple ...
Bloomberg's Mark Gurman has high expectations for Apple's first foldable iPhone.
In his Power On newsletter today, he said the foldable iPhone will be "the most significant overhaul in the iPhone's history."
"iPhone 4, iPhone 6 and iPhone X were clearly a big deal, but this is a whole new design," he said.
Like Samsung's Galaxy Z Fold 7, the foldable iPhone will reportedly open up like ...
iOS 26.5 is now available for developers, and while it doesn't include any new Siri capabilities, there are some major changes for the European Union, and smaller tweaks for features available worldwide.
Suggested Places
In the Maps app, there's a new "Suggested Places" feature that recommends locations to visit based on trending places nearby and recent searches. When Apple launches ads in ...