Listen "DeepMind's wet dream is M3-Agent's reality: how long-term multimodal memory is modelling the real world"
Episode Synopsis
Google DeepMind's Demis Hassabis and his team have a bold mission: penetrating the 4D chess game that's AI embracing our ever-changing biological, physical world.Taking a snapshot is one thing. Remembering the molecular topology and their constant changes of state is truly what separates fact from fiction.It seemed like an impossible target to hit. Until M3-Agent, the work of researchers associated with ByteDance at Shanghai Jiao Tong University, showed up with long-term multimodal memory - allowing the agent to see, hear, remember, and reason just like humans.M3-Agent's potential is groundbreaking.Here are just three use cases that will blow all our minds:Autonomous robotics: Robots in homes or warehouses remember object locations, user habits, and past errors, adapting tasks dynamically, such as a caregiver bot recalling a patient's routines for personalized aidEnhanced surveillance: Security systems analyse live video/audio feeds, building memory of normal patterns to detect anomalies, predict threats, and reason through scenarios, like identifying intruders based on historical behavioursPersonalised education: AI tutors process student interaction videos, remember progress and misconceptions over time, and deliver tailored lessons, such as adapting math explanations from weeks of observed struggles.Read the paper: Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory.
More episodes of the podcast AI Today
When's the right time to go all-in with AI?
18/10/2025
ELephantLM: the AI that never forgets!
13/10/2025
brAIn: thinking of the future?
01/10/2025
Does AI work?
26/09/2025
China's got AI in the bag
14/09/2025
I'm working on the Zeitgeist
03/09/2025
What happens when AI fires all the hirers?
21/08/2025
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.