Listen "#38 Karol Hausman & Kevin Black: Building A Brain For Any Robot | AI Eating The Physical World"
Episode Synopsis
I sit down with Karol Hausman and Kevin Black of Physical Intelligence (Pi) to unpack how they are building a general-purpose robot brain so any robot can perform any task, anywhere. We talk about the convergence of AI, robotics, and automation and what it will take to give machines true physical intelligence. How Vision-Language-Action Models (VLAs) differ from other AI models.The "Taylor Swift" moment that showed the capabilities of VLAsPi's strategy for open-sourcing models to accelerate research and development.The technical advancements in their Pi 0.5 model and "open world" generalization.The challenges of inference speed and context lengthImportance of low cost hardwareOpportunities for non-technical founders in roboticsIf you're interested in the future of general-purpose robots, physical AI, or the future of labor, then give this episode a listen/watch.#robots #ai #decentralizationChapters:00:00 Introduction02:11 Vision-Language-Action Model (VLA) vs Vision-Language Model (VLM)05:47 "Taylor Swift" Pivotal Moment In Robotics08:45 Training Robots With Natural Language Prompts10:55 Pi's "Action Expert" Architecture14:07 Pi’s Open Source Strategy17:16 Perspectives On Building Hardware23:17 Creating An Ecosystem For Physical Intelligence25:56 Pi 0.5 Model And Open World Generalization33:23 Hitting Diminishing Returns On Task-Specific Data36:21 Tackling Real-Time Inference Speed in Robotics39:38 Improving Context Length45:58 Importance of In-House Data Collection48:46 Opportunities For Service Providers In Robotics49:33 The Role of Hardware in Robotics51:05 Dealing with Edge Cases And Data Diversity53:27 Founding Story Of Pi56:33 Kevin’s Journey To Pi59:54 Opportunities For Non-Technical Founders In Robotics01:02:08 Exploring Areas For Early Deployment01:02:49 Rapid Fire Questions on Robotics01:03:01 One Assumption AI Researchers Get Wrong01:07:54 ClosingFollow Jordan on X: https://x.com/jrwolfeLinks to Karol & Kevin’s Work:Pi (Physical Intelligence) Website – https://www.physicalintelligence.company/ E-mail - [email protected] Follow Pi on X - https://x.com/Physical_int Kevin’s website - https://kevin.black/ Follow Kevin on X - https://x.com/kvablack Karol’s website - https://karolhausman.github.io/ Follow Karol on X - https://x.com/hausman_k Key Influences and Resources Mentioned:Sergey Levine’s Substack - https://sergeylevine.substack.com/ Subscribe & Follow:📰 Join the Substack – https://goingdirect.substack.com/subscribe🎧 Listen on Listen on Apple Podcasts – https://podcasts.apple.com/us/podcast/going-direct-conversations/id1800505663Subscribe to YouTube for more deep dives on decentralization, robots, and the real economy – https://www.youtube.com/@goingdirect_/videos
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.