Listen "648: VALL-E: Uncannily Realistic Voice Imitation from a 3-Second Clip"
Episode Synopsis
Text-to-speech gets a groundbreaking update with Microsoft’s VALL-E. On this Five-Minute Friday, Jon Krohn investigates how the Microsoft team modeled their tool to replicate natural human speech using just three seconds of a person’s voice.
Additional materials: www.superdatascience.com/648
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Additional materials: www.superdatascience.com/648
Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
More episodes of the podcast Super Data Science: ML & AI Podcast with Jon Krohn
955: Nested Learning, Spatial Intelligence and the AI Trends of 2026, with Sadie St. Lawrence
06/01/2026
953: Beyond “Agent Washing”: AI Systems That Actually Deliver ROI, with Dell’s Global CTO John Roese
30/12/2025
952: How to Avoid Burnout and Get Promoted, with “The Fit Data Scientist” Penelope Lafeuille
26/12/2025
948: In Case You Missed It in November 2025
12/12/2025
946: How Robotaxis Are Transforming Cities
05/12/2025
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.