Listen "275: Machine Learning Through Reinforcement & Contextual Bandits"
Episode Synopsis
In this episode of the SuperDataScience Podcast, I chat with the Machine Learning Research Scientist, John Langford. You will hear about unsupervised, supervised learning and reinforcement learning, and the differences between the three. You will learn about applications of contextual bandits and reinforcement learning in general, YOLO style algorithms versus simulator algorithms, technics for avoiding local optimums. You will also learn about the balance between exploration and exploitation, learning to search and active learning.
If you enjoyed this episode, check out show notes, resources, and more at www.superdatascience.com/275
If you enjoyed this episode, check out show notes, resources, and more at www.superdatascience.com/275
More episodes of the podcast Super Data Science: ML & AI Podcast with Jon Krohn
953: Beyond “Agent Washing”: AI Systems That Actually Deliver ROI, with Dell’s Global CTO John Roese
30/12/2025
952: How to Avoid Burnout and Get Promoted, with “The Fit Data Scientist” Penelope Lafeuille
26/12/2025
948: In Case You Missed It in November 2025
12/12/2025
946: How Robotaxis Are Transforming Cities
05/12/2025
945: AI is a Joke, with Joel Beasley
02/12/2025
944: Gemini 3 Pro: Google’s Back on Top
28/11/2025
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.