275: Machine Learning Through Reinforcement & Contextual Bandits

Super Data Science: ML & AI Podcast with Jon Krohn

03/07/2019 1h 1min

Listen "275: Machine Learning Through Reinforcement & Contextual Bandits"

Episode Synopsis

In this episode of the SuperDataScience Podcast, I chat with the Machine Learning Research Scientist, John Langford. You will hear about unsupervised, supervised learning and reinforcement learning, and the differences between the three. You will learn about applications of contextual bandits and reinforcement learning in general, YOLO style algorithms versus simulator algorithms, technics for avoiding local optimums. You will also learn about the balance between exploration and exploitation, learning to search and active learning.

If you enjoyed this episode, check out show notes, resources, and more at www.superdatascience.com/275

More episodes of the podcast Super Data Science: ML & AI Podcast with Jon Krohn

953: Beyond “Agent Washing”: AI Systems That Actually Deliver ROI, with Dell’s Global CTO John Roese 30/12/2025

952: How to Avoid Burnout and Get Promoted, with “The Fit Data Scientist” Penelope Lafeuille 26/12/2025

951: Context Engineering, Multiplayer AI and Effective Search, with Dropbox’s Josh Clemm 23/12/2025

950: Happy Holidays from All of Us at the SuperDataScience Podcast 19/12/2025

949: Why AI Keeps Failing Society, with Stanford professor Alex “Sandy” Pentland 16/12/2025

948: In Case You Missed It in November 2025 12/12/2025

947: How to Get Hired at Top Firms like Netflix and Spotify, with Jeff Li 09/12/2025

946: How Robotaxis Are Transforming Cities 05/12/2025

945: AI is a Joke, with Joel Beasley 02/12/2025

944: Gemini 3 Pro: Google’s Back on Top 28/11/2025

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

275: Machine Learning Through Reinforcement & Contextual Bandits

Listen "275: Machine Learning Through Reinforcement & Contextual Bandits"

Episode Synopsis

More episodes of the podcast Super Data Science: ML & AI Podcast with Jon Krohn

7 Advices to Prevent Identity Theft

Telecommuting for employees of trust

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD