Listen "283: Getting The Most Out of Data With Gradient Boosting"
Episode Synopsis
In this episode of the SuperDataScience Podcast, I chat with one of the key people behind the Python package scikit-learn, Andreas Mueller. You will learn about gradient boosting algorithms, XGBoost, LightGBM and HistGradientBoosting. You will hear Andreas's approach to solving problems, what machine learning algorithms he prefers to apply to a given data science challenge, in which order and why. You will also hear about problems with Kaggle competitions. You will find out the four key questions that Andreas recommends to ask when you have a data challenge in front of you. You will learn about his 95% rule to creating models, and creating success in business enterprises with the help of machine learning. And, finally, you will also learn about the Data Science Institute at Columbia University.
If you enjoyed this episode, check out show notes, resources, and more at www.superdatascience.com/283
If you enjoyed this episode, check out show notes, resources, and more at www.superdatascience.com/283
More episodes of the podcast Super Data Science: ML & AI Podcast with Jon Krohn
953: Beyond “Agent Washing”: AI Systems That Actually Deliver ROI, with Dell’s Global CTO John Roese
30/12/2025
952: How to Avoid Burnout and Get Promoted, with “The Fit Data Scientist” Penelope Lafeuille
26/12/2025
948: In Case You Missed It in November 2025
12/12/2025
946: How Robotaxis Are Transforming Cities
05/12/2025
945: AI is a Joke, with Joel Beasley
02/12/2025
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.