Episode 01 - Mechanistic Machine Learning

10/07/2020 1h 36min

Listen "Episode 01 - Mechanistic Machine Learning"

Episode Synopsis

This is the first episode of the Flush to Data podcast. We start with a discussion on mechanistic modelling and machine learning and venture into models for emulation, uncertainty quantification, and data quality. Bonus material includes a discussion on aspects of current scientific practice, including the lack of hypothesis testing, the evaluation of novelty, and the challenges with a generalist approach.Hosts: Jörg Rieckermann and Kris VillezGuest: Juan Pablo CarbjalLinks:* Juan Pablo's web page: https://sites.google.com/site/juanpicarbajal/* Article relating Gaussian processes and Kalman filter: www.jstor.org/stable/2984861 * BBC podcast on Gauss: https://www.bbc.co.uk/programmes/b09gbnfj* Using Lake Zurich as a heat sink: Unfortunately, we could not back-track the original source, despite considerable effort. If anyone of the listeners happens to know how to access the original source we would be grateful for a notice. The best we could find was documentation of related projects by Eawag: https://thermdis.eawag.ch/ and [1]. These show that ecological consequences have indeed been assessed in detail. * Goodhart's law: https://en.wikipedia.org/wiki/Goodhart's_law* An invitation to reproducible computational research: https://doi.org/10.1093/biostatistics/kxq028* Science in the age of selfies: https://doi.org/10.1073/pnas.1609793113 References:[1] Wüest, A. (2012). Potential zur Wärmeenergienutzung aus dem Zürichsee. Machbarkeit. Wärmeentzug (Heizen) und Einleitung von Kühlwasser. Kastanienbaum: Eawag. DORA-Link Episode guide:[0:00:00] Who is Juan Pablo Carbajal?[0:03:10] Mechanistic modelling versus artificial intelligence[0:07:08] Who is Juan Pablo Carbajal? (ctd.)[0:09:26] Cross-fertilization between robotics and wastewater engineering[0:15:05] Emulation: using models to approximate other models[0:21:22] Incorporating common sense and prior knowledge into data-driven models[0:31:31] Equivalence between Gaussian processes and Kalman filter[0:33:50] Utility of emulation[0:40:15] Utility of quantified uncertainty[0:44:50] Intermezzo[0:49:04] What can models say about data quality [1:02:15] How to communicate about data quality?[1:10:10] Preparing engineers for the future[1:15:23] Thank you and goodbye!Bonus material:[1:16:40] Interpretable machine learning models[1:22:33] Hypothesis testing[1:26:14] Critical assessment of novelty[1:30:50] Barriers to the generalist approach [1:35:48] Thank you and goodbye!

More episodes of the podcast Flush to Data

[Episode 07] Linda Åmand 15/05/2023

[Episode 06] Ali Gagnon 20/12/2021

[Episode 05] Lina Belia 28/02/2021

Episode 04 - Open Science 23/11/2020

Episode 03 - Integrated assessment - Bonus track 01/09/2020

Episode 03 - Integrated Assessment 01/09/2020

Episode 02 - Micropollutants and Learning from Sewers on Society 11/07/2020

Episode 00 - Trailer 07/07/2020

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

Episode 01 - Mechanistic Machine Learning

Listen "Episode 01 - Mechanistic Machine Learning"

Episode Synopsis

More episodes of the podcast Flush to Data

Prevent Attacks From Your Local Area Network

7 Advices to Prevent Identity Theft

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD