Testing Natural Language Models

12/11/2020 30 min

Listen "Testing Natural Language Models"

Episode Synopsis

In this episode of the Data Exchange I speak with Marco Ribeiro, Senior Researcher at Microsoft Research, and lead author of the award-winning paper ”Beyond Accuracy: Behavioral Testing of NLP models with CheckList”. As machine learning gains importance across many application domains and industries, there is a growing need to formalize how ML models get built, deployed, and used. MLOps is an emerging set of practices focused on productionizing the machine learning lifecycle, that draws ideas from CI/CD. But even before we talk about deploying a model to production, how do we inject more rigor into the model development process?Subscribe: Apple, Android, Spotify, Stitcher, Google, and RSS.Download the 2020 NLP Survey Report and learn how companies are using and implementing natural language technologies.Detailed show notes can be found on The Data Exchange web site.Subscribe to The Gradient Flow Newsletter.

More episodes of the podcast The Data Exchange with Ben Lorica

Teaching AI How to Forget 15/01/2026

The Humanoid Hype Cycle: Separating “Shiny Objects” from Real Utility 10/01/2026

The Junior Data Engineer is Now an AI Agent 08/01/2026

The Truth About Agents in Production 31/12/2025

The best books we read this year 📚 24/12/2025

The Developer’s Guide to LLM Security 18/12/2025

Is AI a Utility? Defining Usability and Public Trust 13/12/2025

How to Build AI Copilots That Teach Rather Than Automate 11/12/2025

The AI Revolution Finally Comes to Structured Data 04/12/2025

Building the Knowledge Layer Your Agents Need 26/11/2025

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

Testing Natural Language Models

Listen "Testing Natural Language Models"

Episode Synopsis

More episodes of the podcast The Data Exchange with Ben Lorica

White Hat Hacking, Ethical Hackers…

Internet Predators on the prowl

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD