AI model evaluation

24/09/2024 28 min Temporada 1 Episodio 2

Listen "AI model evaluation"

Descargar episodio Ver en sitio original

Episode Synopsis

AI is evolving at lightning speed - the shift from custom-built models to pre-trained large language models (LLMs) is driving rapid adoption from businesses.But how do we know if all these models are actually driving positive business outcomes? As the New York Times put it, “AI has a measurement problem”. In this Episode:Why most current AI model evaluation methods fall shortHow to truly measure AI effectiveness with real-world data and experimentation

More episodes of the podcast Outperform

How Top Companies Are Evolving Experimentation in 2025 19/12/2024

What We're Getting Wrong About Running Experiments 26/11/2024

4 Stages of Experiment Maturity 12/11/2024

5 places to generate GREAT A/B test ideas 29/10/2024

3 Challenges in Creating a Culture of Experimentation 16/10/2024

AB Testing vs. Multi-Armed Bandits: What You Need to Know 09/09/2024

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

AI model evaluation

Listen "AI model evaluation"

Episode Synopsis

More episodes of the podcast Outperform

Internet Predators on the prowl

Subdomains, a glance with the experts!

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Internet Predators on the prowl

Gray Hat Hacking, those with ambiguous ethics…

Dot COM: The Internet’s dominant TLD