Measuring LLMs with Jodie Burchell
Episode Synopsis
How do you measure the quality of a large language model? Carl and Richard talk to Dr. Jodie Burchell about her work measuring large language models for accuracy, reliability, and consistency. Jodie talks about the variety of benchmarks that exist for LLMs and the problems each of them has. A broader conversation about quality digs into the idea that LLMs should be targeted to the particular topic area they are being used for - often, smaller is better! Building a good test suite for your LLM is challenging, but it can increase your confidence that the tool will work as expected.
More episodes of the podcast .NET Rocks!
Energy Geek Out 2025
01/01/2026
Space Geek Out 2025
25/12/2025
The Role of AI in Software Development
18/12/2025
Building an AI App with Calum Simpson
04/12/2025
More Sustainable Software with Tom Kerkhove
27/11/2025