Listen "AI Evals & Discovery"
Episode Synopsis
Building AI products isn’t just about clever prompts and orchestration—it’s about knowing if what you’ve built actually works. In this episode, Teresa Torres and Petra Wille dive deep into AI evals: how they’re defined, why they’re essential, and how teams can implement them to ensure product quality.
Teresa shares her journey building her Interview Coach tool and the hard lessons she learned about evals along the way. From golden datasets and synthetic data to error analysis, code-based checks, and LLM-as-judge methods, you’ll walk away with a clearer picture of how to measure and improve AI products over time.
Teresa shares her journey building her Interview Coach tool and the hard lessons she learned about evals along the way. From golden datasets and synthetic data to error analysis, code-based checks, and LLM-as-judge methods, you’ll walk away with a clearer picture of how to measure and improve AI products over time.
More episodes of the podcast All Things Product with Teresa and Petra
Global Invoicing
11/11/2025
AI At Home And Work
04/11/2025
Context Is King
28/10/2025
Moments That Changed Us
21/10/2025
Product & Leadership Legacy
14/10/2025
Deliberate Practice
07/10/2025
Building AI Products
16/09/2025
Stop Chasing Promotions
09/09/2025
Curation
02/09/2025
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.