AI Evals & Discovery
Episode Synopsis
Building AI products isn’t just about clever prompts and orchestration—it’s about knowing if what you’ve built actually works. In this episode, Teresa Torres and Petra Wille dive deep into AI evals: how they’re defined, why they’re essential, and how teams can implement them to ensure product quality.
Teresa shares her journey building her Interview Coach tool and the hard lessons she learned about evals along the way. From golden datasets and synthetic data to error analysis, code-based checks, and LLM-as-judge methods, you’ll walk away with a clearer picture of how to measure and improve AI products over time.
More episodes of the podcast All Things Product with Teresa and Petra
Building AI Products
16/09/2025
Stop Chasing Promotions
09/09/2025
Curation
02/09/2025
Summer Break
22/07/2025
Go to the Source
15/07/2025
Red Flags When Choosing A Coach
08/07/2025
Funding Projects Vs. Teams
01/07/2025
Leading Or Managing Chaos
24/06/2025
AI Prototyping
17/06/2025