Metrics Driven Development

29/08/2024 42 min Episode 284

Listen "Metrics Driven Development"

Episode Synopsis

How do you systematically measure, optimize, and improve the performance of LLM applications (like those powered by RAG or tool use)? Ragas is an open source effort that has been trying to answer this question comprehensively, and they are promoting a “Metrics Driven Development” approach. Shahul from Ragas joins us to discuss Ragas in this episode, and we dig into specific metrics, the difference between benchmarking models and evaluating LLM apps, generating synthetic test data and more.

Sponsors:

Assembly AI – Turn voice data into summaries with AssemblyAI’s leading Speech AI models. Built by AI experts, their Speech AI models include accurate speech-to-text for voice data (such as calls, virtual meetings, and podcasts), speaker detection, sentiment analysis, chapter detection, PII redaction, and more.

Featuring:

Shahul Es – GitHub, LinkedIn, X
Daniel Whitenack – Website, GitHub, X

Show Notes:

Ragas

Upcoming Events:

Register for upcoming webinars here!

More episodes of the podcast Practical AI

2025 was the year of agents, what's coming in 2026? 09/01/2026

Beyond chatbots: Agents that tackle your SOPs 17/12/2025

The AI engineer skills gap 10/12/2025

Technical advances in document understanding 02/12/2025

Chris on AI, autonomous swarming, home automation and Rust! 26/11/2025

Beyond note-taking with Fireflies 19/11/2025

Autonomous Vehicle Research at Waymo 13/11/2025

Are we in an AI bubble? 10/11/2025

While loops with tool calls 30/10/2025

Tiny Recursive Networks 24/10/2025
