Decoding LLM Quality: From Unit Testing to User Feedback

10/10/2023 18 min

Listen "Decoding LLM Quality: From Unit Testing to User Feedback"

Descargar episodio Ver en sitio original

Episode Synopsis

Dive deep into the nuances of measuring the quality of Large Language Model (LLM) prompts, as we explore past methodologies, and evaluate both qualitative assessments and large-scale testing techniques. Join us as we discuss the challenges of traditional metrics, the role of user feedback, and brainstorm new ways to gauge generative model performance.—Continue listening to The Prompt Desk Podcast for everything LLM & GPT, Prompt Engineering, Generative AI, and LLM Security.Check out PromptDesk.ai for an open-source prompt management tool.Check out Brads AI Consultancy at bradleyarsenault.me.Add Justin Macorin and Bradley Arsenault on LinkedIn.Please fill out our listener survey here to help us create a better podcast: https://docs.google.com/forms/d/e/1FAIpQLSfNjWlWyg8zROYmGX745a56AtagX_7cS16jyhjV2u_ebgc-tw/viewform?usp=sf_linkHosted on Ausha. See ausha.co/privacy-policy for more information.

More episodes of the podcast The Prompt Desk

What we learned about LLM’s in a year 02/10/2024

Validating Inputs with LLMs 25/09/2024

Why you can't automate everything with LLMs 18/09/2024

Data Preparation Best Practices for Fine Tuning 11/09/2024

Multilingual Prompting 28/08/2024

Safely Executing LLM Code 21/08/2024

How to Rescue AI Innovation at Big Companies 14/08/2024

How UX Will Change With Integrated Advice 07/08/2024

Prompting in Tool Results 31/07/2024

Can custom chips save AI's power problem? 24/07/2024

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

Decoding LLM Quality: From Unit Testing to User Feedback

Listen "Decoding LLM Quality: From Unit Testing to User Feedback"

Episode Synopsis

More episodes of the podcast The Prompt Desk

Email on your own domain, luxury or need?

Information Technology (IT)

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD