015 - Training data vs validation data vs test data

11/09/2025 4 min

Listen "015 - Training data vs validation data vs test data"

Descargar episodio Ver en sitio original

Episode Synopsis

How do we know if a medical AI has truly learned to spot disease, or just memorised the answers to its practice questions? The same way we evaluate a trainee: with a final, unseen exam.This crucial process involves splitting data into three sets: training data (the textbook), validation data (the mock exam), and test data (the final exam). In this episode of The Health AI Brief, we explain why this split is our best defence against overconfident AI, what 'overfitting' means for clinical practice, and why the 'test set' result is the only number you should trust when appraising a new AI study.#TrainingData #ValidationData #TestData #Overfitting #ModelValidation #ArtificialIntelligence #MachineLearning #HealthcareAI #MedicalAI #ClinicalAI #CriticalAppraisal #EvidenceBasedMedicine #DigitalHealth #ai in medicine Music generated by Mubert https://mubert.com/[email protected]

More episodes of the podcast The Health AI Brief

ChatGPT Health - Consolidating Health Data - Diagnosis or Data-Mine 09/01/2026

047 Embeddings - The Medical Dictionary for AI 08/01/2026

AI vs Pancreatic Cancer - Can PANDA Solve the Early Detection Problem 07/01/2026

The Wealth of Health - Avoiding an Inequality Spiral in the AI Age 06/01/2026

046 Generative Adversarial Networks - The Forger and the Detective 31/12/2025

045 LSTMs - Fixing the AI's Short-Term Memory 30/12/2025

Hugging a PowerPoint - Why ChatGPT makes a Terrible (but Addictive) Therapist 23/12/2025

The 'Duck Test' for AI And Why Your Chatbot Might Be an Illegal Medical Device 22/12/2025

044 RNNs - AI With a Memory for Sequences 18/12/2025

043 CNNs - How AI See Medical Images 16/12/2025

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

015 - Training data vs validation data vs test data

Listen "015 - Training data vs validation data vs test data"

Episode Synopsis

More episodes of the podcast The Health AI Brief

Prevent Attacks From Your Local Area Network

Positive Attitude, Share your ZARZA Attitude!

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD