Generalization in AI, with Dr. Dieuwke Hupkes

16/07/2025 1h 9min Episode 7



Episode Synopsis

A must-listen episode with Dr. Dieuwke Hupkes, a research scientist at Meta AI Research, where we dive into AI generalization, LLM robustness, and model evaluation in large language models.

We explore how LLMs handle grammar and hierarchy, how they generalize across tasks and languages, and what consistency tells us about AI alignment.

We also talk about Dieuwke’s journey from physics to NLP, the challenges of peer review, and sustaining a career in research, plus how pole dancing helps with focus 💪

REFERENCES:
Dieuwke Hupkes - Google Scholar profile
A taxonomy and review of generalization research in NLP
What's in My Big Data
GenBench workshop (YouTube, website)
Separating form and meaning: Using self-consistency to quantify task understanding across multiple senses
From form(s) to meaning: Probing the semantic depths of language models using multisense consistency
MultiLoKo: a multilingual local knowledge benchmark for LLMs spanning 31 languages
How much do language models memorize?

Chapters
00:00 Introduction to Dieuwke Hupkes and Her Journey
05:15 Navigating Challenges in Research
07:17 The Peer Review Process: Insights and Frustrations
16:23 Being a Woman in AI: Representation and Challenges
19:57 Balancing Research and Personal Life
23:37 Exploring Consistency and Generalization in Language Models
33:31 Generalization Across Modalities
35:15 Exploring Generalization Taxonomy
40:55 Challenges in Evaluating Generalization
44:12 Data Contamination and Generalization
50:43 Consistency in Language Models
57:23 The Intersection of Consistency and Alignment
01:01:15 Current Research Directions

🎧 Subscribe to stay updated on new episodes spotlighting brilliant women shaping the future of AI.

WiAIR website:
♾️ https://women-in-ai-research.github.io

Follow us at:
♾️ LinkedIn
♾️ Bluesky
♾️ X (Twitter)

#LLMs #AIgeneralization #LLMrobustness #AIalignment #ModelEvaluation #MetaAIResearch #WiAIR #WiAIRpodcast

More episodes of the podcast Women in AI Research (WiAIR)

Can We Trust AI Explanations? Dr. Ana Marasović on AI Trustworthiness, Explainability & Faithfulness 09/10/2025

Open Science and LLMs, with Dr. Valentina Pyatkin 17/09/2025

Unlocking LLM Reasoning, with Simeng Sophia Han 27/08/2025

LLM Hallucinations and Machine Unlearning, with Dr. Abhilasha Ravichander 06/08/2025

Decentralized AI, with Wanru Zhao 25/06/2025

Interpretable AI, with Dr. Faiza Khan Khattak 04/06/2025

Robots with Empathy, with Dr. Angelica Lim 14/05/2025

Responsible AI for Health, with Aparna Balagopalan 23/04/2025

Bias in AI, with Amanda Cercas Curry 03/04/2025

Limits of Transformers, with Nouha Dziri 12/03/2025



