Pre-training LLMs: One Model To Rule Them All? with Talfan Evans, DeepMind

18/05/2024 37 min

Episode Synopsis

Talfan Evans is a research engineer at DeepMind, where he focuses on data curation and foundational research for pre-training LLMs and multimodal models like Gemini. I ask Talfan: Will one model rule them all? What does "high quality data" actually mean in the context of LLM training? Is language model pre-training becoming commoditized? Are companies like Google and OpenAI keeping their AI secrets to themselves? Does the startup or open source community stand a chance next to the giants? Also check out Talfan's latest paper at DeepMind, Bad Students Make Good Teachers.

More episodes of the podcast Thinking Machines: AI & Philosophy

AI Therapy: An Open Conversation with Therapists 30/09/2025

AI Therapy with Slingshot's Derrick Hull 17/03/2025

What if we could cure loneliness? Philosophy, dopamine, and more with Mark Ungless 26/02/2025

Does Philosophy Make Progress? Chatting with Every's Dan Shipper 23/01/2025

OpenAI o1: Another GPT-3 moment? 18/10/2024

The Future is Fine Tuned (with Dev Rishi, Predibase) 24/05/2024

On Adversarial Training & Robustness with Bhavna Gopal 08/05/2024

On Emotionally Intelligent AI (with Chris Gagne, Hume AI) 19/04/2024

Why Greatness Cannot Be Planned (with Joel Lehman) 22/03/2024

Where are the good AI products? (with Varun Shenoy) 12/03/2024
