Deep Dive into Long Context

02/05/2025 59 min Episode 6
Episode Synopsis

Explore the synergy between long context models and Retrieval Augmented Generation (RAG) in this episode of Release Notes. Join Google DeepMind's Nikolay Savinov as he discusses the importance of large context windows, how they enable AI agents, and what's next in the field.

Chapters:
0:52 Introduction & defining tokens
5:27 Context window importance
9:53 RAG vs. Long Context
14:19 Scaling beyond 2 million tokens
18:41 Long context improvements since 1.5 Pro release
23:26 Difficulty of attending to the whole context
28:37 Evaluating long context: beyond needle-in-a-haystack
33:41 Integrating long context research
34:57 Reasoning and long outputs
40:54 Tips for using long context
48:51 The future of long context: near-perfect recall and cost reduction
54:42 The role of infrastructure
56:15 Long-context and agents