Beyond the Transformer: Titans, MIRAS, and the Future of Infinite Context

07/12/2025 38 min

Listen "Beyond the Transformer: Titans, MIRAS, and the Future of Infinite Context"

Episode Synopsis

We explore Google's Titans and the MIRAS framework, a new paradigm in sequence modeling that replaces static context compression with active test-time learning. We discuss how Titans use deep neural memory modules that update their parameters on the fly via a gradient-based "surprise metric," prioritizing unexpected information for long-term storage. We cover the theoretical MIRAS blueprint, which unifies sequence models through the lenses of attentional bias and retention gates and introduces robust new architectures such as Moneta, Yaad, and Memora. Finally, we discuss how these models scale to context windows exceeding 2 million tokens, outperforming GPT-4 and Mamba on complex long-context reasoning tasks.
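To make the surprise-driven update discussed in the episode concrete, here is a minimal sketch in PyTorch. It assumes a simple linear associative memory and illustrative hyperparameters (lr, eta, alpha) rather than the deep MLP memory and schedules described in the Titans paper: the memory's parameters move along the gradient of an associative recall loss (the "momentary surprise"), momentum carries past surprise, and a decay term plays the role of a forgetting gate.

```python
import torch

d = 64                                            # key/value dimension (assumed)
memory = torch.zeros(d, d, requires_grad=True)    # linear associative memory (stand-in for a deep memory module)
momentum = torch.zeros_like(memory)               # running "past surprise"

lr, eta, alpha = 0.1, 0.9, 0.01                   # step size, momentum decay, forgetting gate (illustrative values)

def update_memory(k_t, v_t):
    """One test-time step: write (k_t, v_t) into memory in proportion to how surprising it is."""
    global memory, momentum
    # Associative recall loss: how badly the current memory reconstructs v_t from k_t.
    loss = ((k_t @ memory - v_t) ** 2).mean()
    # "Momentary surprise" is the gradient of that loss w.r.t. the memory parameters.
    grad, = torch.autograd.grad(loss, memory)
    with torch.no_grad():
        momentum = eta * momentum - lr * grad             # accumulate past surprise
        memory = (1 - alpha) * memory + momentum          # decay old content, write new
    memory.requires_grad_(True)
    return loss.item()

# Example usage: stream token representations; surprising pairs change the memory the most.
for _ in range(5):
    k, v = torch.randn(d), torch.randn(d)
    print(update_memory(k, v))
```

In this reading, attention to "surprise" falls out naturally: pairs the memory already predicts well produce small gradients and barely change it, while unexpected pairs produce large gradients and get written in, which is what lets the memory stay useful over very long streams.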
