EP5: Speculative Decoding with Nadav Timor
Episode Synopsis
We discussed speculative decoding, an inference optimization technique, with Nadav Timor, a world-class researcher and expert in the field and a former coworker of the podcast hosts.
Papers and links:
Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies, Timor et al., ICML 2025, https://arxiv.org/abs/2502.05202
Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model Inference, Timor et al., ICLR 2025, https://arxiv.org/abs/2405.14105
Fast Inference from Transformers via Speculative Decoding, Leviathan et al., 2022, https://arxiv.org/abs/2211.17192
FinePDFs, https://huggingface.co/datasets/HuggingFaceFW/finepdfs
More episodes of the podcast The Information Bottleneck
EP20: Yann LeCun
15/12/2025
EP18: AI Robotics
01/12/2025
EP17: RL with Will Brown
24/11/2025
EP16: AI News and Papers
17/11/2025
EP14: AI News and Papers
10/11/2025