Csaba Szepesvari

05/04/2020 48 min

Episode Synopsis

Csaba Szepesvari of DeepMind shares his views on Bandits, Adversaries, PUCT in AlphaGo / AlphaZero / MuZero, AGI and RL, what is timeless, and more!

Podcast: TalkRL: The Reinforcement Learning Podcast