TalkRL: The Reinforcement Learning Podcast

Csaba Szepesvari of DeepMind shares his views on Bandits, Adversaries, PUCT in AlphaGo / AlphaZero / MuZero, AGI and RL, what is timeless, and more!

Csaba Szepesvari is:
  • Head of the Foundations Team at DeepMind
  • Professor of Computer Science at the University of Alberta
  • Canada CIFAR AI Chair
  • Fellow at the Alberta Machine Intelligence Institute 
  • Co-author, with Tor Lattimore, of the book Bandit Algorithms, and author of the book Algorithms for Reinforcement Learning

