TalkRL: The Reinforcement Learning Podcast

Summary

Csaba Szepesvari of DeepMind shares his views on Bandits, Adversaries, PUCT in AlphaGo / AlphaZero / MuZero, AGI and RL, what is timeless, and more!

Show Notes

Csaba Szepesvari is:
  • Head of the Foundations Team at DeepMind
  • Professor of Computer Science at the University of Alberta
  • Canada CIFAR AI Chair
  • Fellow at the Alberta Machine Intelligence Institute 
  • Co-author of the book Bandit Algorithms (with Tor Lattimore) and author of the book Algorithms for Reinforcement Learning

What is TalkRL: The Reinforcement Learning Podcast?

TalkRL podcast is All Reinforcement Learning, All the Time.
In-depth interviews with brilliant people at the forefront of RL research and practice.
Guests from places like MILA, MIT, DeepMind, Amii, Google Brain, Brown, Caltech, Vector Institute and more.
Hosted by Robin Ranjit Singh Chauhan. Technical content.