TalkRL: The Reinforcement Learning Podcast

Marlos C. Machado on Arcade Learning Environment Evaluation, Generalization and Exploration in RL, Eigenoptions, Autonomous navigation of stratospheric balloons with RL, and more!

Show Notes

Dr. Marlos C. Machado is a research scientist at DeepMind and an adjunct professor at the University of Alberta. He holds a PhD from the University of Alberta and an MSc and a BSc from UFMG in Brazil.

Featured References 

Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents 
Marlos C. Machado, Marc G. Bellemare, Erik Talvitie, Joel Veness, Matthew J. Hausknecht, Michael Bowling 

Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning
Rishabh Agarwal, Marlos C. Machado, Pablo Samuel Castro, Marc G. Bellemare 

Efficient Exploration in Reinforcement Learning through Time-Based Representations 
Marlos C. Machado 

A Laplacian Framework for Option Discovery in Reinforcement Learning
Marlos C. Machado, Marc G. Bellemare, Michael H. Bowling 

Eigenoption Discovery through the Deep Successor Representation 
Marlos C. Machado, Clemens Rosenbaum, Xiaoxiao Guo, Miao Liu, Gerald Tesauro, Murray Campbell 

Exploration in Reinforcement Learning with Deep Covering Options 
Yuu Jinnai, Jee Won Park, Marlos C. Machado, George Dimitri Konidaris 

Autonomous navigation of stratospheric balloons using reinforcement learning 
Marc G. Bellemare, Salvatore Candido, Pablo Samuel Castro, Jun Gong, Marlos C. Machado, Subhodeep Moitra, Sameera S. Ponda & Ziyu Wang 

Generalization and Regularization in DQN 
Jesse Farebrother, Marlos C. Machado, Michael Bowling 




Creators & Guests

Host
Robin Ranjit Singh Chauhan
🌱 Head of Eng @AgFunder 🧠 AI: Reinforcement Learning/ML/DL/NLP 🎙️ Host @TalkRLPodcast 💳 ex-@Microsoft ecomm PgmMgr 🤖 @UWaterloo CompEng 🇨🇦 🇮🇳

What is TalkRL: The Reinforcement Learning Podcast?

TalkRL podcast is All Reinforcement Learning, All the Time.
In-depth interviews with brilliant people at the forefront of RL research and practice.
Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute.
Hosted by Robin Ranjit Singh Chauhan.