The Retort AI Podcast

In this episode, Tom gives us a lesson on all things feedback, focusing on where our scientific framings of it came from.
Together, we link this to RLHF, our previous work in RL, and how we were thinking about agentic ML systems before it was cool.
Join us for another great blast from the past on The Retort!
We have also brought you video this week!

Creators & Guests

Host
Nathan Lambert
RLHF researcher and author of the Interconnects.ai blog
Host
Thomas Krendl Gilbert
AI ethicist and co-host of The Retort.

What is The Retort AI Podcast?

Distilling the major events and challenges in the world of artificial intelligence and machine learning, from Thomas Krendl Gilbert and Nathan Lambert.