{"type":"rich","version":"1.0","provider_name":"Transistor","provider_url":"https://transistor.fm","author_name":"Thinking Machines: AI & Philosophy","title":"On Adversarial Training & Robustness with Bhavna Gopal","html":"<iframe width=\"100%\" height=\"180\" frameborder=\"no\" scrolling=\"no\" seamless src=\"https://share.transistor.fm/e/c0385626\"></iframe>","width":"100%","height":180,"duration":2645,"description":"\"Understanding what's going on in a model is important to fine-tune it for specific tasks and to build trust.\" Bhavna Gopal is a PhD candidate at Duke, research intern at Slingshot with experience at Apple, Amazon and Vellum. We discuss: How adversarial robustness research impacts the field of AI explainability. How do you evaluate a model's ability to generalize? What adversarial attacks should we be concerned about with LLMs?","thumbnail_url":"https://img.transistorcdn.com/S6OjXZjcpOAZ6jDX4fc4XvtZpoBLPUwb1-xRPS1F5K0/rs:fill:0:0:1/w:400/h:400/q:60/mb:500000/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS9zaG93/LzQ1MTk5LzE3MDg3/MDIxODItYXJ0d29y/ay5qcGc.webp","thumbnail_width":300,"thumbnail_height":300}