AI Papers Podcast

As researchers uncover vulnerabilities in AI safety systems and warn about the environmental impact of large language models, an unexpected group emerges as the best defense against AI deception: frequent ChatGPT users. Meanwhile, innovative approaches to AI training through critique-based learning rather than imitation offer hope for developing more reliable and efficient AI systems, highlighting the complex balance between advancing AI technology and ensuring its responsible development.

Links to all the papers we discussed:

- Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate
- Atla Selene Mini: A General Purpose Evaluation Model
- Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts
- Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation
- Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation
- People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text

What is AI Papers Podcast?

A daily update on the latest AI research papers. We provide a high-level overview of a handful of papers each day and link all of them in the description for further reading. This podcast is created entirely with AI by PocketPod. Head over to https://pocketpod.app to learn more.