AI Papers Podcast

Today's breakthroughs are making artificial intelligence more human-like and, at the same time, surprisingly accessible to everyday researchers and creators. From language models that can process book-length texts, to speech language models that can be trained on a single GPU in a day, to cameras that can see colors more like human eyes do, we're witnessing a democratization of technology that once required massive computing resources and budgets.

Links to all the papers we discussed: Thus Spake Long-Context Large Language Model, VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing, DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks, Slamming: Training a Speech Language Model on One GPU in a Day, Audio-FLAN: A Preliminary Release, GCC: Generative Color Constancy via Diffusing a Color Checker.

What is AI Papers Podcast?

A daily update on the latest AI research papers. We provide a high-level overview of a handful of papers each day and link all papers in the description for further reading. This podcast is created entirely with AI by PocketPod. Head over to https://pocketpod.app to learn more.