{"type":"rich","version":"1.0","provider_name":"Transistor","provider_url":"https://transistor.fm","author_name":"Embodied AI 101","title":"CaP-X: Coding Agents for Physical eXecution","html":"<iframe width=\"100%\" height=\"180\" frameborder=\"no\" scrolling=\"no\" seamless src=\"https://share.transistor.fm/e/8da560f3\"></iframe>","width":"100%","height":180,"duration":828,"description":"CaP-X is an open-source agentic robotics framework in which LLMs/VLMs generate code that calls perception and control APIs, running on diverse simulated and real robots across the 187 manipulation tasks of CaP-Gym. The framework also includes CaP-Bench, for evaluating frontier models, and CaP-RL, which raises a 7B model's success rate from 20% to 72% with a minimal sim-to-real gap.","thumbnail_url":"https://img.transistorcdn.com/l8CFsmXH35eVfcacIHndPwz_TJFZ0DzYC1nXc9Riruc/rs:fill:0:0:1/w:400/h:400/q:60/mb:500000/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS8wOGM3/YThiZDUxOTM4M2Vi/N2YzMTNkZDFiNDJh/ZDI1Mi5qcGc.webp","thumbnail_width":300,"thumbnail_height":300}