{"type":"rich","version":"1.0","provider_name":"Transistor","provider_url":"https://transistor.fm","author_name":"Daily Security Review","title":"Policy Puppetry: How a Single Prompt Can Trick ChatGPT, Gemini & More Into Revealing Secrets","html":"<iframe width=\"100%\" height=\"180\" frameborder=\"no\" scrolling=\"no\" seamless src=\"https://share.transistor.fm/e/6c3e705e\"></iframe>","width":"100%","height":180,"duration":764,"description":"Recent research by HiddenLayer has uncovered a shocking new AI vulnerability—dubbed the \"Policy Puppetry Attack\"—that can bypass safety guardrails in all major LLMs, including ChatGPT, Gemini, Claude, and more.In this episode, we dive deep into:🔓 How a single, cleverly crafted prompt can trick AI into generating harmful content—from bomb-making guides to uranium enrichment.💻 The scary simplicity of system prompt extraction—how researchers (and hackers) can force AI to reveal its hidden instructions.🛡️ Why this flaw is \"systemic\" and nearly impossible to patch, exposing a fundamental weakness in how AI models are trained.⚖️ The ethical dilemma: Should AI be censored? Or is the real danger in what it can do, not just what it says?🔮 What this means for the future of AI security—and whether regulation can keep up with rapidly evolving threats.We’ll also explore slopsquatting, a new AI cyberattack where fake software libraries hallucinated by chatbots can lead users to malware.Is AI safety a lost cause? Or can developers outsmart the hackers? Tune in for a gripping discussion on the dark side of large language models.","thumbnail_url":"https://img.transistorcdn.com/pL79_MJFeJHamQ_ztImsGmDSMdl27VMk_30TAkieujE/rs:fill:0:0:1/w:400/h:400/q:60/mb:500000/aHR0cHM6Ly9pbWct/dXBsb2FkLXByb2R1/Y3Rpb24udHJhbnNp/c3Rvci5mbS8yNzg5/ZjlhNzM5Y2M4Njli/NjkxNzgyODA2Nzhi/MDI2ZC5wbmc.webp","thumbnail_width":300,"thumbnail_height":300}