SEC.co Podcast

AI models can be silently fooled by attacks no human eye can detect — and the defenses aren't keeping up. This episode breaks down how adversarial machine learning works and what security teams need to do about it now.

Show Notes

Artificial intelligence has become a cornerstone of modern cybersecurity tooling — but it carries a category of vulnerability that most organizations are dangerously underprepared for. This episode of Cybersecurity examines adversarial machine learning: the discipline of deliberately manipulating AI models into making wrong decisions, often through changes so subtle that no human observer would notice them. Grounded in this seven-minute deep dive on how attackers manipulate AI models, the episode translates cutting-edge research into practical terms for security professionals and decision-makers alike.

The core of the conversation covers why AI models are structurally vulnerable — and what attackers are already doing to exploit that — across three major attack classes and two broad adversarial strategies:

Why AI is inherently exploitable: Machine learning models recognize statistical patterns, not meaning — a fundamental gap that adversarial techniques are specifically engineered to exploit.
White-box vs. black-box attacks: White-box attackers use full knowledge of a model's architecture to craft precise adversarial inputs; black-box attackers need only the model's outputs, iteratively refining their attacks using the system's own responses as feedback.
Evasion attacks: Inputs crafted at inference time to slip past deployed AI systems — already in active use against malware scanners and facial recognition.
Poisoning attacks: Corrupting training data before deployment so the model learns to behave in ways that serve the attacker — often undetectable until serious damage is done.
Model inversion and extraction: Techniques that let attackers reconstruct sensitive training data or clone a proprietary model entirely through carefully observed queries — no insider access required.
The state of defenses: Adversarial training and runtime detection both help, but neither is sufficient alone; the episode makes the case for layered controls, rigorous pre-deployment testing, training-data provenance checks, and mandatory human review at high-stakes decision points.

The episode closes with a direct challenge to any organization already running AI in security-critical workflows: adversarial manipulation is not a theoretical future risk — it is a live threat that sophisticated adversaries are actively exploring today. Treating AI as a tool with known failure modes, rather than an infallible oracle, is the mindset shift that separates resilient deployments from exposed ones.

SEC

What is SEC.co Podcast ?

A podcast about latest trends, techniques and learnings in cybersecurity and cyberdefense.

More episodes

Chapters

Show Notes

What is SEC.co Podcast ?