Jailbreaking AI: Behind the Guardrails with Mozilla’s Marco Figueroa
In this episode of ‘Cyber Security Today,’ host Jim Love talks with Marco Figueroa, the Gen AI Bug Bounty Program Manager for Mozilla’s ODIN project. They explore how the guardrails in large language models like ChatGPT can be bypassed and why that is hard to prevent. Topics include jailbreaking, hexadecimal encoding, and techniques such as Deceptive Delight. Marco shares insights from his career, including his experiences at DEF CON, the NSA, McAfee, Intel, and SentinelOne. The conversation also covers Mozilla’s efforts to build a secure AI landscape through the ODIN bug bounty program and the future implications of AI vulnerabilities.
00:00 Introduction and Guest Introduction
00:22 Understanding Large Language Models and Jailbreaking
01:53 Recent Jailbreaking Techniques and Examples
04:42 Interview with Marco Figueroa: Career Journey
10:12 Marco’s Work at Mozilla and the ODIN Project
16:50 Exploring Prompt Injection and Hacking
23:21 Future of AI Security and Final Thoughts