SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models Paper • 2405.08317 • Published May 14 • 9
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts Paper • 2402.16822 • Published Feb 26 • 15
Emerging Vulnerabilities in Frontier Models: Multi-Turn Jailbreak Attacks Paper • 2409.00137 • Published Aug 29
GenTel-Safe: A Unified Benchmark and Shielding Framework for Defending Against Prompt Injection Attacks Paper • 2409.19521 • Published Sep 29