All Papers
0 / 0 papers shown

![]() Exploiting AI for Attacks: On the Interplay between Adversarial AI and Offensive AIIEEE Intelligent Systems (IEEE Intell. Syst.), 2025 |
![]() PAM: Training Policy-Aligned Moderation Filters at ScaleLinguistics Vanguard (LV), 2024 |
One Model Transfer to All: On Robust Jailbreak Prompts Generation against LLMsInternational Conference on Learning Representations (ICLR), 2025 |