
Title |
|---|
![]() When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided SearchNeural Information Processing Systems (NeurIPS), 2024 |
![]() Iterative Self-Tuning LLMs for Enhanced Jailbreaking CapabilitiesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 |
![]() Recent advancements in LLM Red-Teaming: Techniques, Defenses, and
Ethical Considerations Tarun Raheja Nilay Pochhi |