Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.08660
Cited By
RePD: Defending Jailbreak Attack through a Retrieval-based Prompt Decomposition Process
11 October 2024
Peiran Wang
Xiaogeng Liu
Chaowei Xiao
AAML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RePD: Defending Jailbreak Attack through a Retrieval-based Prompt Decomposition Process"
2 / 2 papers shown
Title
JailbreaksOverTime: Detecting Jailbreak Attacks Under Distribution Shift
Julien Piet
Xiao Huang
Dennis Jacob
Annabella Chow
Maha Alrashed
Geng Zhao
Zhanhao Hu
Chawin Sitawarin
Basel Alomair
David A. Wagner
AAML
63
0
0
28 Apr 2025
EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety
Jiahao Qiu
Yinghui He
Xinzhe Juan
Y. Wang
Y. Liu
Zixin Yao
Yue Wu
Xun Jiang
L. Yang
Mengdi Wang
AI4MH
62
0
0
13 Apr 2025
1